-
Notifications
You must be signed in to change notification settings - Fork 11.8k
Bug: Adreno740 GPU device can't load model in Android system #8965
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Update in 2024/09/13: finallly I CAN use Vulkan backend in Android System with Mali GPU:
BUT the Performance is terrible,I don't know why.I tried other Project such as MediaPipe,MLC_LLM,they can run in GPU and work perfectly.while llama.cpp is bad in this situation. |
More than half the model is running on the CPU:
Try to offload all layers to the GPU |
thank you sir,I tried to offload all layers to GPU:
and I test:
With PURE CPU it will cost 1700ms. and I also found sometimes it will crash:
the crash is happened in |
This issue was closed because it has been inactive for 14 days since being marked as stale. |
What happened?
I tried to run llama.cpp in Samsug Galaxy Tab S9 Ultra,the Android System is Android13.and I have compiled these libraries accoding the guide.I used these libraries in my APK and when I load model it met a fatal crash.
Name and Version
tag:3400,commit:97bdd26e,support GPU acceleration:true
What operating system are you seeing the problem on?
Other? (Please let us know in description)
Relevant log output
The text was updated successfully, but these errors were encountered: