v0.3 preview version: error when using the AMX yaml #617
Comments
Which model are you loading? I have the same problem. |
@moneyisallyouneed deepseek-r1-671B, 4-bit quantized |
I assume you mean exactly https://huggingface.co/unsloth/DeepSeek-R1-GGUF/tree/main/DeepSeek-R1-Q4_K_M? It may be that the mysterious v0.3 build does not work with GGUF and instead wants to use Intel AMX support to do online quantization from full bf16 into int8 and int4.
|
@ubergarm Thank you for your reply! I'll try this. |
The methods for building the version 0.3 Docker image and running it with one click have just been released. You are welcome to test them. |
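Does it work now? I have the same problem. |
@montagetao I'm still downloading the BF16 model, but the BF16 GGUF model is very large (about 1.2 TB), so I'm still working on the download, orz |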
You are a real hero! We are all waiting to hear what you find. Yes, @cunfate may also need to use this code to enable AMX by preloading a dynamic-link library that requests permission, as described in #320 and detailed in your guide. I like your enthusiasm! |
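Before trying a preload workaround, it can help to confirm that the CPU and kernel expose AMX at all. The following is a minimal sketch, not part of the project: it assumes a Linux host and checks `/proc/cpuinfo` for the `amx_tile`, `amx_int8`, and `amx_bf16` flags. The helper name `amx_flags_in` is hypothetical.

```python
from pathlib import Path

# CPU feature flags that Linux reports in /proc/cpuinfo for Intel AMX.
AMX_FLAGS = ("amx_tile", "amx_int8", "amx_bf16")


def amx_flags_in(cpuinfo_text: str) -> dict[str, bool]:
    """Return which AMX flags appear in /proc/cpuinfo-style text.

    Hypothetical helper for illustration; it only reads the "flags" lines.
    """
    flags: set[str] = set()
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            flags.update(value.split())
    return {name: name in flags for name in AMX_FLAGS}


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        print(amx_flags_in(cpuinfo.read_text()))
    else:
        print("no /proc/cpuinfo found; this check only applies to Linux")
```

If any flag is reported as missing, the CPU or kernel does not advertise that AMX feature, and a permission workaround alone is unlikely to help.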
Where does version 0.3 come from? I don't see anything related to 0.3 in the repository... |
The wheel and source code have not been released yet. |
command: