Issue: HuggingFace Documentation Refers to Outdated Binaries #8659

Closed
HanClinto opened this issue Jul 23, 2024 · 3 comments
Labels
bug-unconfirmed, low severity, stale

Comments

@HanClinto
Collaborator

What happened?

This isn't a problem that we control, but if someone knows whom to contact at Hugging Face, it would be good to get the llama.cpp instructions behind the "Use This Model" button updated to use the new binary names from #7809.

Example:

From a GGUF'd model page (such as this one), click "Use This Model", then "llama.cpp", and you'll see a screen like the following:

Screenshot 2024-07-23 at 4 53 15 PM

# Option 1: use llama.cpp with brew
brew install llama.cpp

# Load and run the model
llama \
	--hf-repo "QuantFactory/Mistral-Nemo-Instruct-2407-GGUF" \
	--hf-file Mistral-Nemo-Instruct-2407.Q2_K.gguf \
	-p "I believe the meaning of life is" \
	-n 128

# Option 2: build llama.cpp from source with curl support
git clone https://github.com/ggerganov/llama.cpp.git 
cd llama.cpp
LLAMA_CURL=1 make

# Load and run the model
./main \
	--hf-repo "QuantFactory/Mistral-Nemo-Instruct-2407-GGUF" \
	-m Mistral-Nemo-Instruct-2407.Q2_K.gguf \
	-p "I believe the meaning of life is" \
	-n 128

The instructions in "Option 2" should be updated to be llama-cli instead of main.
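As a rough sketch of the requested change, the snippet below rewrites the outdated "Option 2" command so that it calls `llama-cli` instead of `main`. Only the `main` to `llama-cli` mapping comes from this issue; the `update_binary_name` helper is hypothetical and is not part of llama.cpp or of the Hugging Face snippet generator. The command is only printed, not executed, and its flags are left exactly as quoted above.

```shell
# Hypothetical helper (not part of llama.cpp): rewrite the pre-rename binary
# name in a command snippet read from stdin.
# Assumption taken from this issue: "main" is now called "llama-cli".
update_binary_name() {
  # Match "./main" only when followed by whitespace or end of line, so that
  # longer names that merely start with "main" are left untouched.
  sed -E 's#\./main([[:space:]]|$)#./llama-cli\1#'
}

# Feed the outdated "Option 2" command through the helper and print the result.
# The quoted heredoc delimiter keeps the backslashes literal.
update_binary_name <<'EOF'
./main \
	--hf-repo "QuantFactory/Mistral-Nemo-Instruct-2407-GGUF" \
	-m Mistral-Nemo-Instruct-2407.Q2_K.gguf \
	-p "I believe the meaning of life is" \
	-n 128
EOF
```

The printed command differs from the original only in its first line, which now starts with `./llama-cli`.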

It's not a huge deal, but I looked through the HF Repositories list and it wasn't immediately obvious which one this should be reported in.

I mainly wanted to report this here so that we have a place to track it.

Name and Version

n/a

What operating system are you seeing the problem on?

No response

Relevant log output

No response

@HanClinto added the bug-unconfirmed and low severity labels Jul 23, 2024
@ngxson
Collaborator

ngxson commented Jul 23, 2024

Thanks for the reminder. I actually opened a PR on Hugging Face to resolve this issue, but I've been too busy recently and almost forgot about it: huggingface/huggingface.js#778

@oldgithubman

oldgithubman commented Jul 24, 2024

This isn't a problem that we control

False.

@github-actions github-actions bot added the stale label Aug 23, 2024
@github-actions github-actions bot removed the stale label Sep 1, 2024
@github-actions github-actions bot added the stale label Oct 1, 2024
Contributor

This issue was closed because it has been inactive for 14 days since being marked as stale.
