Skip to content

Commit c8ddce8

Browse files
authored
Fix inference example lacks required parameters (#9035)
Signed-off-by: Aisuko <urakiny@gmail.com>
1 parent 23fd453 commit c8ddce8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/quantize/README.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ Run the quantized model:
3434

3535
```bash
3636
# start inference on a gguf model
37-
./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -n 128
37+
./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -cnv -p "You are a helpful assistant"
3838
```
3939

4040
When running the larger models, make sure you have enough disk space to store all the intermediate files.

0 commit comments

Comments
 (0)