llama : add `struct llama_vocab` to the API #11156

ggerganov · 2025-01-09T13:31:58Z

target #11110

patch df1c467

This change decouples the llama_vocab from the llama_model.

API changes

llama_n_vocab() now accepts struct llama_vocab instead of struct llama_model
llama_sampler_init_dry() now accepts struct llama_vocab instead of struct llama_model
The tokenization API now accepts struct llama_vocab instead of struct llama_model
The sampler API now accepts struct llama_vocab instead of struct llama_model

ggml-ci

* llama : functions -> methods (#11110) * llama : add struct llama_vocab to the API (#11156) ggml-ci * hparams : move vocab params to llama_vocab (#11159) ggml-ci * vocab : more pimpl (#11165) ggml-ci * vocab : minor tokenization optimizations (#11160) ggml-ci Co-authored-by: Diego Devesa <slarengh@gmail.com> * lora : update API names (#11167) ggml-ci * llama : update API names to use correct prefix (#11174) * llama : update API names to use correct prefix ggml-ci * cont ggml-ci * cont ggml-ci * minor [no ci] * vocab : llama_vocab_add_[be]os -> llama_vocab_get_add_[be]os (#11174) ggml-ci * vocab : llama_vocab_n_vocab -> llama_vocab_n_tokens (#11174) ggml-ci --------- Co-authored-by: Diego Devesa <slarengh@gmail.com>

* llama : functions -> methods (ggml-org#11110) * llama : add struct llama_vocab to the API (ggml-org#11156) ggml-ci * hparams : move vocab params to llama_vocab (ggml-org#11159) ggml-ci * vocab : more pimpl (ggml-org#11165) ggml-ci * vocab : minor tokenization optimizations (ggml-org#11160) ggml-ci Co-authored-by: Diego Devesa <slarengh@gmail.com> * lora : update API names (ggml-org#11167) ggml-ci * llama : update API names to use correct prefix (ggml-org#11174) * llama : update API names to use correct prefix ggml-ci * cont ggml-ci * cont ggml-ci * minor [no ci] * vocab : llama_vocab_add_[be]os -> llama_vocab_get_add_[be]os (ggml-org#11174) ggml-ci * vocab : llama_vocab_n_vocab -> llama_vocab_n_tokens (ggml-org#11174) ggml-ci --------- Co-authored-by: Diego Devesa <slarengh@gmail.com>

ggerganov requested a review from ngxson as a code owner January 9, 2025 13:31

github-actions bot added testing Everything test related examples server android Issues specific to Android python python script changes labels Jan 9, 2025

ggerganov added the breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. label Jan 9, 2025

ggerganov mentioned this pull request Jan 9, 2025

hparams : move vocab params to llama_vocab #11159

Merged

ngxson approved these changes Jan 9, 2025

View reviewed changes

ggerganov force-pushed the gg/llama-refactor-7 branch from c89e808 to bfe781a Compare January 10, 2025 08:25

ggerganov added 3 commits January 10, 2025 11:02

llama : add struct llama_vocab to the API (#11156)

df1c467

ggml-ci

hparams : move vocab params to llama_vocab (#11159)

446fec5

ggml-ci

vocab : more pimpl (#11165)

b48d763

ggml-ci

ggerganov force-pushed the gg/llama-refactor-8 branch from 06ae9ae to b48d763 Compare January 10, 2025 09:07

ggerganov merged commit 1d1f264 into gg/llama-refactor-7 Jan 10, 2025
54 of 57 checks passed

ggerganov added a commit that referenced this pull request Jan 10, 2025

llama : add struct llama_vocab to the API (#11156)

1439bad

ggml-ci

ggerganov deleted the gg/llama-refactor-8 branch January 10, 2025 09:21

ggerganov added a commit that referenced this pull request Jan 10, 2025

llama : add struct llama_vocab to the API (#11156)

c725f69

ggml-ci

ggerganov mentioned this pull request Jan 11, 2025

llama : add llama_vocab, functions -> methods, naming #11110

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : add `struct llama_vocab` to the API #11156

llama : add `struct llama_vocab` to the API #11156

ggerganov commented Jan 9, 2025 •

edited

Loading

llama : add struct llama_vocab to the API #11156

llama : add struct llama_vocab to the API #11156

Conversation

ggerganov commented Jan 9, 2025 • edited Loading

API changes

llama : add `struct llama_vocab` to the API #11156

llama : add `struct llama_vocab` to the API #11156

ggerganov commented Jan 9, 2025 •

edited

Loading