You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
printf(" -gan N, --grp-attn-n N set the group attention factor to extend context size through self-extend(default: 1=disabled), used together with group attention width `--grp-attn-w`");
1939
1939
printf(" -gaw N, --grp-attn-w N set the group attention width to extend context size through self-extend(default: 512), used together with group attention factor `--grp-attn-n`");
1940
-
printf(" --chat-template FORMAT_NAME");
1941
-
printf(" set chat template, possible value is: llama2, chatml (default %s)", sparams.chat_template.c_str());
1940
+
printf(" --chat-template JINJA_TEMPLATE\n");
1941
+
printf(" set custom jinja chat template (default: template taken from model's metadata)\n");
1942
+
printf(" Note: only commonly used templates are accepted, since we don't have jinja parser\n");
0 commit comments