Automatically set chat format from gguf #1110

abetlen · 2024-01-19T20:06:22Z

Closes #1096

Okay so for now this is going to be the behaviour / priority of the automatic chat format detection

Use provided chat_handler param
Use provided chat_format param
If model has a gguf chat_template try to string match it against known jinja2 templates and use the matching template's eos token
If model has a gguf chat_template use eos_token for stop generation
Use llama 2 chat format

Use jinja formatter to load chat format from gguf

d0fc83c

abetlen mentioned this pull request Jan 19, 2024

Use chat_template from gguf metadata #1096

Closed

abetlen added 3 commits January 29, 2024 13:59

Merge branch 'main' into chat-format-from-gguf

0d8ef95

Fix off-by-one error in metadata loader

321b46e

Implement chat format auto-detection

c2839f6

abetlen merged commit da003d8 into main Jan 29, 2024

abetlen deleted the chat-format-from-gguf branch January 31, 2024 20:27

Provide feedback