Small updates to allow for `gpt-oss` generation #2049

iamlemec · 2025-08-06T16:27:56Z

Compatibility updates for gpt-oss generation:

Seems like seq_rm doesn't accept -1 anymore, so just use seq_id = 0 for generation (is this fine for the server?)
The gpt-oss template uses strftime_now in the template, so provide that function

Of course, this also needs a llama.cpp version bump (not included here) for the new model architecture.

abetlen · 2025-08-07T13:48:43Z

Thank you @iamlemec !

always use seq_id=0 for generation; provide strftime_now to templates

8ba2278

abetlen closed this Aug 7, 2025

Provide feedback