@twaka twaka commented Apr 8, 2024

Hi,
I've added a min_tokens argument, which sets the EOS token's logit to -inf via a logits processor while the number of generated tokens is smaller than min_tokens.
Note that this implementation doesn't prevent generation from stopping for another reason (e.g. max_tokens, stop, stopping_criteria).
Completes #240
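The approach described above can be sketched as a standalone logits processor. This is a hypothetical minimal version, not the exact PR code; the names `make_min_tokens_processor`, `eos_token_id`, and `prompt_length` are illustrative assumptions:

```python
import numpy as np

def make_min_tokens_processor(min_tokens, eos_token_id, prompt_length):
    """Return a logits processor that forbids sampling EOS until
    min_tokens completion tokens have been generated (sketch only)."""
    def processor(input_ids, scores):
        # Completion length = total tokens so far minus the prompt.
        generated = len(input_ids) - prompt_length
        if generated < min_tokens:
            # Make EOS unsampleable; other stop conditions
            # (max_tokens, stop strings, ...) still apply.
            scores[eos_token_id] = -float("inf")
        return scores
    return processor
```

As the note above says, this only blocks the EOS token itself; generation can still end early through max_tokens or stop sequences.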


abetlen commented Apr 17, 2024

Hey @twaka, thank you for the contribution. I'd prefer to expose a MinTokensLogitsProcessor instead of adding it as an argument.


twaka commented Apr 17, 2024

Thank you for taking a look!
One problem I can think of is that MinTokensLogitsProcessor needs the length of the prompt tokens.
That length is not straightforward to obtain before tokenization happens in llama_cpp._create_completion, especially for chat completions.


twaka commented May 8, 2024

@abetlen Sorry for the delay. I've changed the implementation as you suggested. Could you please take a look?
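One way to sidestep the prompt-length problem raised earlier is to capture the prompt length lazily on the processor's first call. This is a hedged sketch of that idea, assuming the processor is first invoked with only the prompt tokens; the class and attribute names are illustrative and may not match the merged code exactly:

```python
import numpy as np

class MinTokensLogitsProcessor:
    """Suppress EOS until min_tokens completion tokens exist.

    Illustrative sketch: the prompt length is recorded on the first
    call, so callers need not know it ahead of tokenization.
    """

    def __init__(self, min_tokens: int, token_eos: int):
        self.min_tokens = min_tokens
        self.token_eos = token_eos
        self.prompt_tokens = None  # filled in on the first call

    def __call__(self, input_ids, scores):
        if self.prompt_tokens is None:
            # First call sees only the prompt tokens.
            self.prompt_tokens = len(input_ids)
        if len(input_ids) - self.prompt_tokens < self.min_tokens:
            scores[self.token_eos] = -float("inf")
        return scores
```

Because the state lives inside the processor, it can be constructed up front and passed in without knowing where tokenization happens.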


abetlen commented May 14, 2024

Hey @twaka sorry for the delay, and thank you for implementing the change, happy to merge this now!

@abetlen abetlen merged commit 5212fb0 into abetlen:main May 14, 2024
@twaka twaka deleted the min_tokens branch May 15, 2024 05:17