How to access token activations prior to softmax #15645

stingpie · 2025-08-28T19:10:41Z

stingpie
Aug 28, 2025

Hey, I'm playing around with some interesting but unpredictable models (this one, specifically) and it displays very strong creativity and good prose, but it's also prone to breaking. So, I had the idea to do a "poor man's ensemble", and add the raw token activations of a less creative, but more consistent model with the creative one, and then apply softmax. Hopefully, this will maintain some degree of creativity while being more consistent.

I've looked through the code, but the codebase is already large enough to be impenetrable for newcomers. Does this feature already exist? If not, is there some way I can get around it? The best thing I've found are the layer by layer callbacks, but I'm not sure exactly how to use those.

am17an · 2025-08-29T02:25:39Z

am17an
Aug 29, 2025
Collaborator

After llama_decode, you can call llama_get_logits, which should return the activations prior to any normalizations

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to access token activations prior to softmax #15645

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How to access token activations prior to softmax #15645

Uh oh!

stingpie Aug 28, 2025

Replies: 1 comment

Uh oh!

am17an Aug 29, 2025 Collaborator

stingpie
Aug 28, 2025

am17an
Aug 29, 2025
Collaborator