How should TensorZero handle reasoning models? #816

virajmehta · 2025-01-21T19:45:53Z

virajmehta
Jan 21, 2025
Maintainer

Now that there are several reasoning models available it seems worth developing the abstractions for models which return thoughts alongside an answer. Here are the current implementations:

OpenAI O1: Standard chat completions API with some limitations. Reasoning is unavailable.
Gemini Flash thinking: returns multiple text parts with part.thought=True for one.
QwQ puts the answer in a LaTeX box (usually) and generates all the text in one undelimited string
DeepSeek-R1 has messages with content and reasoning_content fields which give an answer as well as intermediate reasoning.

My proposal: add a ContentBlock::Thought type that contains text and return it alongside all the other content blocks and store it alongside other content blocks. For models like O1, just don't include it. For models like Gemini and R1 where we can programmatically grab the reasoning, include the blocks in the store and returned responses. For models like QwQ, don't attempt to parse the output and just return everything as text.

In the future we might adopt formats for our own trained models to "escape" reasoning from the generation. We should easily be able to parse out a <thought> ... </thought> block (for example) and do the same behavior as Gemini / R1. This sets the stage nicely also for a Chain of Thought variant type.

GabrielBianconi · 2025-01-21T19:47:18Z

GabrielBianconi
Jan 21, 2025
Maintainer

lgtm

0 replies

nikcaryo-super · 2025-01-23T16:55:27Z

nikcaryo-super
Jan 23, 2025

thought content blocks make sense 👍

0 replies

ckblockit · 2025-01-26T01:39:00Z

ckblockit
Jan 26, 2025

We are also exploring this route for Claude.
When we use the Anthropic's prompt improver, it automatically adds an instruction to put chain of thoughts within some XML tag.
Our use case is to return a structured data, but also want to leverage this long chain of thought. Then the output usually look like

<custom_xml_tag_for_reasoning>
long rationale, which we might want to later use it for reviewing the output
</custom_xml_tag_for_reasoning>

actual JSON output we are interested

Some generalized way to parse this type of output would be useful.
We were thinking of chaining gpt-4o-mini at the end to "enforce some strucutred output" + maybe additional fields.

1 reply

GabrielBianconi Jan 26, 2025
Maintainer

Thanks for the feedback @ckblockit!

As an intermediate solution for models that don't have explicit reasoning functionality (e.g. Claude), you could request a JSON output (T0 JSON Function) with a "reasoning" field as the first field the output schema. Have you tried something like that?

For example, if you were hoping to generate a boolean score, you could prepend this field as follows:

{
  "type": "object",
  "properties": {
    "reasoning": {
      "type": "string"
    },
    "score": {
      "type": "boolean"
    }
  },
  "required": ["reasoning", "score"],
  "additionalProperties": false
}

What do you think?

GabrielBianconi · 2025-08-07T03:24:49Z

GabrielBianconi
Aug 7, 2025
Maintainer

Closing: this was implemented a while ago.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How should TensorZero handle reasoning models? #816

Uh oh!

{{title}}

Uh oh!

Replies: 4 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How should TensorZero handle reasoning models? #816

Uh oh!

virajmehta Jan 21, 2025 Maintainer

Replies: 4 comments · 1 reply

Uh oh!

GabrielBianconi Jan 21, 2025 Maintainer

Uh oh!

nikcaryo-super Jan 23, 2025

Uh oh!

ckblockit Jan 26, 2025

Uh oh!

GabrielBianconi Jan 26, 2025 Maintainer

Uh oh!

GabrielBianconi Aug 7, 2025 Maintainer

virajmehta
Jan 21, 2025
Maintainer

Replies: 4 comments 1 reply

GabrielBianconi
Jan 21, 2025
Maintainer

nikcaryo-super
Jan 23, 2025

ckblockit
Jan 26, 2025

GabrielBianconi Jan 26, 2025
Maintainer

GabrielBianconi
Aug 7, 2025
Maintainer