Add OpenAI Reinforcement Fine-Tuning (RFT) Support #3060

anndvision · 2025-08-11T21:03:44Z

This PR implements support for OpenAI's Reinforcement Fine-Tuning (RFT) API.

Core Implementation:

OpenAI RFT configuration with graders, hyperparameters, and training options
Content block conversion from ContentBlockChatOutput to OpenAIReinforcementOutput
Multiple grader types: StringCheck, TextSimilarity, ScoreModel, LabelModel, Python, and Multi
Template system integration via TomlRelativePath

Testing

Unit tests for conversion logic with tool calls
Mock and live testing infrastructure
Multi-grader configurations with StringCheck and ScoreModel
Template variable resolution validation

TODO

UI
Python

…penai-rft

… parameters

…penai-rft

anndvision added 6 commits August 11, 2025 13:34

add core oenai rft functionality

8b8f762

test conversion to RFTRow

e834c29

add live and mock rft tests

92b88b4

add developer field to OpenAIRequestMessage

f2a5641

flatten output

ab3e445

update model grader input

ffc1bae

anndvision requested review from Aaron1011 and virajmehta August 11, 2025 21:03

anndvision added 13 commits August 11, 2025 18:05

handle json mode for system / developer messages

b646247

clean Grader serialization and fix for http client

a286c10

Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/o…

bc582fd

…penai-rft

build bindings

16e52b5

add python types

b968baa

Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/o…

c8314cb

…penai-rft

make RFT discoverable

6385eb9

add dictionary support for OpenAIRFTConfig grader and response_format…

c39e54b

… parameters

refactor grader -> openaigrader

9462e8e

OpenAIRFTCompatibleResponseFormat → OpenAIRFTResponseFormat

5fa5100

Merge branch 'main' of github.com:tensorzero/tensorzero into andrew/o…

bc58c02

…penai-rft

update stub

396d122

simplify python

ad8f796

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add OpenAI Reinforcement Fine-Tuning (RFT) Support #3060

Add OpenAI Reinforcement Fine-Tuning (RFT) Support #3060

anndvision commented Aug 11, 2025

Uh oh!

Uh oh!

Add OpenAI Reinforcement Fine-Tuning (RFT) Support #3060

Are you sure you want to change the base?

Add OpenAI Reinforcement Fine-Tuning (RFT) Support #3060

Conversation

anndvision commented Aug 11, 2025

Core Implementation:

Testing

TODO

Uh oh!

Uh oh!