Skip to content

Commit c28dfd6

Browse files
committed
rft_cookbook material
1 parent a8ee5d9 commit c28dfd6

File tree

2 files changed

+15
-1
lines changed

2 files changed

+15
-1
lines changed

examples/Reinforcement_Fine_Tuning.ipynb

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,11 @@
44
"cell_type": "markdown",
55
"metadata": {},
66
"source": [
7+
<<<<<<< HEAD
78
"# **Reinforcement Fine-Tuning on the OpenAI Platform**\n",
9+
=======
10+
"# **Exploring Model Graders for Reinforcement Fine-Tuning**\n",
11+
>>>>>>> c715357 (rft_cookbook material)
812
"\n",
913
"*This guide is for developers and ML practitioners who already know their way around OpenAIʼs APIs, have a basic understanding of reinforcement fine-tuning (RFT), and wish to use their fine-tuned models for research or other appropriate uses. OpenAI’s services are not intended for the personalized treatment or diagnosis of any medical condition and are subject to our [applicable terms](https://openai.com/policies/).*\n",
1014
"\n",

registry.yaml

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
# should build pages for, and indicates metadata such as tags, creation date and
55
# authors for each page.
66

7-
- title: Reinforcement Fine-Tuning on the OpenAI Platform
7+
- title: Exploring Model Graders for Reinforcement Fine-Tuning
88
path: examples/Reinforcement_Fine_Tuning.ipynb
99
date: 2025-05-23
1010
authors:
@@ -14,6 +14,16 @@
1414
- fine-tuning
1515
- reinforcement-learning-graders
1616

17+
- title: Reinforcement Fine-tuning with the OpenAI API
18+
path: examples/fine-tuned_qa/reinforcement_finetuning_healthbench.ipynb
19+
date: 2025-05-21
20+
authors:
21+
- theophile-openai
22+
tags:
23+
- reinforcement-learning
24+
- fine-tuning
25+
- reinforcement-learning-graders
26+
1727
- title: Guide to Using the Responses API's MCP Tool
1828
path: examples/mcp/mcp_tool_guide.ipynb
1929
date: 2025-05-21

0 commit comments

Comments
 (0)