rft_cookbook material

theophile-oai · theophile-oai · commit c28dfd6e1ed5 · 2025-05-23T18:32:15.000+02:00
diff --git a/examples/Reinforcement_Fine_Tuning.ipynb b/examples/Reinforcement_Fine_Tuning.ipynb
@@ -4,7 +4,11 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
+<<<<<<< HEAD
     "# **Reinforcement Fine-Tuning on the OpenAI Platform**\n",
+=======
+    "# **Exploring Model Graders for Reinforcement Fine-Tuning**\n",
+>>>>>>> c715357 (rft_cookbook material)
     "\n",
     "*This guide is for developers and ML practitioners who already know their way around OpenAIʼs APIs, have a basic understanding of reinforcement fine-tuning (RFT), and wish to use their fine-tuned models for research or other appropriate uses. OpenAI’s services are not intended for the personalized treatment or diagnosis of any medical condition and are subject to our [applicable terms](https://openai.com/policies/).*\n",
     "\n",
diff --git a/registry.yaml b/registry.yaml
@@ -4,7 +4,7 @@
 # should build pages for, and indicates metadata such as tags, creation date and
 # authors for each page.
 
-- title: Reinforcement Fine-Tuning on the OpenAI Platform
+- title: Exploring Model Graders for Reinforcement Fine-Tuning
   path: examples/Reinforcement_Fine_Tuning.ipynb
   date: 2025-05-23
   authors:
@@ -14,6 +14,16 @@
     - fine-tuning
     - reinforcement-learning-graders
 
+- title: Reinforcement Fine-tuning with the OpenAI API
+  path: examples/fine-tuned_qa/reinforcement_finetuning_healthbench.ipynb
+  date: 2025-05-21
+  authors:
+    - theophile-openai
+  tags:
+    - reinforcement-learning
+    - fine-tuning
+    - reinforcement-learning-graders
+
 - title: Guide to Using the Responses API's MCP Tool 
   path: examples/mcp/mcp_tool_guide.ipynb
   date: 2025-05-21