feat: #1614 gpt-realtime migration (Realtime API GA) #1646

seratch · 2025-09-03T06:13:00Z

This is still in progress, but it will resolve #1614.

rm-openai · 2025-09-03T16:33:08Z

examples/realtime/app/server.py

+        # Disable server-side interrupt_response to avoid truncating assistant audio
+        session_context = await runner.run(
+            model_config={
+                "initial_model_settings": {
+                    "turn_detection": {"type": "semantic_vad", "interrupt_response": False}
+                }
+            }
+        )


do we need to do this by default? why?

seratch · 2025-09-04

I explored some changes to improve the audio output quality, but they're not related to the gpt-realtime migration, so I've reverted all of them. I will keep looking into improvements for this example app, but that can be done in a separate pull request.

rm-openai · 2025-09-03T16:33:34Z

examples/realtime/app/server.py

@@ -93,7 +111,9 @@ async def _serialize_event(self, event: RealtimeSessionEvent) -> dict[str, Any]:
            base_event["tool"] = event.tool.name
            base_event["output"] = str(event.output)
        elif event.type == "audio":
-            base_event["audio"] = base64.b64encode(event.audio.data).decode("utf-8")
+            # Coalesce raw PCM and flush on a steady timer for smoother playback.


is this just a quality improvement? would be nice to make it be a separate PR if so

seratch · 2025-09-04

Yeah, same as above (I won't repeat this for the rest).
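For readers following the thread, the "coalesce and flush on a steady timer" idea from the diff comment can be sketched roughly as below. This is not the code from the PR (those changes were reverted); `PcmCoalescer`, `flush_periodically`, the `send` callback, and the 40 ms interval are all hypothetical names and values chosen for illustration.

```python
import asyncio
import base64
from typing import Awaitable, Callable, Optional


class PcmCoalescer:
    """Hypothetical helper: accumulates raw PCM chunks into one payload."""

    def __init__(self) -> None:
        self._buffer = bytearray()

    def add(self, chunk: bytes) -> None:
        # Append the incoming chunk; nothing is sent until flush() is called.
        self._buffer.extend(chunk)

    def flush(self) -> Optional[str]:
        """Return buffered audio as base64 text, or None when nothing is pending."""
        if not self._buffer:
            return None
        payload = base64.b64encode(bytes(self._buffer)).decode("utf-8")
        self._buffer.clear()
        return payload


async def flush_periodically(
    coalescer: PcmCoalescer,
    send: Callable[[str], Awaitable[None]],
    interval_s: float = 0.04,  # assumed interval, not taken from the PR
) -> None:
    """Flush on a fixed interval until the task is cancelled."""
    while True:
        await asyncio.sleep(interval_s)
        payload = coalescer.flush()
        if payload is not None:
            await send(payload)
```

Compared with the original one-message-per-event `base64.b64encode(event.audio.data)`, a buffer like this trades a small amount of latency for fewer, more evenly spaced messages, which is presumably why it was treated as a quality change rather than part of the migration.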

rm-openai · 2025-09-03T16:33:47Z

examples/realtime/app/server.py

-                "type": event.data.type,
-            }
+            # Surface useful raw events to the UI with details.
+            if getattr(event.data, "type", None) == "transcript_delta":


plz no getattr
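One way to address this, sketched under the assumption that the raw event payloads are typed objects that always carry a `type` field: read the attribute directly and narrow with `isinstance` instead of probing with `getattr`. The classes `TranscriptDelta` and `OtherRawEvent` and the function `serialize_raw` below are hypothetical stand-ins, not the SDK's actual types.

```python
from dataclasses import dataclass
from typing import Any, Union


@dataclass
class TranscriptDelta:
    """Hypothetical stand-in for a raw transcript delta event."""

    delta: str
    type: str = "transcript_delta"


@dataclass
class OtherRawEvent:
    """Hypothetical stand-in for any other raw event."""

    type: str = "other"


def serialize_raw(data: Union[TranscriptDelta, OtherRawEvent]) -> dict[str, Any]:
    # Every event type declares `type`, so plain attribute access is safe
    # and a type checker can verify it.
    payload: dict[str, Any] = {"type": data.type}
    # isinstance narrows the union, so `data.delta` is known to exist here.
    if isinstance(data, TranscriptDelta):
        payload["delta"] = data.delta
    return payload
```

With this shape, a misspelled or missing attribute fails loudly (or is caught by the type checker) instead of silently falling through to the `None` default.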

rm-openai · 2025-09-03T16:34:14Z

examples/realtime/app/server.py

@@ -142,7 +195,8 @@ async def websocket_endpoint(websocket: WebSocket, session_id: str):
            if message["type"] == "audio":
                # Convert int16 array to bytes
                int16_data = message["data"]
-                audio_bytes = struct.pack(f"{len(int16_data)}h", *int16_data)
+                # Send little-endian PCM16 to the model.


did this change as part of the GA?
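The thread does not answer whether this changed with the GA. For context, the sketch below only shows what differs between the two `struct` format strings: a bare `h` packs with the host's native byte order, while `<h` always produces little-endian PCM16. The helper name `pack_pcm16_le` and the sample values are illustrative.

```python
import struct
import sys


def pack_pcm16_le(samples: list[int]) -> bytes:
    """Pack int16 samples as little-endian PCM16 regardless of host byte order."""
    # "<" forces little-endian; a bare "h" would use the host's native order.
    return struct.pack(f"<{len(samples)}h", *samples)


samples = [0, 1, -1, 32767, -32768]
le_bytes = pack_pcm16_le(samples)

# The original expression from the diff, using native byte order.
native_bytes = struct.pack(f"{len(samples)}h", *samples)

# On a little-endian host (x86, most ARM) both encodings are identical,
# so the explicit "<" only changes behavior on big-endian machines.
same_on_this_host = le_bytes == native_bytes
```

In other words, on typical server hardware the two lines send the same bytes; the explicit prefix just makes the wire format independent of the machine running the example.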

seratch requested a review from rm-openai September 3, 2025 06:13

seratch added the enhancement and feature:realtime labels Sep 3, 2025

rm-openai reviewed Sep 3, 2025


seratch force-pushed the realtime-ga branch from ce4b33d to a4333dd September 4, 2025 10:19

Migrate to gpt-realtime model

f02b096

seratch force-pushed the realtime-ga branch from a4333dd to f02b096 September 4, 2025 10:20

seratch marked this pull request as ready for review September 4, 2025 10:22
