feat(agents): Add on_llm_start and on_llm_end Lifecycle Hooks #987

Open · wants to merge 6 commits into main

Conversation


@uzair330 commented on Jul 1, 2025

Motivation

Currently, AgentHooks provides valuable lifecycle events for the start/end of an agent run and for tool execution (on_tool_start/on_tool_end). However, developers have no way to observe the agent's execution at the language-model level.

This PR introduces two new hooks, on_llm_start and on_llm_end, to provide this deeper level of observability. This change enables several key use cases:

  • Performance Monitoring: Precisely measure the latency of LLM calls.
  • Debugging & Logging: Log the exact prompts sent to and raw responses received from the model.
  • Implementing Custom Logic: Trigger actions (e.g., updating a UI, saving state) immediately before or after the agent "thinks."

Summary of Changes

  • src/agents/lifecycle.py
    Added two new async methods, on_llm_start and on_llm_end, to the AgentHooks base class, matching the existing on_*_start/on_*_end pattern.

  • src/agents/run.py
    Wrapped the call to model.get_response(...) in _get_new_response with invocations of the new hooks so that they fire immediately before and after each LLM call (a condensed sketch follows this list).

  • tests/test_agent_llm_hooks.py
    Added unit tests (using a mock model and spy hooks; a condensed test sketch also follows this list) to validate:

    1. The correct sequence of on_start → on_llm_start → on_llm_end → on_end in a chat‑only run.
    2. The correct wrapping of tool execution in a tool‑using run:
      on_start → on_llm_start → on_llm_end → on_tool_start → on_tool_end → on_llm_start → on_llm_end → on_end.
    3. That the agent still runs without error when agent.hooks is None.
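
Schematically, the change looks like this. The snippet below is a condensed, illustrative sketch rather than the verbatim diff; names such as context_wrapper, system_prompt, and input only approximate the surrounding run.py code.

# src/agents/lifecycle.py — new default no-op methods on the AgentHooks base class (sketch)
async def on_llm_start(self, context, agent, system_prompt, input_items) -> None:
    """Called immediately before the agent issues an LLM call."""
    pass

async def on_llm_end(self, context, agent, response) -> None:
    """Called immediately after the LLM call returns."""
    pass

# src/agents/run.py — the hooks wrap the model call inside _get_new_response (sketch)
if agent.hooks:
    await agent.hooks.on_llm_start(context_wrapper, agent, system_prompt, input)

new_response = await model.get_response(...)  # the existing model call, unchanged

if agent.hooks:
    await agent.hooks.on_llm_end(context_wrapper, agent, new_response)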
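
A condensed version of the chat-only test looks roughly like the following. The FakeModel and get_text_message helpers (and their method names) are assumptions based on the repo's existing test utilities, not necessarily the exact code in tests/test_agent_llm_hooks.py.

import pytest

from agents.agent import Agent
from agents.lifecycle import AgentHooks
from agents.run import Runner

from .fake_model import FakeModel  # assumed test helper (tests/fake_model.py)
from .test_responses import get_text_message  # assumed helper that builds a text output item


class SpyHooks(AgentHooks):
    """Records the name of every hook invocation so the order can be asserted."""

    def __init__(self):
        self.events: list[str] = []

    async def on_start(self, context, agent):
        self.events.append("on_start")

    async def on_llm_start(self, context, agent, system_prompt, input_items):
        self.events.append("on_llm_start")

    async def on_llm_end(self, context, agent, response):
        self.events.append("on_llm_end")

    async def on_end(self, context, agent, output):
        self.events.append("on_end")


@pytest.mark.asyncio
async def test_chat_only_hook_order():
    hooks = SpyHooks()
    model = FakeModel()
    model.set_next_output([get_text_message("hi")])  # assumed FakeModel API

    agent = Agent(name="test_agent", model=model, hooks=hooks)
    await Runner.run(agent, "hello")

    assert hooks.events == ["on_start", "on_llm_start", "on_llm_end", "on_end"]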

Usage Examples

1. Async Example (awaitable via run)

import asyncio
from typing import Any, Optional

from dotenv import load_dotenv

from agents.agent import Agent
from agents.items import ModelResponse, TResponseInputItem
from agents.lifecycle import AgentHooks, RunContextWrapper
from agents.run import Runner

# Load any OPENAI_API_KEY or other env vars
load_dotenv()


# --- 1. Define a custom hooks class to track LLM calls ---
class LLMTrackerHooks(AgentHooks[Any]):
    async def on_llm_start(
        self,
        context: RunContextWrapper,
        agent: Agent,
        system_prompt: Optional[str],
        input_items: list[TResponseInputItem],
    ) -> None:
        print(
            f">>> [HOOK] Agent '{agent.name}' is calling the LLM with system prompt: {system_prompt or '[none]'}"
        )

    async def on_llm_end(
        self,
        context: RunContextWrapper,
        agent: Agent,
        response: ModelResponse,
    ) -> None:
        if response.usage:
            print(f">>> [HOOK] LLM call finished. Tokens used: {response.usage.total_tokens}")


# --- 2. Create your agent with these hooks ---
my_agent = Agent(
    name="MyMonitoredAgent",
    instructions="Tell me a joke.",
    hooks=LLMTrackerHooks(),
)


# --- 3. Drive it via an async main() ---
async def main():
    result = await Runner.run(my_agent, "Tell me a joke.")
    print(f"\nAgent output:\n{result.final_output}")


if __name__ == "__main__":
    asyncio.run(main())

2. Sync Example (blocking via run_sync)

from typing import Any, Optional

from dotenv import load_dotenv

from agents.agent import Agent
from agents.items import ModelResponse, TResponseInputItem
from agents.lifecycle import AgentHooks, RunContextWrapper
from agents.run import Runner

# Load any OPENAI_API_KEY or other env vars
load_dotenv()


# --- 1. Define a custom hooks class to track LLM calls ---
class LLMTrackerHooks(AgentHooks[Any]):
    async def on_llm_start(
        self,
        context: RunContextWrapper,
        agent: Agent,
        system_prompt: Optional[str],
        input_items: list[TResponseInputItem],
    ) -> None:
        print(
            f">>> [HOOK] Agent '{agent.name}' is calling the LLM with system prompt: {system_prompt or '[none]'}"
        )

    async def on_llm_end(
        self,
        context: RunContextWrapper,
        agent: Agent,
        response: ModelResponse,
    ) -> None:
        if response.usage:
            print(f">>> [HOOK] LLM call finished. Tokens used: {response.usage.total_tokens}")


# --- 2. Create your agent with these hooks ---
my_agent = Agent(
    name="MyMonitoredAgent",
    instructions="Tell me a joke.",
    hooks=LLMTrackerHooks(),
)


# --- 3. Drive it via a blocking main() ---
def main():
    result = Runner.run_sync(my_agent, "Tell me a joke.")
    print(f"\nAgent output:\n{result.final_output}")


if __name__ == "__main__":
    main()
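
3. Latency Example (timing LLM calls)

This additional example is not part of the PR itself; it is a sketch of the performance-monitoring use case from the motivation, assuming the same hook signatures as the examples above.

import asyncio
import time
from typing import Any, Optional

from agents.agent import Agent
from agents.items import ModelResponse, TResponseInputItem
from agents.lifecycle import AgentHooks, RunContextWrapper
from agents.run import Runner


class LatencyHooks(AgentHooks[Any]):
    """Times each LLM call by pairing on_llm_start with the matching on_llm_end."""

    def __init__(self) -> None:
        self._started_at: Optional[float] = None

    async def on_llm_start(
        self,
        context: RunContextWrapper,
        agent: Agent,
        system_prompt: Optional[str],
        input_items: list[TResponseInputItem],
    ) -> None:
        self._started_at = time.perf_counter()

    async def on_llm_end(
        self,
        context: RunContextWrapper,
        agent: Agent,
        response: ModelResponse,
    ) -> None:
        if self._started_at is not None:
            elapsed = time.perf_counter() - self._started_at
            print(f">>> [HOOK] LLM call for '{agent.name}' took {elapsed:.2f}s")


async def main():
    agent = Agent(
        name="LatencyMonitoredAgent",
        instructions="Tell me a joke.",
        hooks=LatencyHooks(),
    )
    result = await Runner.run(agent, "Tell me a joke.")
    print(f"\nAgent output:\n{result.final_output}")


if __name__ == "__main__":
    asyncio.run(main())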

Note

Streaming support for on_llm_start and on_llm_end is not yet implemented. These hooks currently fire only on non‑streamed (batch) LLM calls. Support for streaming invocations will be added in a future release.

Checklist

  • My code follows the style guidelines of this project (checked with ruff).
  • I have added tests that prove my feature works.
  • All new and existing tests passed locally with my changes.

@seratch added the enhancement (New feature or request) and feature:core labels on Jul 8, 2025