2025.8.0
New Features
- Add `gateway.observability.skip_completed_migrations` configuration option to reduce gateway startup time and database load. When enabled, the gateway will skip running the ClickHouse migration workflow (i.e. verifying and potentially applying every migration) on startup for migrations that are already present in a database table that tracks migration history.
- Support `raw_text` content blocks in the OpenAI-compatible inference endpoint. (Thanks @hongantran3804 @pykm05 @pycoder49!)
- Allow users to collect outputs from "Try with variant" in the UI as demonstrations.
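
The behavior described for `gateway.observability.skip_completed_migrations` can be pictured with a small sketch. This is only an illustration of the idea, not the gateway's actual implementation: the function `run_migrations`, its parameters, and the in-memory `completed` set (standing in for the migration-history table) are all hypothetical names chosen for this example.

```python
from typing import Callable, Iterable, List, Set, Tuple

# Hypothetical sketch: `completed` stands in for the database table that
# tracks migration history; `skip_completed` stands in for the config option.
def run_migrations(
    migrations: Iterable[Tuple[str, Callable[[], None]]],
    completed: Set[str],
    skip_completed: bool,
) -> List[str]:
    """Run the migration workflow and return the names that were processed."""
    processed = []
    for name, apply_migration in migrations:
        # With the option enabled, a migration already recorded in the
        # history is not verified or applied again.
        if skip_completed and name in completed:
            continue
        apply_migration()      # verify and, if needed, apply the migration
        completed.add(name)    # record it in the history
        processed.append(name)
    return processed
```

With the flag off, every migration goes through the workflow on each startup; with it on, only migrations missing from the history do, which is where the startup-time and database-load savings would come from.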
Bug Fixes
- Fix handling of reasoning content blocks for DeepSeek-R1 on AWS Bedrock.
- Set proper default value for `max_tokens` for the Anthropic and GCP Vertex AI Anthropic model providers. The gateway will now error if no value is provided in the configuration or request and the model is unknown.
- Skip caching model inferences that generated invalid tool call arguments.
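
The `max_tokens` fix above can be read as a fallback chain. The sketch below shows one plausible reading under stated assumptions: the function name `resolve_max_tokens`, the `KNOWN_MODEL_DEFAULTS` table, its placeholder entry, and the precedence of the request value over the configuration value are all hypothetical and are not taken from the gateway's code.

```python
from typing import Optional

# Hypothetical table of per-model defaults; the name and value are placeholders.
KNOWN_MODEL_DEFAULTS = {"example-known-model": 4096}

def resolve_max_tokens(
    model: str,
    request_value: Optional[int] = None,
    config_value: Optional[int] = None,
) -> int:
    """Pick a max_tokens value, or fail if none can be determined."""
    # Assumption: a value in the request wins over one in the configuration.
    if request_value is not None:
        return request_value
    if config_value is not None:
        return config_value
    # Fall back to a built-in default when the model is recognized.
    if model in KNOWN_MODEL_DEFAULTS:
        return KNOWN_MODEL_DEFAULTS[model]
    # Unknown model and no explicit value: error instead of guessing.
    raise ValueError(
        f"max_tokens must be set in the configuration or request for model {model!r}"
    )
```

The point of the final branch is that an unknown model with no explicit value produces an error rather than a silently chosen limit.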
& many under-the-hood and UI improvements (thanks @michaldorsett @K-coder05 @dcaputo-harmoni @masonblier @Nicolasgarbarino!)