Skip to content

2025.8.0

Compare
Choose a tag to compare
@GabrielBianconi GabrielBianconi released this 06 Aug 03:24
· 27 commits to main since this release
12c9d60

New Features

  • Add gateway.observability.skip_completed_migrations configuration option to reduce gateway startup time and database load. When enabled, the gateway will skip running the ClickHouse migration workflow (i.e. verifying and potentially applying every migration) on startup for migrations that are already present in a database table that tracks migration history.
  • Support raw_text content blocks in the OpenAI-compatible inference endpoint. (Thanks @hongantran3804 @pykm05 @pycoder49!)
  • Allow users to collect outputs from "Try with variant" in the UI as demonstrations.

Bug Fixes

  • Fix handling of reasoning content blocks for DeepSeek-R1 on AWS Bedrock.
  • Set proper default value for max_tokens for the Anthropic and GCP Vertex AI Anthropic model providers. The gateway will now error if no value is provided in the configuration or request and the model is unknown.
  • Skip caching model inferences that generated invalid tool call arguments.

& many under-the-hood and UI improvements (thanks @michaldorsett @K-coder05 @dcaputo-harmoni @masonblier @Nicolasgarbarino!)