- NPX
- Docker
✨ Features
- MCP Tool Groups — Added
tool_groupsconfig with governance scoping (virtual key, team, customer, user, provider, API key) and camelCase Helm aliases for MCP client fields - Enterprise Helm Overlays — Suite of composable overlays for guardrails, org governance, access profiles, customer budgets, teams, multi-customer governance, and SCIM/SSO
- Semantic Cache Helm Layers — Added
values-semantic-search-redis.yamlandvalues-semantic-search-weaviate.yamlfor Redis and Weaviate-backed semantic caches, plus a client-config overlay - Period Parameter in Dashboard/Logs APIs — Added
periodparam for relative time range queries on dashboard and logs endpoints provider_key_nameAlias — Human-readable alias for routing targets and pricing overrides, resolved tokey_idat config load timeenv.*References for Proxy and TLS — Proxy and TLS config fields (url,username,password,ca_cert_pem) now acceptenv.VAR_NAMEfor secret injection- MCP Duration Strings and Hash Reconciliation —
tool_sync_intervalaccepts Go duration strings; hash-based reconciliation prevents unnecessary MCP client restarts on config reload - Auto-fill Incoming Model for Fallbacks — Routing rule fallback entries can omit the model; the incoming request model is substituted automatically at runtime
- Namespace Tool Type — Namespace tool container type in Responses API; non-OpenAI providers receive automatically flattened tool lists
- Cache Creation Pricing — Cache creation details for Claude models with 5-minute and 1-hour TTL pricing tiers
- Governance Config Sync — Model configs and provider governance bindings now sync from
config.jsonto DB at startup business_unitsand Team Fields — Addedbusiness_units,team_id,calendar_aligned, andvirtual_key_countto governance schema and Helm
🐞 Fixed
- WebSocket /responses Reliability — Fixed upstream handshake diagnostics, proper error capture, and WebSocket connection lifecycle in the native
/responsespath - Raw Request Passthrough Removed — Removed
SendBackRawRequestfrom all provider passthrough flows; passthrough streaming now sets proper SSE headers - WebSocket Nil Checks — Improved
sendMessageSafelynil guards, panic recovery, and client cleanup - Routing Rule Query Normalization — Normalized
queryfield to validRuleGroupTypeand tightened schema validation - PydanticAI Null Text Fields — Normalized null text content in PydanticAI stream response chunks
- Budget and Team Co-creation — Fixed creation of budgets and teams in the same request
- Provider Reload — Fixed keyless provider status updates during config reload; provider runtime now reloads correctly after key creation
- OTel Metrics — Fixed OpenTelemetry metrics pipeline not working (thanks @tcx4c70!)
- OTel Export — Fixed OTEL exporting to correctly show input and output messages
- Logging Request Type — Resolved request type from pending data before streaming to prevent missing
Objectfield in error logs - Multipart File Uploads — Write multipart metadata before file content to fix upload ordering
- Env Var Redacted Check — Added missing redacted check for env var values
- feat: Standardizes tool stripping and anthropic integration handling against anthropic, vertex and azure
- fix: drops empty thinking block for anthropic provider for claude code
- feat: add support for cache creation details for claude models
- feat: adds missing anthropic params / handling in Vertex
- feat: add namespace tool type support in Responses API
- feat: support
env.*references for proxy and TLS config fields - feat: add Go duration string support for
tool_sync_intervaland hash-based MCP client config reconciliation - fix: add redacted check for env var values
- fix: write multipart metadata before file uploads
- fix: removes redundant IsDefined func
- fix: native WebSocket /responses stream reliability improvements
- fix: Fixes OTEL exporting in
framework/tracing/llmspan.goto show input and output messages correctly - feat: add support for cache creation cost above 1 hour
- feat: add
provider_key_namealias for routing targets and pricing overrides, resolved tokey_idat config load - feat: add Go duration string support for MCP
tool_sync_intervaland hash-based client config reconciliation - feat: support
env.*references for proxy and TLS config fields - feat: sync model config and provider governance bindings from config file to DB
- chore: extend migration tests, fix column ignores, add routing rule search
- feat: drops tools web_search and web_search_preview if model does not support it
- feat: flattens tool type namespace if provider does not support it
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- feat: automatically add incoming model to empty fallbacks in routing rules
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- fix: resolve request type from pending data before streaming chunk processing
- chore: adds debug logging in logging plugin lifecycle
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- fix: OTel metrics not working (thanks @tcx4c70!)
- fix: OTel conversion from built-in trace
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- chore: upgraded core to v1.5.5 and framework to v1.3.5
- chore: upgraded core to v1.5.5 and framework to v1.3.5

