Back to changelog

Claude Opus 4.8, MiMo, guardrails log & model deprecation dates

Anthropic’s latest flagship arrives, Xiaomi MiMo joins the catalog, every guardrail evaluation gets a full audit log, and deprecation dates are now visible across all providers.

Claude Opus 4.8

Anthropic’s latest flagship is now available via both the standard Anthropic endpoint and AWS Bedrock. Opus 4.8 supports a 1M token context window, vision, tool use, streaming, structured output, and reasoning effort levels (low / medium / high / max) — at the same per-token price as Opus 4.7, no markup.

Xiaomi MiMo

Two new models from Xiaomi’s MiMo family are in the catalog: mimo-v2.5-pro and mimo-v2.5-flash. Both support streaming, temperature control, and reasoning parameters.

Guardrails event log

Every guardrail evaluation now generates a logged event. The new Guardrails Events page shows a paginated, filterable view — filter by rule, direction (input/output), and decision (allowed/blocked). Each row shows evaluation timestamp, latency, what was matched, and the final decision.

Multi-region routing for Bedrock and Vertex

Pin any request to a specific geographic region using the X-LLMAPI-Region header — now supported for AWS Bedrock and Google Vertex in addition to Azure. Useful for data residency requirements or reducing latency for users in a specific region.

Back to changelog

Get Started

Cut your AI bill, not your usage

Route every request to the right model. Track every dollar you spend. Cut your LLM costs by up to 60%.