Pricing - LLM API

Save up to $50/mo

Build

For indie devs and early-stage teams validating AI features in production — light traffic, real users.

-10%OFF

Discount applied automatically.

Start Saving

Up to $1,000/mo usage

Self-serve top-up & usage alerts
Community + email support

Save up to $1,000/mo

Ship

For growing teams shipping AI to a real user base — steady traffic, multiple features, predictable spend.

-15%OFF

Discount applied automatically.

Start Saving

Up to $10,000/mo usage

Priority routing & team analytics
Shared budgets & per-key IAM

Save up to $15,000/mo

Scale

For high-volume AI products at scale — multiple workloads, heavy concurrency, strict reliability.

-20%OFF

Discount applied automatically.

Start Saving

Up to $100,000/mo usage

Reserved throughput & SLA
Named solution engineer

Lite

For developers exploring models and testing ideas before committing to a stack.

$30/ month

$1.00/day 
billed monthly

Start Building

Token budget

Monthly pool 20M
Daily limit 4M
5h burst limit 1.5M

X4 tokens

Pro

For developers actively integrating AI into real projects and shipping regularly.

$100/ month

$3.33/day 
billed monthly

Start Building

$3.33/day billed monthly

Monthly pool 80M
Daily limit 13M
5h burst limit 5M

X2 daily limit

Plus

For professional developers running AI as a daily tool across multiple projects at once.

$200/ month

$6.67/day 
billed monthly

Start Building

Token budget

Monthly pool 130M
Daily limit 26M
5h burst limit 11M

Your $30 buys millions of frontier tokens.

Each plan converts your monthly fee into a token pool you can spend across any model.  
Here’s what that pool gets you on the models people actually use to ship.

Up to x10

Token value per $1 spent

Provider rate

Lite

$30/mo

Pro

$100/mo

Plus

$200/mo

Claude Opus 4.8

Anthropic · $25 / 1M tokens

6.9 М

~$173

23.1 М

~$578

46.2 М

~$1,155

Claude Sonnet 4.6

Anthropic · $15 / 1M tokens

11.5 М

~$173

38.5 М

~$578

76.9 М

~$1,154

GPT-5.5

OpenAI · $30 / 1M tokens

5.8 М

~$174

19.4 М

~$582

38.7 М

~$1,161

GPT-5.3 Codex

OpenAI · $14 / 1M tokens

12.5 М

~$175

41.8 М

~$585

83.6 М

~$1,170

Gemini 3.5 Flash

Google · $9 / 1M tokens

19.4 М

~$175

64.5 М

~$581

129 М

~$1,161

Grok 4.3

xAI · $3 / 1M

100 M

~$250

334 M

~$833

667 M

~$1,667

DeepSeek V4 Pro

DeepSeek · $1 / 1M

300 M

~$261

1 B

~$870

2 B

~$1,740

DeepSeek V4 Flash

DeepSeek · $0.28 / 1M

1.1 B

~$300

3.6 B

~$1,000

7.1 B

~$2,000

Kimi K2.6

Moonshot · $4 / 1M

50 M

~$200

167 M

~$667

334 M

~$1,333

Maximum retail value if you spend the whole subscription on one model.

up to ~$300

you pay $30

up to ~$1,000

you pay $100

up to ~$2,000

you pay $200

900% ROI

For every $1 spent, you’re getting up to $10 in token costs across the most powerful models.

See all 400+ models

Enterprise

Bigger commits, 
deeper discounts.

Inference

Up to 50% off

Wholesale rates passed through, no caps.

Deployment

Private & BYOK

VPC, on-prem, or your provider keys.

Support

Dedicated CSM

Private Slack channel, priority response.

Contract

Custom SLAs

Multi-entity invoicing, custom terms.

Claude

Anthropic: Opus, Sonnet, Haiku

Up to 30% OFF

ChatGPT

OpenAI: GPT-5, O-Series, Codex

Up to 30% OFF

Gemini

Google: Pro, Flash, Ultra

Up to 20% OFF

ElevenLabs

Voice: TTS, Conversational

Up to 20% OFF

Open-source models

DeepSeek, Qwen, GLM, MiniMax, Kimi

Up to 50% OFF

Everything included

Same platform on every plan

The toggle changes the billing model. The product doesn’t change.
Every plan ships with the full LLM.API surface.

99.99% Uptime

Forget about downtime with LLM API smart routing, which guarantees you a reliable workspace

Drop-in OpenAI-compatible

Swap your API key, keep your code. Works with every OpenAI SDK and tool on day one.

Smart routing & fallback

Rule-based router, automatic fallback on provider outages, reserved throughput when you need it.

EvalLab + prompt management

Test prompts side-by-side across models, version them, and push to production with one click.

Budget & rate controls

Per-key IAM rules, per-team budgets, hard rate limits. Nothing surprises your finance lead.

Zero data retention

Prompts and responses are never stored. Default-on for every plan, no extra configuration.

BYOK & private deployments

Bring your own provider keys, deploy to VPC or on-prem, pick the region your data lives in.

SOC 2 · GDPR · HIPAA

Audited compliance documentation, BAAs, DPAs and security questionnaires ready when you ask.

Analytics & cost insights

Request-level traces, model-mix breakdowns, exportable usage data, alerts when patterns shift.

Token pricing

Approval is a checkbox, not a project.

Built to the standards your security, legal, and compliance teams already trust.

SOC 2 Type II

ISO 27001

CCPA Compliant

GDPR Compliant

FAQ

Frequently Asked Questions

What’s the difference between “Production” & “Coding”?

AI for Production is pay-as-you-go: you top up credits, get an automatic volume discount (10–20%) on every token, and run with no rate or usage limits. AI for Coding is a flat monthly subscription ($30–$200) with a fixed list of included models and 5-hour, daily, and monthly token limits.

What models are included in “AI for Coding”?

The Coding subscription covers a curated set of frontier models: Claude Opus 4.8, Claude Sonnet 4.6, Claude Haiku 4.5, GPT-5.5, GPT-5.3 Codex, Gemini 3.5 Flash, Grok 4.3, DeepSeek-V4-Pro, DeepSeek-V4-Flash, and Kimi K2.6. Your monthly fee converts into a token pool you spend across these, and the same dollar stretches further on the cheaper ones (up to ~6x token value per $1).

How do the “AI for Production” discounts work?

Your tier (Build/Ship/Scale) is set automatically by your last 30 days of usage and re-evaluated every billing cycle — there are no contracts or negotiation. Just add credits and the discount applies on top-up.

Do my credits expire if I don’t use them?

No. Credits never expire, so unused balance carries forward indefinitely.

Do cheaper plans get a worse product or fewer features?

No. Every plan — and both billing tracks — ships the full LLM.API surface: smart routing, fallback, EvalLab, budget controls, BYOK, zero data retention, and SOC 2 / GDPR / CCPA / ISO compliance. The toggle only changes how you’re billed.

How hard is it to switch my existing code to LLM.API?

It’s drop-in OpenAI-compatible — swap your API key and keep your code, since it works with every OpenAI SDK and tool on day one.

Get Started

Start in one line of code

Swap your API key. Keep your code.

One bill for all LLMs Zero surprises

Build

Ship

Scale

Lite

Pro

Plus

Your $30 buys millions of frontier tokens.

Bigger commits, deeper discounts.

Up to 50% off

Private & BYOK

Dedicated CSM

Custom SLAs

Same platform on every plan

99.99% Uptime

Drop-in OpenAI-compatible

Smart routing & fallback

EvalLab + prompt management

Budget & rate controls

Zero data retention

BYOK & private deployments

SOC 2 · GDPR · HIPAA

Analytics & cost insights

Approval is a checkbox, not a project.

SOC 2 Type II

ISO 27001

CCPA Compliant

GDPR Compliant

Frequently Asked Questions

What’s the difference between “Production” & “Coding”?

What models are included in “AI for Coding”?

How do the “AI for Production” discounts work?

Do my credits expire if I don’t use them?

Do cheaper plans get a worse product or fewer features?

How hard is it to switch my existing code to LLM.API?

Start in one line of code

Bigger commits, 
deeper discounts.