PRICING

One bill for all LLMs Zero surprises

The most cost-efficient AI infrastructure, with full control over every request.

Save up to $50/mo

Build

For indie devs and early-stage teams validating AI features in production — light traffic, real users.

-5%OFF

Discount applied automatically.

Start Build

Save up to $15,000/mo

Scale

For high-volume AI products at scale — multiple workloads, heavy concurrency, strict reliability.

-15%OFF

Discount applied automatically.

Start Scale

Your $30 buys millions of frontier tokens.

Each plan converts your monthly fee into a token pool you can spend across any model. 

Here’s what that pool gets you on the models people actually use to ship.

Up to x6

Token value per $1 spent

Provider rate

Lite

$30/mo

Pro

$100/mo

Plus

$200/mo

Claude Opus 4.8

Anthropic · $20 / 1M tokens

6.9 М

~$140

23.1 М

~$470

46.2 М

~$930

Claude Sonnet 4.6

Anthropic · $12 / 1M tokens

11.5 М

~$140

38.5 М

~$470

76.9 М

~$930

GPT-5.5

OpenAI · $25 / 1M tokens

5.8 М

~$150

19.4 М

~$490

38.7 М

~$970

GPT-5.3 Codex

OpenAI · $13 / 1M tokens

12.5 М

~$170

41.8 М

~$550

83.6 М

~$1090

Gemini 3.5 Flash

Google · $8 / 1M tokens

19.4 М

~$160

64.5 М

~$520

129 М

~$1040

Grok 4.3

xAI · $1.5 / 1M

100 M

~$150

334 M

~$500

667 M

~$1000

DeepSeek V4 Pro

DeepSeek · $0.5 / 1M

300 M

~$150

1 B

~$500

2 B

~$1000

DeepSeek V4 Flash

DeepSeek · $0.14 / 1M

1.1 B

~$150

3.6 B

~$500

7.1 B

~$1000

Kimi K2.6

Moonshot · $3 / 1M

50 M

~$150

167 M

~$500

334 M

~$1000

Maximum retail value if you spend the whole subscription on one model.

up to ~$170

you pay $30

up to ~$550

you pay $100

up to ~$1,090

you pay $200

500% ROI

For every $1 spent, you’re getting up to $5 in token costs across the most powerful models.

Enterprise

Bigger commits,

deeper discounts.

Inference

Up to 50% off

Wholesale rates passed through, no caps.

Deployment

Private & BYOK

VPC, on-prem, or your provider keys.

Support

Dedicated CSM

Private Slack channel, priority response.

Contract

Custom SLAs

Multi-entity invoicing, custom terms.

Model
Enterprise Discount
Claude

Claude

Anthropic: Opus, Sonnet, Haiku

Up to 30% OFF
ChatGPT

ChatGPT

OpenAI: GPT-5, O-Series, Codex

Up to 30% OFF
Gemini

Gemini

Google: Pro, Flash, Ultra

Up to 20% OFF
ElevenLabs

ElevenLabs

Voice: TTS, Conversational

Up to 20% OFF
Open-source models
Open-source models
Open-source models
Open-source models
Open-source models

Open-source models

DeepSeek, Qwen, GLM, MiniMax, Kimi

Up to 50% OFF

Token pricing

Approval is a checkbox, not a project.

Built to the standards your security, legal, and compliance teams already trust.

SOC 2 Type II
ISO 27001
CCPA Compliant
GDPR Compliant

FAQ

Frequently Asked Questions

What’s the difference between “Production” & “Coding”?

AI for Production is pay-as-you-go: you top up credits, get an automatic volume discount (5–15%) on every token, and run with no rate or usage limits. AI for Coding is a flat monthly subscription ($30–$200) with a fixed list of included models and 5-hour, daily, and monthly token limits.

What models are included in “AI for Coding”?

The Coding subscription covers a curated set of frontier models: Claude Opus 4.8, Claude Sonnet 4.6, Claude Haiku 4.5, GPT-5.5, GPT-5.3 Codex, Gemini 3.5 Flash, Grok 4.3, DeepSeek-V4-Pro, DeepSeek-V4-Flash, and Kimi K2.6. Your monthly fee converts into a token pool you spend across these, and the same dollar stretches further on the cheaper ones (up to ~6x token value per $1).

How do the “AI for Production” discounts work?

Your tier (Build/Ship/Scale) is set automatically by your last 30 days of usage and re-evaluated every billing cycle — there are no contracts or negotiation. Just add credits and the discount applies on top-up.

Do my credits expire if I don’t use them?

No. Credits never expire, so unused balance carries forward indefinitely.

Do cheaper plans get a worse product or fewer features?

No. Every plan — and both billing tracks — ships the full LLM.API surface: smart routing, fallback, EvalLab, budget controls, BYOK, zero data retention, and SOC 2 / GDPR / CCPA / ISO compliance. The toggle only changes how you’re billed.

How hard is it to switch my existing code to LLM.API?

It’s drop-in OpenAI-compatible — swap your API key and keep your code, since it works with every OpenAI SDK and tool on day one.

Get Started

Start in one line of code

Swap your API key. Keep your code.