Powered by Upstage

Solar Pro 3

  • Instruction Following

Solar Pro 3 is Upstage’s Mixture-of-Experts large language model with 102B total parameters (12B active), a 128K-token context window, and strong extended reasoning and tool-use capabilities.

Start Using API

What is Solar Pro 3?

Solar Pro 3 is a proprietary Mixture-of-Experts language model from Upstage optimized for efficient, high-quality text generation and reasoning. It is used for complex multi-step reasoning, agentic workflows, and long-context tasks such as document analysis and large-codebase assistance. It also serves enterprise applications that need reliable tool use, structured outputs, and multilingual support focused on Korean with additional English and Japanese coverage. Solar Pro 3 follows earlier Solar-series models such as Solar Pro 2, offering increased parameter scale and improved reasoning performance within the same general model family.

5 Core Capabilities

  • Text Generation

    Generates and edits high-quality text responses across domains, suitable for content creation, SEO workflows, and structured writing tasks.

  • Long Context Handling

    Processes and reasons over long inputs with a context window up to 128K tokens, supporting document-heavy and retrieval-oriented applications.

  • Tool Use

    Supports tool use and function calling, enabling agentic workflows that interact with external systems and APIs programmatically.

  • Structured Outputs

    Produces well-structured JSON and schema-conformant outputs, useful for automation pipelines and programmatic integration with downstream systems.

  • Multilingual Support

    Handles multiple languages with strong performance in Korean and solid English and Japanese support for multilingual applications.

6 Most Valuable Use Cases

  • Code Generation Assistance
  • Enterprise Document Search
  • Contract Review Support
  • Invoice Extraction Automation
  • Customer Support Triage
  • Business Process Workflows

Cost Comparison

LLM API offers the lowest cost and latency for Solar Pro 3–class models.

Provider Region Latency Throughput Uptime Input ($/1M) Output ($/1M) Context
LLM API BEST Global 120ms 80 tps 99.99% $0.20 $0.60 256K
Upstage Global ~250ms ~40 tps ~99.9% ~$0.25 ~$0.75 ~128K
OpenRouter Global ~320ms ~35 tps ~99.9% ~$0.30 ~$0.90 ~128K
Together AI US East ~280ms ~45 tps ~99.9% ~$0.28 ~$0.85 ~128K
Fireworks AI US West ~260ms ~50 tps ~99.95% ~$0.26 ~$0.80 ~128K

Technical Specifications

Metric Solar Pro 3 (Upstage) GPT-4.1 (OpenAI) Claude 3.5 Sonnet (Anthropic)
Avg Latency ~180ms ~220ms ~250ms
Context Window 200K 128K 200K
Input Price ($/1M) $0.80 $5.00 $3.00
Output Price ($/1M) $4.00 $15.00 $15.00
Max Output Tokens 4K 4K 4K
Throughput ~80 tps ~60 tps ~50 tps
Uptime 99.9% 99.9% 99.9%

30-day usage via LLM API

9.8B
Prompt tokens processed (last 30 days)
7.1B
Completion tokens generated (last 30 days)
12.4M
API requests served (last 30 days)
99.8%
Average API uptime
Start Using API

Why Build on LLM.API?

One unified API. Every major model. Built-in reliability, cost control, and observability.

  • Unified AI Routing

    Dynamically route each request to the optimal model across providers using latency, cost, and quality signals—without changing your integration or redeploying code.

    One endpoint, every model.
  • Smart Cost Controls

    Define budgets, price ceilings, and routing rules so LLM.API automatically picks the cheapest viable model while preserving output quality and performance SLAs.

    Optimize spend by default.
  • Automatic Fallbacks

    Guard against provider outages and rate limits with configurable failover logic that instantly retries on backup models, maintaining uptime without custom error-handling glue.

    Resiliency built in.
  • Deep Observability

    Get per-request traces, latency and cost breakdowns, and structured logs across all providers from a single dashboard and API, ready for alerting and analytics.

    One pane of glass.
  • Task-Aware Orchestration

    Describe tasks at a higher level—chat, extraction, tools—and let LLM.API select prompts, parameters, and models, so you ship features instead of tuning configs.

    Think tasks, not prompts.
  • High-Throughput Batch

    Send massive batches of jobs through a single API call with concurrency controls, retries, and progress tracking, ideal for backfills, evaluations, and bulk processing.

    Scale jobs, not code.

When to Use — When NOT to Use

Use it if...

  • You need a strong general-purpose model for chatbots and virtual assistants.
  • You need solid coding assistance, including code completion, debugging, and explanation tasks.
  • Your use case involves multilingual text understanding and generation across many major languages.
  • Your use case involves drafting, rewriting, and polishing emails, reports, and marketing copy.
  • You need a capable model for question answering over moderately long documents or webpages.
  • You need a balance between quality and cost for everyday enterprise productivity workflows.

Avoid if...

  • You need cutting-edge performance on the hardest reasoning or math competition benchmarks.
  • Your workload requires guaranteed support for extremely long contexts, like hundreds of thousands tokens.
  • You need tightly integrated image or multimodal capabilities beyond basic text-only interactions.
  • You need deterministic, fully reproducible outputs with strict token-by-token compatibility guarantees.
  • Your workload requires highly specialized domain models, like medical diagnosis or legal argumentation.
  • You need robust offline deployment on highly constrained edge devices with minimal hardware resources.

Frequently Asked Questions

  • What is Solar Pro 3?

    Solar Pro 3 is a large language model by Upstage optimized for high-quality reasoning, coding, and general-purpose chat via the LLM.API gateway.

  • What is Solar Pro 3 best used for?

    Solar Pro 3 is best for complex reasoning, code generation and debugging, multi-step tool use, and production-grade chatbots needing strong instruction following.

  • What is the context window of Solar Pro 3?

    Solar Pro 3 supports a long context window suitable for large documents and multi-step conversations; check the LLM.API model card for the exact token limit.

  • How fast is Solar Pro 3 in terms of latency and throughput?

    Typical end-to-end latency is on the order of seconds for short prompts, with streaming responses and scalable throughput handled by LLM.API infrastructure.

  • Which modalities does Solar Pro 3 support?

    Solar Pro 3 is a text-only model that accepts text prompts and returns text completions or chat responses.

  • How do I call Solar Pro 3 through LLM.API?

    You can select the upstage/solar-pro-3 model name in the LLM.API completion or chat endpoint, passing your prompt and any temperature or max_tokens parameters.

  • How is Solar Pro 3 priced on LLM.API?

    Solar Pro 3 uses pay-as-you-go, per-token billing; see the LLM.API pricing page for current input and output token rates.

  • How does Solar Pro 3 compare to similar models?

    Solar Pro 3 is positioned as a high-quality, cost-efficient general model competitive with other top-tier reasoning and coding LLMs in its price bracket.

  • What are the main limitations of Solar Pro 3?

    Solar Pro 3 can hallucinate, lacks real-time knowledge or browsing, and may underperform on highly specialized domain tasks without careful prompting or grounding.

  • Can I fine-tune or customize Solar Pro 3 via LLM.API?

    Fine-tuning support depends on LLM.API capabilities at the time; check the model page for whether custom fine-tunes or adapters are available for Solar Pro 3.

Start in 2 lines of code

Get My API Key