Lyria 3 Pro Preview

Text Generation

Lyria 3 Pro Preview is Google DeepMind’s flagship music generation model, optimized for producing full-length, structurally coherent songs from text or image prompts. It outputs high-quality 48 kHz stereo audio and is available in preview via the Gemini API and Google AI Studio.

Start Using API

API Performance

Latency: ~8s avg generation time for 30–90s clip
Context: 180s max duration
Input: ~$0.80 per 1,000 seconds generated
Output: ~$0.00 included in input cost
Uptime: 99% 99%

About the model

What is Lyria 3 Pro Preview?

Lyria 3 Pro Preview is Google’s advanced AI music generation model designed to create full songs with complex structural coherence, including multiple verses, choruses, and bridges. It is mainly used to generate three-minute, high-fidelity stereo music tracks from text prompts or image inputs, giving creators fine-grained control over song sections and styles. It also serves developers and enterprises that need scalable music generation through platforms like Vertex AI, Gemini API, and Google AI Studio. The model follows earlier Lyria versions such as Lyria 3, extending their 30-second clip capabilities into full-length compositions within the same Lyria family.

Input / Output

Input

Text prompts
Image prompts

Output

Music audio (MP3, stereo)
Generated lyrics as text

Model capabilities

5 Core Capabilities

Conversational Chat

Generates context-aware, multi-turn conversational responses, handling complex instructions, following user intent, and maintaining coherent dialogue in natural language.
Music Generation

Produces high-quality music audio from prompts, supporting varied styles, structures, and instrumentation, optimized for creative composition and experimentation.
Lyric Handling

Understands and generates song lyric text aligned to musical concepts, styles, and themes for use alongside audio-based music generation workflows.
Multilingual Text

Processes and generates text across multiple languages, enabling cross-lingual interaction with prompts and outputs for global creative use cases.
Text Prompting

Interprets detailed text prompts describing musical mood, genre, and structure to guide resulting music generation outputs effectively and consistently.

Use cases

6 Most Valuable Use Cases

Music Style Transfer
Melody To Orchestration
Generative Film Scoring
Adaptive Game Soundtracks
Advertising Jingle Creation
AI Music Prototyping

Transparent pricing

Cost Comparison

LLM API offers the lowest cost and latency for Lyria 3 Pro–class music generation.

Provider	Region	Latency	Throughput	Uptime	Input ($/1M)	Output ($/1M)	Context
LLM API BEST	Global	200ms	120 tracks/min	99.99%	$0.40/1K tokens	$0.40/1K tokens	32K tokens
Google	Global	~350ms	~60 tracks/min	99.9%	~$0.80/1K tokens	~$0.80/1K tokens	32K tokens
AWS Bedrock (3rd-party music model)	US East	~420ms	~45 tracks/min	99.9%	~$1.00/1K tokens	~$1.00/1K tokens	~16K tokens
Azure AI Studio (music model)	EU West	~380ms	~50 tracks/min	99.9%	~$0.90/1K tokens	~$0.90/1K tokens	~32K tokens

Performance benchmarks

Technical Specifications

Metric	Lyria 3 Pro Preview	GPT-4o (mini)	Claude 3 Haiku
Avg Latency	~180ms	~220ms	~250ms
Context Window	128K	128K	200K
Input Price ($/1M)	$0.20	$0.15	$0.25
Output Price ($/1M)	$0.60	$0.60	$0.75
Max Output Tokens	8K	4K	4K
Throughput	~60 tps	~50 tps	~45 tps
Uptime	99.9%	99.9%	99.9%

30-day usage via LLM API

3.8B: Prompt tokens processed (30 days)
260M: Completion tokens generated (30 days)
7.5M: API requests served (30 days)
99.8%: Avg uptime over last 30 days

Start Using API

Architecture & Integration

Why Build on LLM.API?

One unified API. Every major model. Built-in reliability, cost control, and observability.

Unified AI Routing

Automatically route each request to the best model across providers based on latency, cost, and quality—without changing your integration or redeploying.
One endpoint, any model
Cost-Aware Controls

Set budgets, price caps, and model-level policies so LLM.API always picks the most cost-efficient option while meeting your performance and quality targets.
Optimize spend by design
Resilient Fallbacks

Define automatic cross-provider fallbacks so outages, rate limits, or slow regions don’t break your app—responses keep flowing with no client-side logic.
Never go dark
Deep Observability

Get unified logs, traces, and metrics across all models and vendors, with per-request insights for latency, errors, and spend in one place.
See every token
Task-Level Orchestration

Express work as high-level tasks—like RAG, agents, or tools—and let LLM.API handle prompt shaping, model selection, and retries for consistent outcomes.
Ship features, not glue
High-Throughput Batch

Submit massive batches across providers with automatic chunking, parallelization, and retries to maximize throughput while keeping costs and rate limits under control.
Scale to millions

Decision guide

When to Use — When NOT to Use

Use it if...

You need high-quality music generation from text prompts, focusing on melody and structure.
You need to quickly iterate musical ideas or demos without professional composing skills.
Your use case involves generating varied musical styles to prototype game or app soundtracks.
You need AI-assisted music content for social media, marketing clips, or short videos.
Your use case involves exploring AI music capabilities within Google's Gemini and generative tools ecosystem.

Avoid if...

You need a general-purpose language model for coding, reasoning, or text-heavy workloads.
Your workload requires enterprise-grade guarantees around content licensing, rights management, and ownership.
You need precise control over multi-instrument arrangements comparable to professional DAW workflows.
Your workload requires predictable latency and throughput for large-scale production audio pipelines.
You need fully transparent, fine-grained safety controls and governance for regulated industry deployments.

FAQ

Frequently Asked Questions

What is Lyria 3 Pro Preview?

Lyria 3 Pro Preview is a Google large language model accessible via LLM.API for high-quality text generation and reasoning workloads.
What is the context window of Lyria 3 Pro Preview?

Lyria 3 Pro Preview supports up to a 128K token context window for combined prompt and response.
Which modalities does Lyria 3 Pro Preview support?

Lyria 3 Pro Preview currently supports text input and text output only through LLM.API.
How is Lyria 3 Pro Preview priced on LLM.API?

Lyria 3 Pro Preview uses a pay-per-token pricing model; check your LLM.API dashboard for current input and output token rates.
How fast is Lyria 3 Pro Preview for typical requests?

For short prompts, Lyria 3 Pro Preview usually responds within a few seconds, depending on prompt size and LLM.API load.
What is Lyria 3 Pro Preview best suited for?

Lyria 3 Pro Preview is best for complex reasoning, code assistance, long-form content generation, and multi-step data processing tasks.
How do I call Lyria 3 Pro Preview via LLM.API?

Specify the provider as "google" and the model name "lyria-3-pro-preview" in your LLM.API completion or chat request.
How does Lyria 3 Pro Preview compare to similar Google models?

Compared to lighter Google models, Lyria 3 Pro Preview generally offers stronger reasoning and coding quality at higher compute cost.
Does Lyria 3 Pro Preview support streaming responses on LLM.API?

Yes, you can enable streaming in your LLM.API request to receive Lyria 3 Pro Preview tokens incrementally.
What are the main limitations of Lyria 3 Pro Preview?

Lyria 3 Pro Preview can hallucinate facts, reflect training data biases, and should not be relied on for unsupervised high-risk decisions.

Start in 2 lines of code

Get My API Key

Lyria 3 Pro Preview

What is Lyria 3 Pro Preview?

5 Core Capabilities

Conversational Chat

Music Generation

Lyric Handling

Multilingual Text

Text Prompting

6 Most Valuable Use Cases

Cost Comparison

Technical Specifications

Why Build on LLM.API?

Unified AI Routing

Cost-Aware Controls

Resilient Fallbacks

Deep Observability

Task-Level Orchestration

High-Throughput Batch

When to Use — When NOT to Use

Use it if...

Avoid if...

Start in 2 lines of code