Rnj 1 Instruct

Text Generation

Rnj-1 Instruct is an 8B-parameter, instruction-tuned open-weight model from EssentialAI, optimized for code generation, STEM reasoning, and agentic tool-using workflows with a 32K context window.

Start Using API

API Performance

Latency: ~0.7s time to first token (API, typical)
Context: 32K token context
Input: $0.15 per 1M tokens
Output: $0.15 per 1M tokens
Uptime: 99% 99%

About the model

What is Rnj 1 Instruct?

Rnj-1 Instruct is an 8B-parameter instruction-tuned language model from EssentialAI, trained from scratch and released as open weights under Apache 2.0. It is mainly used for software engineering assistants, autonomous coding agents, and complex multi-step tool-using workflows thanks to strong performance on benchmarks like SWE-bench Verified, BigCodeBench, and the Berkeley Function Calling Leaderboard. It is also used for math and scientific reasoning tasks, achieving competitive results on GSM8K, AIME 2025, and GPQA-style science benchmarks while remaining small enough for cost-efficient deployment. Rnj-1 Instruct belongs to the Rnj-1 family of 8B dense models, where Rnj-1 is the base model and newer variants like Rnj-1.5 Instruct extend its long-context and coding capabilities.

Input / Output

Input

Text prompts (chat-style messages)

Output

Text responses (chat-style completions)
Source code generation and editing

Model capabilities

5 Core Capabilities

Coding Assistance

Generates, edits, debugs, and infills code across multiple languages, excelling on HumanEval+, BigCodeBench, and SWE-bench style tasks.
STEM Reasoning

Solves math and science problems, including quantitative reasoning and competition-style questions, using its strong STEM-focused training.
Agentic Workflows

Drives multi-step software engineering agents, coordinating tools and environment interactions to resolve complex, real-world coding issues.
Tool and Function Use

Performs structured function calling and API orchestration, enabling integration into pipelines requiring reliable tool invocation and handling.
Long-Context Chat

Supports instruction-following and conversational tasks over a 32K-token context, maintaining coherence across lengthy technical discussions.

Use cases

6 Most Valuable Use Cases

Agentic coding assistant
Code generation automation
STEM problem solving
Math tutoring chatbot
Tool-using coding agents
Technical workflow orchestration

Transparent pricing

Cost Comparison

LLM API offers the lowest cost and fastest access for Rnj 1 Instruct–class models.

Provider	Region	Latency	Throughput	Uptime	Input ($/1M)	Output ($/1M)	Context
LLM API BEST	Global	120ms	120 tps	99.99%	$0.04	$0.08	128K
EssentialAI	US East	~220ms	~60 tps	~99.9%	~$0.10	~$0.20	~64K
OpenAI (gpt-4.1-mini equivalent)	Global	~250ms	~80 tps	~99.9%	~$0.15	~$0.60	~128K
Anthropic (Claude 3.5 Sonnet equivalent)	US East	~260ms	~50 tps	~99.9%	~$3.00	~$15.00	~200K

Performance benchmarks

Technical Specifications

Metric	Rnj 1 Instruct (EssentialAI)	GPT-4.1 Mini (OpenAI)	Claude 3.5 Sonnet (Anthropic)
Avg Latency	~180ms	~220ms	~250ms
Context Window	128K	128K	200K
Input Price ($/1M)	$0.20	$0.15	$3.00
Output Price ($/1M)	$0.60	$0.60	$15.00
Max Output Tokens	4K	4K	4K
Throughput	80 tps	60 tps	40 tps
Uptime	99.9%	99.9%	99.9%

30-day usage via LLM API

1.8B: Prompt tokens processed (last 30 days)
95M: Completion tokens generated (last 30 days)
6.4M: API requests served (last 30 days)
99.8%: Avg API uptime (last 30 days)

Start Using API

Architecture & Integration

Why Build on LLM.API?

One unified API. Every major model. Built-in reliability, cost control, and observability.

Unified AI Routing

Dynamically route each request to the best model across providers based on latency, cost, and quality—without changing your integration or redeploying code.
One endpoint, every model
Cost-Aware Controls

Set hard budgets, price caps, and routing rules to keep LLM spend predictable while automatically choosing cheaper equivalents when they meet your quality thresholds.
Optimize spend by default
Automatic Provider Fallback

Survive rate limits and outages with built-in failover to backup models and providers, keeping mission-critical workflows up without custom retry logic.
Resilient by design
End-to-End Observability

Get centralized traces, metrics, and logs for every request across providers, so you can debug prompts, compare models, and spot regressions in real time.
See every token
Task-Level Orchestration

Compose multi-step LLM tasks—tools, retrieval, workflows—behind a single API, letting LLM.API manage state, retries, and model selection for each step.
Workflows, not raw calls
High-Volume Batch Jobs

Run millions of inferences in parallel with server-side batching, concurrency control, and automatic chunking tuned for provider limits and throughput.
Scale to millions

Decision guide

When to Use — When NOT to Use

Use it if...

You need a cost-efficient, general-purpose instruct model for everyday app backends.
You need reliable English instruction-following for chatbots, agents, or support assistants.
Your use case involves moderate-length context summarization, rewriting, and content transformation tasks.
Your use case involves prototyping LLM-powered features without paying flagship-model prices.
You need a straightforward instruct model that behaves predictably with clear, explicit prompts.
Your use case involves batch-processing many small to medium text requests per minute.

Avoid if...

You need frontier-level reasoning and advanced tool-use comparable to the strongest commercial models.
You need state-of-the-art performance on complex coding, debugging, or large codebase navigation.
Your workload requires extremely long-context processing, such as entire books or codebases.
Your workload requires highly optimized, battle-tested ecosystem integrations and extensive third-party tooling.
You need cutting-edge multimodal capabilities like image understanding, generation, or audio handling.
Your workload requires rigorous, externally audited safety, compliance certifications, and regulatory guarantees.

FAQ

Frequently Asked Questions

What is Rnj 1 Instruct?

Rnj 1 Instruct is an instruction-tuned large language model by EssentialAI optimized for general-purpose text generation and code assistance.
What is the context window of Rnj 1 Instruct?

Rnj 1 Instruct supports up to a 16K token context window for prompts plus generated output combined.
What modalities does Rnj 1 Instruct support?

Rnj 1 Instruct is a text-only model that accepts text prompts and returns text completions.
How does Rnj 1 Instruct compare to similar instruction-tuned models?

Rnj 1 Instruct targets a balance of quality and efficiency, performing similarly to mid-sized open-source instruction models at lower inference cost.
What is the pricing for using Rnj 1 Instruct via LLM.API?

LLM.API charges per token for Rnj 1 Instruct usage, with separate rates for input and output tokens defined in your LLM.API pricing plan.
How fast is Rnj 1 Instruct in terms of latency?

Rnj 1 Instruct is designed for low-latency interactive use, typically returning initial tokens within a few hundred milliseconds under normal load.
How do I call Rnj 1 Instruct through LLM.API?

You invoke Rnj 1 Instruct by passing its model name to the LLM.API completion or chat endpoint along with your prompt and configuration parameters.
What is Rnj 1 Instruct best suited for?

Rnj 1 Instruct is best for instruction following, multi-step reasoning, code drafting, and transforming or summarizing textual data.
Does Rnj 1 Instruct support streaming responses on LLM.API?

Yes, you can enable streaming for Rnj 1 Instruct on LLM.API to receive tokens incrementally as they are generated.
What limitations should I be aware of when using Rnj 1 Instruct?

Rnj 1 Instruct can hallucinate incorrect facts, is not connected to real-time data, and should not be solely relied on for safety-critical decisions.

Start in 2 lines of code

Get My API Key

Rnj 1 Instruct

What is Rnj 1 Instruct?

5 Core Capabilities

Coding Assistance

STEM Reasoning

Agentic Workflows

Tool and Function Use

Long-Context Chat

6 Most Valuable Use Cases

Cost Comparison

Technical Specifications

Why Build on LLM.API?

Unified AI Routing

Cost-Aware Controls

Automatic Provider Fallback

End-to-End Observability

Task-Level Orchestration

High-Volume Batch Jobs

When to Use — When NOT to Use

Use it if...

Avoid if...

Start in 2 lines of code