Powered by EssentialAI
Rnj 1 Instruct
- Text Generation
Rnj-1 Instruct is an 8B-parameter, instruction-tuned open-weight model from EssentialAI, optimized for code generation, STEM reasoning, and agentic tool-using workflows with a 32K context window.
About the model
What is Rnj 1 Instruct?
Rnj-1 Instruct is an 8B-parameter instruction-tuned language model from EssentialAI, trained from scratch and released as open weights under Apache 2.0. It is mainly used for software engineering assistants, autonomous coding agents, and complex multi-step tool-using workflows thanks to strong performance on benchmarks like SWE-bench Verified, BigCodeBench, and the Berkeley Function Calling Leaderboard. It is also used for math and scientific reasoning tasks, achieving competitive results on GSM8K, AIME 2025, and GPQA-style science benchmarks while remaining small enough for cost-efficient deployment. Rnj-1 Instruct belongs to the Rnj-1 family of 8B dense models, where Rnj-1 is the base model and newer variants like Rnj-1.5 Instruct extend its long-context and coding capabilities.
Model capabilities
5 Core Capabilities
-
Coding Assistance
Generates, edits, debugs, and infills code across multiple languages, excelling on HumanEval+, BigCodeBench, and SWE-bench style tasks.
-
STEM Reasoning
Solves math and science problems, including quantitative reasoning and competition-style questions, using its strong STEM-focused training.
-
Agentic Workflows
Drives multi-step software engineering agents, coordinating tools and environment interactions to resolve complex, real-world coding issues.
-
Tool and Function Use
Performs structured function calling and API orchestration, enabling integration into pipelines requiring reliable tool invocation and handling.
-
Long-Context Chat
Supports instruction-following and conversational tasks over a 32K-token context, maintaining coherence across lengthy technical discussions.
Use cases
6 Most Valuable Use Cases
- Agentic coding assistant
- Code generation automation
- STEM problem solving
- Math tutoring chatbot
- Tool-using coding agents
- Technical workflow orchestration
Transparent pricing
Cost Comparison
LLM API offers the lowest cost and fastest access for Rnj 1 Instruct–class models.
| Provider | Region | Latency | Throughput | Uptime | Input ($/1M) | Output ($/1M) | Context |
|---|---|---|---|---|---|---|---|
| LLM API BEST | Global | 120ms | 120 tps | 99.99% | $0.04 | $0.08 | 128K |
| EssentialAI | US East | ~220ms | ~60 tps | ~99.9% | ~$0.10 | ~$0.20 | ~64K |
| OpenAI (gpt-4.1-mini equivalent) | Global | ~250ms | ~80 tps | ~99.9% | ~$0.15 | ~$0.60 | ~128K |
| Anthropic (Claude 3.5 Sonnet equivalent) | US East | ~260ms | ~50 tps | ~99.9% | ~$3.00 | ~$15.00 | ~200K |
Performance benchmarks
Technical Specifications
| Metric | Rnj 1 Instruct (EssentialAI) | GPT-4.1 Mini (OpenAI) | Claude 3.5 Sonnet (Anthropic) |
|---|---|---|---|
| Avg Latency | ~180ms | ~220ms | ~250ms |
| Context Window | 128K | 128K | 200K |
| Input Price ($/1M) | $0.20 | $0.15 | $3.00 |
| Output Price ($/1M) | $0.60 | $0.60 | $15.00 |
| Max Output Tokens | 4K | 4K | 4K |
| Throughput | 80 tps | 60 tps | 40 tps |
| Uptime | 99.9% | 99.9% | 99.9% |
30-day usage via LLM API
- 1.8B
- Prompt tokens processed (last 30 days)
- 95M
- Completion tokens generated (last 30 days)
- 6.4M
- API requests served (last 30 days)
- 99.8%
- Avg API uptime (last 30 days)
Architecture & Integration
Why Build on LLM.API?
One unified API. Every major model. Built-in reliability, cost control, and observability.
-
Unified AI Routing
Dynamically route each request to the best model across providers based on latency, cost, and quality—without changing your integration or redeploying code.
One endpoint, every model -
Cost-Aware Controls
Set hard budgets, price caps, and routing rules to keep LLM spend predictable while automatically choosing cheaper equivalents when they meet your quality thresholds.
Optimize spend by default -
Automatic Provider Fallback
Survive rate limits and outages with built-in failover to backup models and providers, keeping mission-critical workflows up without custom retry logic.
Resilient by design -
End-to-End Observability
Get centralized traces, metrics, and logs for every request across providers, so you can debug prompts, compare models, and spot regressions in real time.
See every token -
Task-Level Orchestration
Compose multi-step LLM tasks—tools, retrieval, workflows—behind a single API, letting LLM.API manage state, retries, and model selection for each step.
Workflows, not raw calls -
High-Volume Batch Jobs
Run millions of inferences in parallel with server-side batching, concurrency control, and automatic chunking tuned for provider limits and throughput.
Scale to millions
Decision guide
When to Use — When NOT to Use
Use it if...
- You need a cost-efficient, general-purpose instruct model for everyday app backends.
- You need reliable English instruction-following for chatbots, agents, or support assistants.
- Your use case involves moderate-length context summarization, rewriting, and content transformation tasks.
- Your use case involves prototyping LLM-powered features without paying flagship-model prices.
- You need a straightforward instruct model that behaves predictably with clear, explicit prompts.
- Your use case involves batch-processing many small to medium text requests per minute.
Avoid if...
- You need frontier-level reasoning and advanced tool-use comparable to the strongest commercial models.
- You need state-of-the-art performance on complex coding, debugging, or large codebase navigation.
- Your workload requires extremely long-context processing, such as entire books or codebases.
- Your workload requires highly optimized, battle-tested ecosystem integrations and extensive third-party tooling.
- You need cutting-edge multimodal capabilities like image understanding, generation, or audio handling.
- Your workload requires rigorous, externally audited safety, compliance certifications, and regulatory guarantees.
FAQ
Frequently Asked Questions
-
What is Rnj 1 Instruct?
Rnj 1 Instruct is an instruction-tuned large language model by EssentialAI optimized for general-purpose text generation and code assistance.
-
What is the context window of Rnj 1 Instruct?
Rnj 1 Instruct supports up to a 16K token context window for prompts plus generated output combined.
-
What modalities does Rnj 1 Instruct support?
Rnj 1 Instruct is a text-only model that accepts text prompts and returns text completions.
-
How does Rnj 1 Instruct compare to similar instruction-tuned models?
Rnj 1 Instruct targets a balance of quality and efficiency, performing similarly to mid-sized open-source instruction models at lower inference cost.
-
What is the pricing for using Rnj 1 Instruct via LLM.API?
LLM.API charges per token for Rnj 1 Instruct usage, with separate rates for input and output tokens defined in your LLM.API pricing plan.
-
How fast is Rnj 1 Instruct in terms of latency?
Rnj 1 Instruct is designed for low-latency interactive use, typically returning initial tokens within a few hundred milliseconds under normal load.
-
How do I call Rnj 1 Instruct through LLM.API?
You invoke Rnj 1 Instruct by passing its model name to the LLM.API completion or chat endpoint along with your prompt and configuration parameters.
-
What is Rnj 1 Instruct best suited for?
Rnj 1 Instruct is best for instruction following, multi-step reasoning, code drafting, and transforming or summarizing textual data.
-
Does Rnj 1 Instruct support streaming responses on LLM.API?
Yes, you can enable streaming for Rnj 1 Instruct on LLM.API to receive tokens incrementally as they are generated.
-
What limitations should I be aware of when using Rnj 1 Instruct?
Rnj 1 Instruct can hallucinate incorrect facts, is not connected to real-time data, and should not be solely relied on for safety-critical decisions.
