OpenRouter: Elephant

Public pricingIntelligence 79/100Large memory

Elephant is a text model for general chat, analysis, and production use. It combines low latency and efficient inference with a 262K tokens context window and a free or zero-cost profile. Use it for general chat, analysis, and production workloads when latency, cost, and throughput matters.

Calculate your Elephant bill.

Set your workload โ€” see cost at your exact volume.

What would Elephant cost you?

Adjust the workload to see your monthly bill.

1,00010,00050,000250,0001M10M

Technical specifications

Elephant at a glance.

Memory

262,144

tokens

Max reply

32,768

tokens

Memory tier

Large

an entire book or large codebase

Tokenizer

โ€”

Released

Apr 2026

Training cutoff

โ€”

Availability

Public pricing

Status

active

What it can do

Capabilities & limits.

  • Understands images
  • Deep step-by-step thinking
  • Uses tools / calls functions
  • Strict JSON output
  • Streams replies
  • Fine-tunable on your data

When to pick Elephant

  • Long documents, full codebases, or extensive chat histories.
  • High-volume workloads where unit cost matters.

When to look elsewhere

  • Your workload involves images โ€” pick a vision-capable model instead.
  • You need tool-use / function calling for agent workflows.

FAQ

Elephant โ€” the questions we see most.

Pricing, capabilities, alternatives โ€” generated from the same data that powers the calculator above.

Get instant answers from our AI agent

Elephant has a 262,144-token context window (large memory โ€” an entire book or large codebase). That means you can fit about 49,152 words of input and history in a single call.
Elephant was released in April 2026.
Models in a similar class include Auto Router, Free Models Router, Gemma 4 26B A4B. The "Similar models" section below this FAQ links into each.

Still unsure?

Compare Elephant against 100+ other models.

Open the full wizard โ€” pick a use case, set your usage, and see side-by-side monthly costs in under a minute.