Undi95 Models API Cost Calculator & Comparison

Every Undi95 model, side by side — current API rates, context window, benchmarks, and a live calculator that ranks them at your exact workload. 1 active model, 1 with public pricing. Prices refreshed daily.

Models tracked

Active: 1

With public pricing: 1

Cheapest input: $0.45/1M

Calculate your Undi95 API cost at your workload.

Set your workload — every priced model ranks in real time.

Adjust the workload

Every model below updates in real time.

Conversations per month: 50,000 (slider stops: 1,000, 10,000, 50,000, 250,000, 1M, 10M)

Input length

Output length

Repeated questions?

Replies can wait a few hours (batch discount)
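As a rough illustration of what the calculator does with these inputs, here is a minimal sketch in Python. The rates come from the pricing table on this page; the function name `monthly_cost` and the per-conversation token counts are assumptions chosen for the example, not values the page provides, and the sketch ignores caching and batch discounts.

```python
# Rates for ReMM SLERP 13B from the pricing table, in USD per 1M tokens.
INPUT_PER_M = 0.45
OUTPUT_PER_M = 0.65


def monthly_cost(conversations, input_tokens, output_tokens):
    """Estimate a monthly bill in USD, assuming every conversation
    uses the same number of input and output tokens."""
    input_cost = conversations * input_tokens * INPUT_PER_M / 1_000_000
    output_cost = conversations * output_tokens * OUTPUT_PER_M / 1_000_000
    return input_cost + output_cost


# 50,000 conversations per month (the default above); the token counts
# of 1,000 in and 300 out are assumed values for illustration only.
print(f"${monthly_cost(50_000, 1_000, 300):.2f}")
```

Under those assumed token counts the estimate is $32.25 per month: $22.50 for 50M input tokens plus $9.75 for 15M output tokens.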

Ranked by your monthly bill


Pricing at a glance

Blended $/1M tokens across the lineup.

Blended price uses a 3-to-1 input/output ratio. In the chart, the green bar marks the cheapest model.
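One plausible reading of the 3-to-1 ratio is a weighted average with three parts input to one part output. The sketch below works under that assumption; the function name `blended_price` is hypothetical, and the site's exact formula is not stated on this page.

```python
def blended_price(input_per_m, output_per_m, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens, assuming the workload is
    input_parts input tokens for every output_parts output tokens."""
    total_parts = input_parts + output_parts
    weighted = input_parts * input_per_m + output_parts * output_per_m
    return weighted / total_parts


# ReMM SLERP 13B rates from the pricing table: $0.45 input, $0.65 output.
print(round(blended_price(0.45, 0.65), 4))
```

For ReMM SLERP 13B this gives (3 × 0.45 + 0.65) / 4 = $0.50 per 1M blended tokens.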

Quality vs price

Undi95 benchmarks at a glance.

No benchmark data yet for Undi95.

Open weights

Open Models from Undi95

Undi95 ships one open-weights model that you can self-host or fine-tune. It links to its Hugging Face card.

ReMM SLERP 13B (Open Weights): Undi95/ReMM-SLERP-L2-13B

Every model

Every Undi95 model — pricing, context & capabilities.

Model	Context	Input /1M	Output /1M	Cached /1M	Batch /1M	Capabilities
ReMM SLERP 13B	6K	$0.45	$0.65	$0.113	$0.225	—

FAQ

Frequently asked questions

Pricing patterns, best-known use cases, and how this provider stacks up.


Undi95 API pricing starts at $0.45 per 1M input tokens. Output tokens cost more than input tokens on every model. Prices are quoted per 1 million tokens (1M tokens ≈ 750,000 words). Use the calculator above to estimate your monthly spend at your actual workload.

ReMM SLERP 13B is the lowest-priced Undi95 model with public pricing at $0.45/1M input tokens. It suits high-volume tasks where cost matters most — classification, extraction, summarization, and similar workloads that don't need frontier reasoning.

With only one model in the lineup, ReMM SLERP 13B is also Undi95's highest-tier model, at $0.45/1M input and $0.65/1M output. No benchmark data is available yet to rate its reasoning or instruction-following, and the lineup has no cheaper tier to drop down to.

Yes — Undi95 supports prompt caching (discounts for repeated context) and batch processing (accept a delay, cut costs ~50%). These rates appear in the table above under "Cached /1M" and "Batch /1M." Caching pays off quickly if your prompts share a long system prompt or document prefix across many calls.
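To see how much the cached and batch rates can matter, here is a small sketch for the input side of the bill. The rates are taken from the table; how the provider actually combines caching with batch pricing is not stated on this page, so the function below (a hypothetical `input_cost`) simply assumes that cached tokens bill at the cached rate and the rest bill at either the standard or the batch rate. Output tokens are not modeled.

```python
# Input-side rates for ReMM SLERP 13B from the table, USD per 1M tokens.
INPUT_PER_M = 0.45
CACHED_PER_M = 0.113
BATCH_PER_M = 0.225


def input_cost(tokens, cached_share=0.0, batch=False):
    """Input cost in USD. Assumption: the cached_share fraction of tokens
    bills at the cached rate; the remainder bills at the batch rate when
    batch=True, otherwise at the standard input rate."""
    if not 0.0 <= cached_share <= 1.0:
        raise ValueError("cached_share must be between 0 and 1")
    base_rate = BATCH_PER_M if batch else INPUT_PER_M
    cached_tokens = tokens * cached_share
    fresh_tokens = tokens - cached_tokens
    return (cached_tokens * CACHED_PER_M + fresh_tokens * base_rate) / 1_000_000


tokens = 50_000_000  # assumed monthly input volume, for illustration
print(f"standard:   ${input_cost(tokens):.2f}")
print(f"60% cached: ${input_cost(tokens, cached_share=0.6):.2f}")
print(f"batch:      ${input_cost(tokens, batch=True):.2f}")
```

At the assumed 50M input tokens, the standard bill is $22.50, a 60% cache hit share brings it to $12.39, and batch processing brings it to $11.25.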

Undi95 has historically adjusted prices when launching new model generations, often cutting rates to stay competitive. Buzzi.ai snapshots pricing daily — you can subscribe to price-drop alerts on any Undi95 model using the "Alert me" button on its detail page.

Use the main comparison wizard to run the same calculator across Undi95, Anthropic, Google, Meta, Mistral, and 20+ other providers. Set your exact workload and get a ranked cost chart in under a minute.

Look wider

Compare Undi95 against other providers.

Open the full wizard — pick a use case, set your usage, and cross-compare against OpenAI, Anthropic, Google, and 20+ more.

