NVIDIA: Llama 3.1 Nemotron 70B Instruct
NVIDIA: Llama 3.1 Nemotron 70B Instruct is a text model for general chat, analysis, and production use. It offers steady general-purpose performance, a 131K-token context window, and a balanced cost profile. Use it for workloads where quality, speed, and cost all matter.
Input: $1.20/1M tokens
Output: $1.20/1M tokens
Cached: $0.01/1M tokens
Batch: $0.06/1M tokens
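
To make the per-token prices listed above concrete, here is a minimal sketch of a per-request cost estimate. The function name `estimate_cost` and the billing rule are assumptions for illustration: it assumes cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate, which the listing does not confirm. The batch rate is left out because the listing does not say which tokens it applies to.

```python
# USD per 1M tokens, copied from the listing above.
PRICE_PER_MILLION = {"input": 1.20, "output": 1.20, "cached": 0.01}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of a single request.

    Assumption (not stated in the listing): cached_tokens is the portion of
    input_tokens served from cache, billed at the cached rate rather than
    the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    fresh_input_tokens = input_tokens - cached_tokens
    total = (
        fresh_input_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cached"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100K-token prompt with a 2K-token reply, nothing cached.
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

Under these assumptions, a prompt that fills most of the context window costs roughly the input rate times the prompt size, so long-context requests are dominated by input tokens rather than by the reply.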