Microsoft: WizardLM-2 8x22B
WizardLM-2 8x22B is a text model for general chat, analysis, and production use. It combines steady general-purpose performance with a 66K tokens context window and a balanced-cost profile. Use it for general chat, analysis, and production workloads when quality, speed, and cost matters. It is a practical choice for teams that need reliable output, flexible deployment, and room to scale.
Input
$0.62/1M
Output
$0.62/1M
Cached
$0.05/1M
Batch
$0.25/1M