ByteDance: UI-TARS 7B
ByteDance: UI-TARS 7B is a multimodal model for agent workflows and tool use. It combines multimodal input handling and reliable tool use and agent behavior with a 128K tokens context window and a low-cost profile. Use it for agent workflows, tool use, and orchestration when quality, speed, and cost matters.
Input
$0.10/1M
Output
$0.20/1M
Cached
$0.10/1M
Batch
$0.05/1M