UI-TARS 7B
Bytedance
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
What is UI-TARS 7B?
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
Specifications
- Developer: Bytedance
- Context window: 128K tokens
- Max output: 2.0K tokens
- Input modalities: image, text
- Output modalities: text
- Input price: $0.1000 per 1M tokens
- Output price: $0.2000 per 1M tokens
- Knowledge cutoff: 2025-01-31
- Supported parameters: frequency_penalty, logit_bias, max_tokens, presence_penalty, repetition_penalty, seed, stop, temperature, top_k, top_p
Use UI-TARS 7B with PromptLayer
PromptLayer lets teams manage, evaluate, and observe prompts that run on UI-TARS 7B alongside every other model in their stack. Version prompts, run evals across models, and ship safe rollouts from the same dashboard.
Ready to try it yourself? Sign up for PromptLayer and start managing your prompts in minutes.