UI-TARS 7B

ByteDance · Mid


UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Developed by ByteDance, it builds on the UI-TARS framework with reinforcement...

Input / 1M

$0.1

Output / 1M

$0.2

Cached input / 1M

$0.1

Context window

128K

Where it sits

35th-cheapest mid model

by blended $/Mtok among 262 listed mid models

78% below the mid median

blended $/Mtok across 262 mid models
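The page does not say how the blended price is weighted, so the following is only a sketch of how such a figure could be derived and how a "78% below the median" statement relates a price to the median. The 3:1 input-to-output weighting and the helper names `blended_price` and `implied_median` are assumptions for illustration, not the site's actual method.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices in USD per 1M tokens.

    The default 3:1 weighting is an assumption; the page does not state its blend.
    """
    total_weight = input_weight + output_weight
    return (input_per_m * input_weight + output_per_m * output_weight) / total_weight


def implied_median(price: float, fraction_below: float) -> float:
    """Median implied by 'price is fraction_below under the median'.

    Rearranges price = median * (1 - fraction_below).
    """
    if not 0.0 <= fraction_below < 1.0:
        raise ValueError("fraction_below must be in [0, 1)")
    return price / (1.0 - fraction_below)


# Listed prices: $0.1 in / $0.2 out per 1M tokens; listed gap: 78% below the median.
blended = blended_price(0.1, 0.2)
median = implied_median(blended, 0.78)
print(f"blended (assumed 3:1): ${blended:.3f}/Mtok, implied median: ${median:.3f}/Mtok")
```

A different weighting changes both numbers, so the printed median is only as reliable as the assumed blend.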

Output costs 2× input

$0.1 in / $0.2 out per 1M
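As a rough illustration of what these per-token rates mean for a single request, the sketch below estimates a request's cost. The helper name `request_cost` and the example token counts are hypothetical; the rates are the ones listed on this page, and billing cached tokens as a subset of input tokens is an assumption about how the provider meters usage.

```python
# Rates in USD per 1M tokens, as listed on this page.
INPUT_PER_M = 0.1
OUTPUT_PER_M = 0.2
CACHED_INPUT_PER_M = 0.1  # equal to the uncached input rate, so no cache discount


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    cached_tokens is assumed to be the part of input_tokens served from cache.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached = input_tokens - cached_tokens
    total = (
        uncached * INPUT_PER_M
        + cached_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


# Example: 100,000 input tokens and 10,000 output tokens.
cost = request_cost(100_000, 10_000)
print(f"${cost:.4f}")  # prints $0.0120
```

Because the cached rate equals the input rate, the estimate is the same whether or not any input tokens are cached.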

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history rather than hand-written, so it stays accurate as prices move.

Price history

Only one price is on record so far; the history chart appears once a price changes.

Snapshots

Effective     Input   Output   Cached in   Note                       Source
11 Jun 2026   $0.1    $0.2     $0.1        Imported from OpenRouter   openrouter.ai