Qwen3 VL 32B Instruct
Alibaba · Mid
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Input / 1M
$0.104
Output / 1M
$0.416
Cached input / 1M
—
Context window
131K
Where it sits
63rd-cheapest mid model
by blended $/Mtok among 262 listed mid models
68% below the mid median
blended $/Mtok across 262 mid models
Output costs 4× input
$0.104 in / $0.416 out per 1M
Held flat since launch (Jun 2026)
no blended price change recorded
Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.
Price history
Only one price on record so far — the history chart appears once a price changes.
Snapshots
| Effective | Input | Output | Cached in | Note | Source |
|---|---|---|---|---|---|
| 11 Jun 2026 | $0.104 | $0.416 | — | Imported from OpenRouter | openrouter.ai |