Qwen3 VL 8B Instruct

Alibaba · Mid


Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

Input / 1M

$0.08

Output / 1M

$0.5

Cached input / 1M

Context window

131K
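
As a rough illustration of how the per-1M prices above translate into the cost of a single request, here is a minimal sketch. The helper name `request_cost` and the token counts are hypothetical. Because no cached-input price is listed, the sketch assumes every input token is billed at the full input rate.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from this page).
INPUT_PER_M = Decimal("0.08")
OUTPUT_PER_M = Decimal("0.5")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes no cached-input discount, since none is listed.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PER_M
    return input_cost + output_cost


# Illustrative token counts, not measured usage.
print(request_cost(100_000, 20_000))
```

With these example counts the input side contributes $0.008 and the output side $0.010, for roughly $0.018 per request.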

Where it sits

64th-cheapest mid model

by blended $/Mtok among 262 listed mid models

68% below the mid median

blended $/Mtok across 262 mid models

Output costs 6.3× input

$0.08 in / $0.5 out per 1M
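
The 6.3× figure can be reproduced from the two listed prices. The sketch below assumes the displayed value is the exact ratio rounded half-up to one decimal place; the page does not state its rounding rule.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed prices in USD per 1M tokens (taken from this page).
input_per_m = Decimal("0.08")
output_per_m = Decimal("0.5")

# Exact output-to-input price ratio.
ratio = output_per_m / input_per_m

# Assumed display rounding: one decimal place, half-up.
rounded = ratio.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)

print(ratio, rounded)
```

The exact ratio is 6.25, which becomes 6.3 under half-up rounding. Binary floats with Python's built-in `round` can land on 6.2 instead, which is why the sketch uses `Decimal`.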

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

| Effective | Input | Output | Cached input | Note | Source |
|---|---|---|---|---|---|
| 11 Jun 2026 | $0.08 | $0.5 | | Imported from OpenRouter | openrouter.ai |
