Qwen3.5-Flash

Alibaba · Mid

Compare →

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Input / 1M

$0.065

Output / 1M

$0.26

Cached input / 1M

Context window

1M

Where it sits

32nd-cheapest mid model

by blended $/Mtok among 262 listed mid models

80% below the mid median

blended $/Mtok across 262 mid models

Output costs 4× input

$0.065 in / $0.26 out per 1M

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

Effective Input Output Cached in Note Source
11 Jun 2026 $0.065 $0.26 Imported from OpenRouter openrouter.ai

More from Alibaba