Qwen3.5-Flash

Alibaba · Multimodal · Released Feb 2026

Qwen3.5-Flash is a lightweight text and vision model from Alibaba with a 1-million-token context window, positioned as a faster alternative to the mid-tier Qwen3.5 Plus and Qwen3.6 Plus.

Strengths: Low latency and efficient inference across both text and vision tasks without sacrificing broad capability support.
Best for: Real-time applications, streaming workloads, and vision tasks where speed and cost efficiency matter more than maximum accuracy.
Limitations: Smaller effective capacity than the Plus-tier models, so it trades some accuracy and reasoning depth for speed and efficiency.

Input / 1M: $0.065

Output / 1M: $0.26

Cached input / 1M: not listed

Context window: 1M tokens

Price history

Snapshots

Effective	Input	Output	Cached in	Note	Source
11 Jun 2026	$0.065	$0.26	—	Imported from OpenRouter	openrouter.ai
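
To show how the per-million rates translate into a per-request figure, here is a minimal sketch in Python. It assumes the input and output rates from the snapshot above still apply, ignores cached-input pricing (none is listed), and uses a hypothetical helper name, `estimate_cost`, that is not part of any Alibaba or OpenRouter API.

```python
# Rates in USD per 1M tokens, copied from the price snapshot above.
# These are assumptions for illustration; check the current price before relying on them.
INPUT_USD_PER_M = 0.065
OUTPUT_USD_PER_M = 0.26


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Cached-input discounts are not modelled because no cached rate is listed.
    """
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(100_000, 2_000):.5f}")  # prints $0.00702
```

At these rates output tokens cost four times as much as input tokens, so long generations affect the bill more than long prompts do.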

More from Alibaba

Qwen3.7 Flash

in $0.03 · out $0.13

Qwen3.7 Plus

in $0.32 · out $1.28

Qwen 3.7 Max

in $2.50 · out $7.50

Qwen3.5 Plus 2026-04-20

in $0.30 · out $1.80

Data updated Jul 28, 2026