Ling-2.6-flash

inclusionAI · Mid

Compare →

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Input / 1M

$0.01

Output / 1M

$0.03

Cached input / 1M

$0.002

Context window

262K

Where it sits

Cheapest mid model

by blended $/Mtok among 262 listed mid models

97% below the mid median

blended $/Mtok across 262 mid models

Output costs 3× input

$0.01 in / $0.03 out per 1M

Cached input saves 80%

$0.002 vs $0.01 per 1M fresh

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

Effective Input Output Cached in Note Source
11 Jun 2026 $0.01 $0.03 $0.002 Imported from OpenRouter openrouter.ai

More from inclusionAI