GLM 4.7 Flash

Z.ai · Mid


As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Input / 1M

$0.06

Output / 1M

$0.4

Cached input / 1M

$0.01

Context window

202K
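To make the listed rates concrete, here is a minimal sketch of estimating the cost of a single request from the per-1M prices above. The function name `request_cost` and the assumption that cached tokens are a subset of the input tokens billed at the cached rate are illustrative, not something this page specifies; actual billing may differ by provider.

```python
from decimal import Decimal

# Per-1M-token prices in USD, copied from the listing above.
INPUT_PER_M = Decimal("0.06")
OUTPUT_PER_M = Decimal("0.4")
CACHED_INPUT_PER_M = Decimal("0.01")
MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption (hypothetical billing model): `cached_tokens` is the part of
    `input_tokens` served from cache and billed at the cached rate; the
    remainder is billed at the fresh input rate.
    """
    if output_tokens < 0 or not 0 <= cached_tokens <= input_tokens:
        raise ValueError("token counts must be non-negative and cached <= input")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / MILLION


if __name__ == "__main__":
    # 100K input tokens (half of them cached) and 10K output tokens.
    print(request_cost(100_000, 10_000, cached_tokens=50_000))
```

`Decimal` is used instead of floats so that small per-token amounts add up without rounding drift.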

Where it sits

47th-cheapest mid model

by blended $/Mtok among 262 listed mid models

75% below the mid median

blended $/Mtok across 262 mid models

Output costs 6.7× input

$0.06 in / $0.4 out per 1M

Cached input saves 83%

$0.01 vs $0.06 per 1M fresh
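The two derived figures above (output-to-input ratio and cached-input savings) follow directly from the listed prices. A small sketch of that arithmetic, with hypothetical helper names; how the page itself computes or rounds these values is an assumption:

```python
def output_to_input_ratio(input_price: float, output_price: float) -> float:
    """How many times more an output token costs than an input token."""
    return output_price / input_price


def cached_savings_pct(fresh_price: float, cached_price: float) -> float:
    """Percentage saved by a cached input token relative to a fresh one."""
    return (1 - cached_price / fresh_price) * 100


if __name__ == "__main__":
    # Per-1M prices from the listing: $0.06 input, $0.4 output, $0.01 cached input.
    ratio = output_to_input_ratio(0.06, 0.4)
    savings = cached_savings_pct(0.06, 0.01)
    print(f"Output costs {ratio:.1f}x input")
    print(f"Cached input saves {savings:.0f}%")
```

The blended $/Mtok figure is not reproduced here because the page does not state the input/output weighting it uses.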

Held flat since launch (Jun 2026)

no blended price change recorded

These figures are computed live from current prices and this model's history rather than written by hand, so they stay accurate as prices move.

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

Effective    | Input / 1M | Output / 1M | Cached input / 1M | Note                     | Source
11 Jun 2026  | $0.06      | $0.4        | $0.01             | Imported from OpenRouter | openrouter.ai
