GLM 4.6

Z.ai · Mid


Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

Input / 1M

$0.43

Output / 1M

$1.74

Cached input / 1M

$0.08

Context window

202K
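
As a rough illustration of how the per-1M prices above translate into the cost of a single request, here is a minimal Python sketch. The function name `request_cost` and its parameters are hypothetical, and it assumes cached tokens are billed at the cached-input rate while the remaining input tokens are billed at the fresh rate; a provider's actual billing rules may differ.

```python
# Listed prices in USD per 1M tokens, copied from the figures above.
INPUT_PER_M = 0.43
OUTPUT_PER_M = 1.74
CACHED_INPUT_PER_M = 0.08


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    cached_tokens is the part of input_tokens assumed to be billed at the
    cached-input rate (an assumption of this sketch, not a documented rule).
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHED_INPUT_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K input tokens and 2K output tokens, first with no cache hits,
    # then with 80K of the input served from cache.
    print(f"${request_cost(100_000, 2_000):.5f}")
    print(f"${request_cost(100_000, 2_000, cached_tokens=80_000):.5f}")
```

Under these assumptions, a long prompt with a high cache-hit share costs well under half of the uncached equivalent, because input dominates the bill at this input-to-output mix.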

Where it sits

152nd-cheapest mid model

by blended $/Mtok among 262 listed mid models

33% above the mid median

blended $/Mtok across 262 mid models

Output costs 4× input

$0.43 in / $1.74 out per 1M

Cached input saves 81%

$0.08 vs $0.43 per 1M fresh

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.
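
The two price-ratio figures in this section can be reproduced from the listed prices alone. The sketch below is only an illustration of that arithmetic, not the site's own code, and the variable names are hypothetical. The rank and median figures cannot be recomputed this way, since they depend on the other 261 models' prices and on a blend weighting that the page does not state.

```python
# Listed prices in USD per 1M tokens, copied from the figures above.
INPUT_PER_M = 0.43
OUTPUT_PER_M = 1.74
CACHED_INPUT_PER_M = 0.08

# How many times more an output token costs than a fresh input token.
output_to_input = OUTPUT_PER_M / INPUT_PER_M

# Fraction saved when an input token is served from cache instead of fresh.
cache_saving = 1 - CACHED_INPUT_PER_M / INPUT_PER_M

print(f"Output costs {output_to_input:.0f}x input")
print(f"Cached input saves {cache_saving:.0%}")
```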

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

| Effective | Input | Output | Cached in | Note | Source |
| --- | --- | --- | --- | --- | --- |
| 11 Jun 2026 | $0.43 | $1.74 | $0.08 | Imported from OpenRouter | openrouter.ai |
