GLM 4.5

Z.ai · Mid


GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

Input / 1M

$0.60

Output / 1M

$2.20

Cached input / 1M

$0.11

Context window

131K

Where it sits

176th-cheapest mid model

by blended $/Mtok among 262 listed mid models

75% above the mid median

blended $/Mtok across 262 mid models

Output costs 3.7× input

$0.60 in / $2.20 out per 1M

Cached input saves 82%

$0.11 cached vs $0.60 fresh per 1M

Held flat since launch (Jun 2026)

no blended price change recorded

Computed live from current prices and this model's history — not hand-written, so it stays accurate as prices move.
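
The ratio and savings figures above follow directly from the listed per-1M prices. A minimal sketch of that arithmetic is below; the function names and the `request_cost` helper are hypothetical and written only for illustration, not taken from this site's code. The blended $/Mtok figure is left out because the listing does not state its input/output weighting.

```python
# Listed prices in USD per 1M tokens, copied from the page above.
INPUT_PER_M = 0.60
OUTPUT_PER_M = 2.20
CACHED_INPUT_PER_M = 0.11


def output_to_input_ratio(input_price: float, output_price: float) -> float:
    """How many times more an output token costs than a fresh input token."""
    return output_price / input_price


def cached_savings_pct(input_price: float, cached_price: float) -> float:
    """Percentage saved when an input token is served from cache."""
    return (1 - cached_price / input_price) * 100


def request_cost(fresh_in: int, cached_in: int, out: int) -> float:
    """Hypothetical helper: USD cost of one request from its token counts."""
    return (
        fresh_in * INPUT_PER_M
        + cached_in * CACHED_INPUT_PER_M
        + out * OUTPUT_PER_M
    ) / 1_000_000


if __name__ == "__main__":
    print(round(output_to_input_ratio(INPUT_PER_M, OUTPUT_PER_M), 1))    # 3.7
    print(round(cached_savings_pct(INPUT_PER_M, CACHED_INPUT_PER_M)))    # 82
    # Example request: 100k fresh input tokens, no cache hits, 10k output tokens.
    print(f"${request_cost(100_000, 0, 10_000):.3f}")
```

For agent workloads that resend a long shared prefix on every call, moving tokens from `fresh_in` to `cached_in` in `request_cost` shows how much of the bill the cached rate removes.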

Price history

Only one price on record so far — the history chart appears once a price changes.

Snapshots

Effective    Input   Output  Cached input  Note                      Source
11 Jun 2026  $0.60   $2.20   $0.11         Imported from OpenRouter  openrouter.ai
