What an AI feature is actually made of.
A feature is a chain of calls, each with a different job. The step that drives the bill and the step that needs the capable model are usually different ones.
The thing people picture as one model call is almost never one call. A support chatbot classifies the message, retrieves help-centre passages, then writes a reply. A coding agent plans, edits, runs tools, and re-checks across many steps. Each link in the chain has a different job, a different token shape, and a different right model.
Two of those steps matter most, and they pull in opposite directions. One step is the cost-driver step: where the tokens, and so the bill, concentrate. Another is the capable-model step: the one that actually needs a frontier model to get the job right. The common mistake is putting one frontier model across the whole chain because a single step needs it.
The fix is to read the chain step by step. Below are four common shapes, each rendered from the same data the guide uses. For each, the cost-driver step and the capable-model step are named from the chain itself.
Output costs 4.0× input, on average
Output costs more than input across every provider. Across 283 models the multiple ranges from 0.1× to 12.2×.
Live from the index — the per-1M spread every chain below is multiplied against.
That spread sets the stakes for the capable-model step. Today the cheapest frontier model on the index is Llama 4 Maverick, at $0.15 in / $0.6 out per 1M tokens. Put that rate on one step that needs it, not on every step that doesn't.
Here the cost-driver step and the capable-model step are the same one: generate reply. Spend there, keep the rest small.
Here the cost-driver step and the capable-model step are the same one: generate answer. Spend there, keep the rest small.
The cost-driver step is subagent search. The capable-model step is orchestrate / plan. They are different steps, so the capable model goes on orchestrate / plan and the rest stay small.
The bill concentrates on classify / extract. No step here needs a capable model, so a small model runs the whole chain.
The cost-driver step and the capable-model step are usually different.
Where this goes next
The shapes above are the skeleton. Two pages put numbers and choices on them.