Step 1 — Understand the three cost drivers before you touch a spreadsheet
Every OpenAI (and Anthropic, Google) invoice is the product of three numbers: **model price × token volume × call frequency**. Getting the mental model right before running any numbers prevents the most common budgeting mistakes. Most startup founders underestimate token volume, overestimate how often they need the premium model, and forget that output tokens typically cost 4-20x more per token than input tokens.
**Model price** is the per-million-token rate for a specific model variant. As of June 2026, the OpenAI GPT-5 family spans a 100x cost range from GPT-5.4-nano ($0.05/1M input) to GPT-5.5 ($5.00/1M input). Anthropic spans a 60x range from Claude Haiku 4.5 ($0.25/1M) to Claude Opus 4.8 ($15.00/1M). Google's Gemini 2.5 family spans a 17x range. Llama 3.x models hosted on providers like Together.ai or Fireworks AI are 3-10x cheaper than Anthropic's cheapest tier for similar quality benchmarks on narrow tasks. Choosing the right model for each task is the single highest-leverage budget decision you can make.
**Token volume** is harder to estimate pre-launch. A rough rule: one word ≈ 0.75 tokens in English. So a 500-word product description is ~375 tokens input; a 1,000-word blog draft is ~750 tokens output. Most production API calls involve a system prompt (50-500 tokens), user message (50-1,000 tokens), and model response (100-2,000 tokens). At volume, the system prompt repeated on every call becomes the dominant input-token cost — which is why prompt caching matters so much (see section 6).
**Call frequency** is your DAU × calls per active session × feature set. At 1,000 DAU with 5 calls per session at 1,000 tokens average per call, you're at 5M tokens per day = 150M tokens per month. At GPT-5.4-mini pricing ($0.15/$0.60 per 1M), that's roughly $22.50/month in input + $90/month in output = ~$112/month. Move to GPT-5.5 for the same volume and it's $750 + $4,500 = $5,250/month — a 47x difference. That delta is why model selection is the first conversation you need to have, not the last.