What gpt-5-mini actually is (vs gpt-5)
gpt-5-mini is a smaller, faster, cheaper variant of GPT-5 built on the same architecture and training pipeline. OpenAI does not publish parameter counts for either model, but in practice gpt-5-mini is roughly the GPT-5 family's equivalent of gpt-4o-mini in the GPT-4 family — same instruction-following discipline, weaker on hard reasoning, dramatically cheaper.
Crucially, the feature surface is identical: gpt-5-mini supports the same function calling, parallel tool calls, structured outputs (JSON Schema-validated), prompt caching, the Responses API endpoint, vision input, and reasoning-effort control. Anything you write for GPT-5 runs on gpt-5-mini with a single model-ID change. The difference shows up on tasks that need multi-step reasoning, complex code synthesis, or strict factual accuracy.
OpenAI's positioning: gpt-5-mini is the 'default for production' tier. Most chat assistants, structured-data extractors, classification pipelines, content scaffolders, and routing agents should live here. Reserve GPT-5 for the small fraction of traffic where cost-of-error genuinely dominates per-call cost.