What Sonnet 4.6 actually is (and why it's the default)
Sonnet 4.6 is Anthropic's production-tier model: not as expensive as Opus, not as small as Haiku, with the full Claude 4 feature set. Anthropic positions Sonnet as the right choice for 'most production workloads' and the data backs that up — across customer deployments, Sonnet handles the bulk of traffic with Opus reserved for the hard subset and Haiku reserved for the cheap subset.
Same architecture as Opus 4.7 (same Claude 4 base), trained on the same data with the same RLHF discipline, with a smaller model size that trades a few percentage points of quality on hard reasoning for a 5× price reduction. The feature surface is identical: tool use, parallel tool calls, extended thinking, prompt caching, vision input, structured outputs via tool schemas, the Batch API.
Sonnet's standout feature vs every other mid-tier model: the optional 1M-token context window. Opt-in via the `anthropic-beta: context-1m-2025-08-07` header. Anthropic doubles input price and adds 50% to output for the portion of a request that exceeds 200K tokens — but the long-context option lets Sonnet compete with Gemini 2.5 Pro on document-scale workloads. No other Claude tier exposes 1M.