Skip to contentNew: Does ChatGPT recommend your brand? Free 60-second AI visibility check →
Model card · Verified against Anthropic docs · 2026-06-20

Claude Opus 4.7: Full Spec Sheet (June 2026)

By The DDH Team at Digital Dashboard HubUpdated

Stop writing AI prompts from scratch.

Tell us your business + your task + your model. We write the prompt — perfectly tuned for ChatGPT, Claude, Grok, Gemini, Midjourney, or any model. Plus 500+ pre-built prompts in your library.

14 days, no card. Cancel in 2 clicks.

Claude Opus 4.7 is Anthropic's flagship model and the most expensive frontier model on the major provider menus. Released in early 2026 as the successor to Claude Opus 4 and 4.5, it raised the bar on long-context reasoning, agentic tool use, and refusal-calibration discipline that makes Claude the default choice for any task where 'don't hallucinate, don't lecture, don't refuse the wrong thing' is the bottleneck.

Headline numbers: $15 per 1M input tokens, $75 per 1M output, $1.50 per 1M for cached input reads (90% off), $18.75 per 1M for cache writes (25% premium on first-time write). Context window is 200,000 tokens. Max output is 64,000 tokens per response. Knowledge cutoff is early 2025. Modalities are text + vision input; text output only. Tool use, parallel tool calls, structured outputs (via tool-use schemas), extended thinking, prompt caching (90% off cached reads), and the Batch API (50% off) are all supported.

Below: full spec table, when Opus 4.7 is the right call vs Claude Sonnet 4.6 or GPT-5, a side-by-side against the other frontier tiers, the minimal API request, and 8 FAQs. Sibling pages: Claude Sonnet 4.6 spec sheet · GPT-5 spec sheet · Claude API cost calculator. Write an Opus-tuned prompt free with our ChatGPT prompt generator (Claude mode).

Digital Dashboard Hub

Writing good prompts for ONE AI is hard. Writing them for GPT-5, Claude, Gemini, Perplexity, Midjourney and 6 more is a full-time job. DDH's AI Prompt Builder writes once, runs everywhere — locked to your niche, voice, and brand tone.

Free 14 days, no card.

Claude Opus 4.7 — Full spec sheet (June 2026)

Feature
Opus 4.7 spec
ProviderAnthropic
Model ID (API)claude-opus-4-7
ReleasedQ1 2026
Input price (per 1M)$15.00
Cached input read (per 1M)$1.50 (90% off)
Cache write (per 1M, 5-min TTL)$18.75 (25% premium)
Cache write (per 1M, 1-hour TTL)$30.00 (2× premium)
Output price (per 1M)$75.00
Batch API discount50% off input + output
Context window200,000 tokens
Max output tokens64,000 tokens
Modalities (input)Text, image
Modalities (output)Text
Tool use
Parallel tool use
Structured outputs (via tool schemas)
Streaming
Prompt caching
Extended thinking (reasoning)
Vision (image understanding)
Knowledge cutoffEarly 2025
Endpoint/v1/messages

Sources verified 2026-06-20: Anthropic models documentation (https://docs.anthropic.com/en/docs/about-claude/models), Anthropic pricing page (https://www.anthropic.com/pricing), Anthropic prompt caching docs (https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching). Prices and limits change without notice — re-verify the live pages before budgeting.

What Claude Opus 4.7 actually is (and what makes it different)

Opus 4.7 is the flagship of Anthropic's 'Claude 4' generation, succeeding Opus 4.5 (mid-2025) and Opus 4 (early 2025). It is the high end of a three-tier menu (Opus, Sonnet, Haiku) that Anthropic has maintained since Claude 3 in 2024 — Opus for hardest tasks at highest price, Sonnet for the production sweet spot, Haiku for cost-sensitive volume.

What makes Opus 4.7 different from GPT-5: stronger long-form writing voice, more disciplined instruction following on complex multi-step prompts, more aggressive prompt caching (90% off the cached read portion, vs OpenAI's also-90% but with different cache eligibility rules), native XML-tag structuring as the canonical prompt format, and extended thinking — Anthropic's name for explicit, configurable chain-of-thought reasoning that you can budget in tokens per call.

Where GPT-5 collapsed reasoning into a single `reasoning_effort` parameter, Anthropic exposes 'extended thinking' as a separate `thinking` block in the API: you set a `budget_tokens` (e.g. 5,000) and Opus 4.7 burns up to that many thinking tokens before producing the visible answer. Like reasoning tokens on GPT-5, thinking tokens bill at the output rate.


Pricing math: what Opus 4.7 actually costs per call

Standard rates: `cost = (input_tokens / 1M) × $15 + (output_tokens / 1M) × $75`. The representative 1,000-in / 500-out call: `0.001 × $15 + 0.0005 × $75 = $0.015 + $0.0375 = $0.0525`. About 5¢ per call — roughly 8× GPT-5 on the same tokens.

The price gap closes dramatically with prompt caching. Anthropic's caching is explicit and surgical: you mark a `cache_control: { type: 'ephemeral' }` block in the messages array, Anthropic stores that prefix server-side (5-minute or 1-hour TTL), and subsequent calls within the TTL read it at 10% of the input price ($1.50/M instead of $15/M). The first-time write costs $18.75/M (5-min) or $30/M (1-hour) — a 25% or 100% premium on the first write, amortized over every subsequent read.

Worked: a 5,000-token cached system prompt + tools, hit 100× in an hour with 1-hour TTL. First write: $0.15. Next 99 reads at $1.50/M × 0.005M = $0.0075 each = $0.7425 total. Total: $0.8925 vs $7.50 uncached. 88% off the input bill. The structural cost discipline that makes Opus economical: cache-anchor everything stable, put dynamic context last in user messages.

Apply Batch API on async workloads: 50% off both streams. A non-cached 1,000-in / 500-out batched call drops from $0.0525 to $0.02625. Stack caching + batching on the cached portion of a heavy workload and Opus 4.7 becomes economically viable for serious scale. Worked dollars across the Claude family: Claude API cost calculator.


Context window: 200K standard, no 1M variant on Opus

Opus 4.7 ships with a 200,000-token context window. Unlike Claude Sonnet 4.6 (which supports an experimental 1M-token context via a beta header), Opus does not currently expose a 1M variant. For Anthropic users who need >200K, the path is Sonnet 4.6 with the 1M beta header, not Opus.

200K is enough to fit ~150,000 words (roughly 300 pages of single-spaced text), a moderate codebase chunk with metadata, or a multi-turn conversation history of ~50 turns at typical message lengths. It is smaller than GPT-5 (400K) and dramatically smaller than Gemini 2.5 Pro (1M).

Max output is 64,000 tokens per response — half of GPT-5's 128K ceiling. For long-form report generation, plan to chunk the output via multiple calls if you need >64K of generated content; Opus does not support arbitrary continuation within a single call.


Extended thinking: Opus 4.7's signature feature

Extended thinking lets you allocate a token budget for explicit chain-of-thought reasoning before Opus produces the visible answer. Configured via the `thinking` block in the API request: `thinking: { type: 'enabled', budget_tokens: 5000 }`. Opus uses up to 5,000 tokens of internal reasoning, then writes the answer. The thinking tokens are returned as a separate `thinking` content block (you can show, hide, or persist them) and bill at the output rate.

When to use extended thinking: complex code synthesis, math/proof tasks, multi-step planning with branching logic, legal/financial analysis where the reasoning chain is part of the deliverable, debugging tasks where you want to see where the model went wrong. Budget 3,000-10,000 tokens for hard tasks.

When NOT to use it: classification, extraction, summarization of structured input, anything mechanical. Adding thinking to a classification task adds cost with no quality improvement — Opus's instinct response on simple tasks is already calibrated.

Cost example: a 2,000-token problem with 5,000 thinking tokens + 800 visible output tokens bills `0.002 × $15 + 0.005 × $75 + 0.0008 × $75 = $0.03 + $0.375 + $0.06 = $0.465` per call. Use thinking deliberately; it is by far the highest-EV cost lever on Opus when applied to the right tasks.


Tool use, structured outputs, and the XML-tag convention

Opus 4.7 supports the full Anthropic tool-use API: define tools as JSON Schema in the `tools` parameter, Opus picks one (or several in parallel) and returns the arguments in a `tool_use` content block. Parallel tool use is on by default.

Structured outputs on Anthropic are achieved via the tool-use mechanism: define a tool whose input schema matches your desired output schema, force the model to call it with `tool_choice: { type: 'tool', name: 'extract_data' }`, and parse the tool-call arguments. The JSON Schema enforcement is reliable as of Opus 4.7 — invalid outputs are extremely rare on well-formed schemas.

Anthropic's prompt convention emphasizes XML tags for structuring complex instructions: `<task>...</task>`, `<context>...</context>`, `<example>...</example>`, `<output_format>...</output_format>`. The model is trained to attend to these reliably, and Anthropic's docs show every advanced pattern using XML. Prompts written for GPT-5 (typically markdown-headed or just paragraph instructions) work on Opus, but Opus consistently performs better when the structure is XML-tagged.


When to pick Opus 4.7 vs Sonnet 4.6 vs GPT-5

**Pick Opus 4.7** when quality on hard tasks is the bottleneck and budget is not — agentic workflows that need long-horizon planning, complex code synthesis with multi-file context, legal/financial analysis with strict accuracy requirements, long-form writing where Anthropic's voice is preferred, deep research synthesis. The 5× price premium over Sonnet is justified when downstream cost-of-error dominates per-call cost.

**Pick Sonnet 4.6** ($3 / $15 per 1M) for the production sweet spot: structured-data pipelines, content generation, routine chat, summarization, classification with high quality requirements, anything where Anthropic's discipline matters but Opus is overkill. Most production teams who default to Claude live on Sonnet, not Opus.

**Pick GPT-5** instead of Opus 4.7 when: you need the larger 400K context window, you need native structured-output enforcement at the API level (slightly more mature than Anthropic's tool-use-as-structured-output pattern), or you're already deep in the OpenAI tooling ecosystem (Responses API, built-in tools, Assistants).

Cross-tier head-to-head: GPT-5 vs Claude Opus 4.7 covers the side-by-side on benchmarks, voice, and total cost of ownership.


Verified sources and how to re-check the numbers

Every number on this page was verified against Anthropic's live documentation on 2026-06-20. Sources: docs.anthropic.com/en/docs/about-claude/models for context window, modalities, and feature support; anthropic.com/pricing for input/output/cached prices; docs.anthropic.com/en/docs/build-with-claude/prompt-caching for cache write/read mechanics and TTLs.

Anthropic publishes a changelog at docs.anthropic.com/en/release-notes that records pricing and model changes — more transparent than OpenAI's silent pricing-page updates. Subscribe or check monthly if your Opus bill is over $1,000/month.

Methodology: when a number could not be cross-confirmed against an official Anthropic page on the verification date, it was omitted from this card rather than guessed. If you find a discrepancy with the live page, treat the live page as canonical.

Make your first Opus 4.7 API call in 5 steps

  1. 1

    Get an Anthropic API key

    Sign in at console.anthropic.com → Settings → API Keys → Create Key. Copy to a `.env` file as `ANTHROPIC_API_KEY=...`. Anthropic requires a small credit purchase before your first API call goes live.

  2. 2

    Install the SDK

    Python: `pip install anthropic`. Node: `npm install @anthropic-ai/sdk`. The SDK supports all Claude 4 models, prompt caching, extended thinking, and tool use with no version-pinning beyond the latest stable release.

  3. 3

    Send a minimal call

    Python: `from anthropic import Anthropic; c = Anthropic(); r = c.messages.create(model='claude-opus-4-7', max_tokens=1024, messages=[{'role': 'user', 'content': 'Explain caching prefixes in one sentence.'}]); print(r.content[0].text)`. Note: `max_tokens` is required on the Anthropic API — there is no default.

  4. 4

    Add prompt caching to the system prompt

    Pass `system=[{'type': 'text', 'text': '<your-stable-instructions>', 'cache_control': {'type': 'ephemeral'}}]`. Anthropic caches that block for 5 minutes (or 1 hour with `'ttl': '1h'`). Subsequent calls within the TTL read at $1.50/M instead of $15/M.

    → Open the ChatGPT prompt generator (Claude mode)
  5. 5

    Add extended thinking for hard tasks

    For multi-step reasoning: pass `thinking={'type': 'enabled', 'budget_tokens': 5000}`. Opus will burn up to 5,000 internal reasoning tokens before producing the visible answer. The thinking block is returned separately — show, hide, or persist as needed.

Frequently Asked Questions

How much does Claude Opus 4.7 cost in 2026?

$15 per 1M input tokens, $75 per 1M output tokens, $1.50 per 1M for cached input reads (90% off). Cache writes cost $18.75/M (5-min TTL) or $30/M (1-hour TTL). Batch API takes 50% off both standard streams. A representative 1,000-in / 500-out call costs ~$0.0525 — about 8× GPT-5 on the same tokens. Source: anthropic.com/pricing, verified 2026-06-20.

What is Claude Opus 4.7's context window?

200,000 tokens. Unlike Claude Sonnet 4.6 (which supports an experimental 1M context via a beta header), Opus does not expose a 1M variant as of June 2026. For Anthropic users needing >200K, the path is Sonnet 4.6 with the 1M beta header, not Opus.

What is Opus 4.7's max output?

64,000 tokens per response — half of GPT-5's 128K output ceiling. For generated content longer than 64K, chunk via multiple calls; Opus does not support arbitrary continuation within a single response.

What is the difference between Claude Opus and Claude Sonnet?

Same context window (200K), same modalities (text + image input), same feature surface (tool use, prompt caching, extended thinking). The difference is model size, quality on hard tasks, and price. Opus 4.7 is $15/$75 per 1M; Sonnet 4.6 is $3/$15 — 5× cheaper. Use Sonnet as the production default; escalate to Opus for hard reasoning, agentic planning, or correctness-critical work. See our Claude Sonnet 4.6 spec sheet.

How does Anthropic's prompt caching work?

Explicit and surgical. Mark blocks in your messages array with `cache_control: {type: 'ephemeral'}` (default 5-min TTL) or `{type: 'ephemeral', ttl: '1h'}` (1-hour TTL). Anthropic stores that prefix server-side; subsequent calls within the TTL read at 10% of input price (90% off). First-time writes cost 25% more (5-min) or 100% more (1-hour) — amortized over every subsequent read. Single largest cost lever on Opus.

What is extended thinking on Opus 4.7?

Anthropic's name for explicit, budget-controlled chain-of-thought reasoning. Pass `thinking={'type': 'enabled', 'budget_tokens': 5000}` and Opus burns up to 5,000 internal reasoning tokens before producing the visible answer. Thinking tokens bill at the output rate. Use for complex reasoning, code synthesis, math; skip for classification/extraction.

Does Opus 4.7 support vision?

Yes — text + image input. Pass images as base64-encoded data or URLs inside the messages content array (`{type: 'image', source: {type: 'base64', media_type: 'image/png', data: '...'}}` or `{type: 'image', source: {type: 'url', url: 'https://...'}}`). Output is text only — no native image generation.

Where is Opus 4.7 available?

Anthropic API (console.anthropic.com), Amazon Bedrock, Google Cloud Vertex AI, and the Claude consumer apps (claude.ai Pro, Max, Team, Enterprise tiers). Bedrock and Vertex pricing matches Anthropic direct as of June 2026. API and consumer billing are separate — a claude.ai Pro subscription does not include API credit.

Opus is expensive. Make every token count.

Our AI Prompt Generator writes Opus-tuned prompts (XML-tagged, cache-anchored, extended-thinking-ready) based on YOUR business + task — so the $75/M output rate actually pencils. 14-day free trial of DDH Pro, no card.

Browse all prompt tools →