By The DDH Team · Digital Dashboard Hub

Cursor Pro Included Fast Requests (2026): What $20/mo Actually Buys

By The DDH Team at Digital Dashboard Hub·Updated June 21, 2026

Stop writing AI prompts from scratch.

Tell us your business + your task + your model. We write the prompt — perfectly tuned for ChatGPT, Claude, Grok, Gemini, Midjourney, or any model. Plus 500+ pre-built prompts in your library.

Cursor's $20/mo Pro plan is the most popular AI-coding subscription in 2026, but the included-quota mechanics are the single most misunderstood part of the pricing page. The plan does not give you 'unlimited Claude' or 'unlimited GPT-5.' It gives you a **monthly fast-request budget per premium model**, a **slow-request pool** that absorbs overflow at degraded latency, and a fleet of cheap models (Haiku 4.5, Gemini 2.5 Flash, GPT-5.4-mini) that are effectively unlimited fast. The difference between 'this plan is plenty' and 'I keep hitting limits by week 3' is whether you understand which model burns from which pool.

Across the dozens of Pro accounts we've audited through 2026, the same three failure patterns repeat. (1) A developer defaults to Opus 4.7 for every Composer task — burns through ~150 fast Opus requests in the first 10 days of the month and spends the next 20 days on slow-pool Opus with 60-90 second latencies. (2) A developer treats Composer like a single request — doesn't realize one Composer fan-out can spend 3-8 premium requests under the hood depending on how many tools the model invokes and how many edits it queues. (3) A developer ignores the unlimited-fast pool entirely (Haiku 4.5, Flash, gpt-5.4-mini) and overspends on premium models for tasks the small models would handle equally well.

This page is the canonical fast-pool reference. Per-model quotas at Pro vs Business, what happens at the slow-pool boundary, how Composer and Agent multipliers work, the unlimited-fast vs metered-fast distinction, and the live-verify checklist before you commit to Pro vs upgrade vs BYOK. For the side-by-side cost vs Copilot, see our Cursor vs Copilot cost calculator; for the 3-way Cursor vs Copilot vs Windsurf comparison, see /vs/copilot-vs-cursor-vs-windsurf.

Numbers below are indicative as of fetch date — **Cursor's fast-pool quotas have moved roughly every 60-90 days since 2024** as Cursor renegotiates upstream provider pricing and as new models launch. Treat per-model quotas as accurate to ±15%; verify your live budget at the Cursor dashboard usage page before budgeting. List source: cursor.com/pricing and cursor.com/docs/account/plans-and-usage, both fetched 2026-06-21.

Digital Dashboard Hub

Writing good prompts for ONE AI is hard. Writing them for GPT-5, Claude, Gemini, Perplexity, Midjourney and 6 more is a full-time job. DDH's AI Prompt Builder writes once, runs everywhere — locked to your niche, voice, and brand tone.

Free 14 days, no card. →

Cursor Pro included fast-request quotas — June 2026 (per-model, per-month)

Feature	Pro ($20/mo) fast requests	Business ($40/seat) fast requests	Pool type when exhausted
GPT-5.5 (flagship)	~500/mo	~750/mo	Slow pool
Claude Opus 4.7	~150/mo	~250/mo	Slow pool
Claude Sonnet 4.6	~500/mo	~750/mo	Slow pool
Gemini 2.5 Pro	~500/mo	~750/mo	Slow pool
Claude Haiku 4.5	Unlimited fast	Unlimited fast	n/a
Gemini 2.5 Flash	Unlimited fast	Unlimited fast	n/a
GPT-5.4-mini	Unlimited fast	Unlimited fast	n/a
Tab autocomplete (all models)	Unlimited	Unlimited	n/a

Source: cursor.com/pricing + cursor.com/docs/account/plans-and-usage, fetched 2026-06-21. 'Premium fast requests' is Cursor's term for the metered-fast pool; Cursor groups GPT-5.5, Opus 4.7, Sonnet 4.6, and Gemini 2.5 Pro as 'premium models.' Quotas have moved approximately quarterly since 2024 — treat numbers as accurate to ±15%. Verify your org's live budget at cursor.com/settings. Tab autocomplete (inline completion) is unlimited on all plans and does not burn fast-request budget regardless of model. Business plan ($40/seat) is sold per-seat with a 2-seat minimum and adds SSO, audit logs, and zero-data-retention; the higher fast-request quotas are the most-visible per-developer benefit but not the only one.

How Cursor's fast pool vs slow pool actually works

Cursor's request lifecycle is a two-stage pool system. Every request you send to a premium model (GPT-5.5, Opus 4.7, Sonnet 4.6, Gemini 2.5 Pro) first attempts to draw from your monthly **fast pool** — a per-model budget that Cursor has pre-paid for on the upstream provider's API. While you have fast budget, your request goes through Cursor's priority-routed inference path: typical first-token latency of 1-3 seconds, output streaming at the upstream model's full speed (GPT-5.5 at ~180 tokens/sec, Sonnet 4.6 at ~85 tokens/sec, Opus 4.7 at ~50 tokens/sec).

When your fast pool exhausts for a given model, requests automatically fall through to the **slow pool** — Cursor's shared backlog queue. Slow-pool requests still execute on the same upstream model (no silent downgrade in the default config), but they wait in a queue behind other users' overflow traffic. Typical first-token latency in the slow pool runs **30-90 seconds**, sometimes longer during peak hours. Streaming speed once the request starts is unchanged; the wait is entirely in the queue, not in inference.

An optional account-level setting — labeled 'Allow model fallback when slow' in the Cursor preferences as of mid-2026 — enables automatic downgrade. When that's on and your fast pool is exhausted, Cursor will substitute a cheaper unlimited-fast model (typically Haiku 4.5 for Opus/Sonnet requests, Flash for Gemini Pro requests, gpt-5.4-mini for GPT-5.5 requests) rather than queuing in the slow pool. This is faster but quietly changes which model is actually answering you. **The default is OFF** — most users on overflow are queued, not downgraded.

There is no 'overage billing' on the Pro plan. You cannot pay $1 extra to buy more fast requests once your monthly quota exhausts; the only way out is to wait for the slow pool, accept the auto-downgrade, switch to your own API key (BYOK mode, covered below), upgrade to Business, or wait for the calendar reset on your billing-period anniversary.

Fast-pool counters reset on your individual billing-period anniversary, not on the first of the calendar month. If you subscribed on the 14th, your fast pool refills on the 14th. The Cursor dashboard shows your current usage and reset date at cursor.com/settings.

What counts as one 'fast request' (and what doesn't)

Cursor counts a 'fast request' as one round-trip premium-model invocation initiated by you in the IDE. A chat message you type into the side panel that triggers a single model response counts as one. A Cmd+K inline edit that triggers one rewrite counts as one. A Composer task or an Agent task, by contrast, can fan out to multiple premium-model invocations under the hood — and **each underlying invocation counts separately against your fast pool**. The 'Composer multiplier' (covered in its own section below) is the single biggest reason users blow through their fast pool faster than the per-message intuition predicts.

**Tab autocomplete is unlimited and free** on all plans, including Pro. The inline-completion engine that suggests the next 1-30 tokens as you type does not burn from any pool — it runs on Cursor's proprietary completion model (separately tuned for low-latency single-line and multi-line suggestions). You can leave tab on all day and it does not affect your monthly fast-request budget. This is the model Cursor invested most heavily in differentiating, and the unlimited pricing reflects that strategic position.

**Cmd+L inline edits** count as one fast request per accepted edit, but only if you target a premium model. If your default Cmd+L model is Haiku 4.5 or Flash, the edit is free against your fast pool. Many Pro users set Cmd+L to Haiku 4.5 for routine small edits and only escalate to Sonnet/Opus when the edit is non-trivial — a major budget extender.

**Apply Edits** (the feature that takes a model-generated patch suggestion and writes it to the file) is free — it doesn't call the model again. The cost was paid when the model generated the patch. Similarly, **Cursor's auto-debug feature** (where Cursor watches your terminal and offers fixes) is free when it surfaces a suggestion; the cost is paid only if you accept and trigger a model-generated patch.

**Image inputs** in chat or Composer do count as a normal fast request but do *not* incur an extra multiplier on top — a single chat message with an attached screenshot counts as one fast request to that premium model, despite vision tokens being more expensive on the upstream provider. Cursor absorbs the vision premium on Pro/Business; you pay only the request count.

The Composer multiplier: one task, many premium requests

Composer is Cursor's multi-file orchestrated edit mode, and it is the single biggest reason users misforecast their fast-pool consumption. A Composer task is not one model call. It is an orchestrated loop where the model plans the edit, reads the relevant files (each read is a tool call to a premium model in many configurations), generates the patch, optionally re-reads to verify, optionally fixes detected issues, and finally emits the final apply.

Empirically, a typical Composer task fans out to **3-8 premium fast requests** depending on complexity. A simple single-file refactor might be 2-3 requests (read file, plan, generate patch). A multi-file feature add — say, adding a new API route plus its TypeScript types plus the matching client function — routinely hits 6-10 requests. A complex cross-cutting change like 'add proper error handling to all 12 API routes' can fan out to 20+ requests as Composer reads each file, generates patches, and verifies the result.

On Pro's ~150 Opus 4.7 fast requests per month, this math is brutal. If you average 5 premium requests per Composer task and use Composer 10 times per workday on Opus, you exhaust your monthly Opus fast budget in **3 working days**. By week 2 you are on the slow pool — 60-second waits per request, multiplied by 5 requests per Composer task = 5 minutes of waiting per task. That's where the 'Cursor got slow' complaints come from. Cursor didn't get slow. Your fast pool exhausted and you were queued.

The fix is structural. **Default Composer to Sonnet 4.6 or Gemini 2.5 Pro** (both have ~500 fast requests/mo on Pro — roughly 3x Opus's budget). Reserve Opus for genuinely hard architectural decisions where Sonnet's reasoning visibly falls short. On the ~50 Opus Composer tasks per month a Pro user can comfortably support, this is plenty of high-leverage Opus time. Don't burn Opus on tasks Sonnet would have nailed.

There is no per-Composer-task cap on fast-request consumption. A runaway Composer that decides to read 30 files will spend 30+ requests against your fast pool with no warning prompt before each one. If you start a Composer task and it begins reading more files than you expected, you can hit Esc to cancel — but Cursor has already spent the requests on files it read before the cancel landed. **Defensive pattern**: scope Composer tasks tightly. Tell it which files to touch, not 'fix this across the codebase.'

Agent mode and the autonomous-loop multiplier

Cursor Agent (formerly 'Background Agents,' rebranded mid-2026) is Cursor's autonomous-workflow mode where the model is given a task and operates with substantial autonomy — running its own bash commands, reading files it decides to read, iterating on test failures, and emitting a final summary when it considers the task done. Agent runs are billed against the same fast-request pool as Composer, but the multiplier is even more extreme.

A typical Cursor Agent run consumes **8-25 premium fast requests** depending on the task and how many tools the model decides to invoke. A simple test-failure fix might be 8-12 requests. A 'set up Stripe payments in this Next.js app' task can fan out to 30-50 requests as the agent reads docs, generates components, runs `npm install`, edits config, and verifies. We've seen single Agent runs consume 80+ fast requests when the model gets into a long debug loop.

**The implication**: Cursor Agent on Opus 4.7 is roughly a 6-task-per-month feature on the Pro plan if you don't manage scope. On Sonnet 4.6 it's a 20-30 task per month feature. On Gemini 2.5 Pro it's similar to Sonnet. If your workflow leans heavily on Agent mode, Pro's pool is the binding constraint and Business ($40/seat for ~50% more fast budget) starts to look reasonable. Above that, BYOK or Devin become the comparison points (covered in the cross-link section).

**Scope discipline matters even more for Agent than Composer.** A vague Agent prompt like 'improve the test coverage' invites the agent to read every test file, run the test suite multiple times, and iterate without clear stopping conditions. A tight Agent prompt like 'add unit tests for the three pure functions in lib/format.ts; do not modify other files; stop when all three have at least one happy-path and one edge-case test' typically completes in 6-10 requests instead of 25+.

Cursor displays a real-time request counter in the Agent panel as the run progresses. Watch it. If the counter is past 15 requests and the agent doesn't seem close to done, you can stop the run, accept what's done so far, and restart with tighter scope rather than letting it burn another 30 requests on diminishing-return iteration.

Per-model usage advice: which model burns from which pool

**Claude Opus 4.7** (~150 fast/mo on Pro). The tightest budget. Use Opus for: hard architectural decisions, multi-system reasoning, code that needs to handle 5+ edge cases correctly the first time, debugging a bug that has resisted Sonnet's first attempt. Do not use Opus for: small refactors, single-file edits, tests, type fixes, comment additions. The cost per Opus Composer fan-out (~5 requests) means every casual Opus task is 3.3% of your monthly budget.

**Claude Sonnet 4.6** (~500 fast/mo on Pro). The workhorse. Most Pro users should default Composer to Sonnet 4.6 and only switch to Opus or Gemini Pro for specific reasons. Sonnet handles 90%+ of typical coding workloads at quality indistinguishable from Opus for the kinds of edits Cursor is asked to make. 500/mo gives you ~100 Composer tasks per month at the 5x multiplier — plenty for daily use.

**GPT-5.5** (~500 fast/mo on Pro). Roughly tied with Sonnet 4.6 on most coding benchmarks as of mid-2026. Use GPT-5.5 when: you want a second opinion that disagrees with Claude's house style, when working in domains where OpenAI's training data is richer (some areas of Python ML, OpenAI SDK integrations, certain newer JS frameworks), or when your team's house default is GPT-flavored output. Same ~500 budget as Sonnet — interchangeable for budget purposes.

**Gemini 2.5 Pro** (~500 fast/mo on Pro). Strongest at long-context tasks — repository-scale understanding, large-file refactors, anything where the model needs to hold >100k tokens in working context. Cursor's Gemini integration supports the full 1M-2M context window. Use Gemini Pro when the task is bounded by 'the model can't see enough of the codebase' — that's where it visibly beats Sonnet and Opus.

**Haiku 4.5 / Gemini Flash / GPT-5.4-mini** (unlimited fast). Underused by most Pro users. Default Cmd+K to Haiku 4.5. Default tab-completion model selection to Cursor's proprietary engine (already free). Use Flash or mini for 'write me a quick utility function' style asks where the answer is obvious and you just want the keystrokes. Every task you route to the unlimited-fast pool is a task you didn't burn from your premium budget.

Pro vs Business ($40/seat): what the higher tier actually unlocks

Cursor Business is **$40/seat/month** with a 2-seat minimum. The headline-visible benefit is roughly 50% more fast-request budget per premium model (Opus ~250 instead of ~150, Sonnet/GPT-5.5/Gemini Pro ~750 instead of ~500). For an individual developer hitting fast-pool ceilings, Business is the most direct upgrade path before BYOK.

Beyond the quota delta, Business includes: **SSO/SAML** for org-managed accounts, **admin dashboard** with per-seat usage analytics, **audit logs** for compliance, **zero-data-retention mode** (your code is not retained by Cursor or upstream providers for training or improvement), **central billing**, and **org-level Composer/Agent enforcement policies** (e.g., disable Agent mode globally, lock model selection to specific defaults).

The math on whether Business is worth it for a solo dev: extra $20/mo for ~50% more fast budget = roughly $0.13 per added fast request on average. The same incremental request via BYOK on Anthropic API would cost roughly $0.05-0.20 depending on whether you're using Sonnet or Opus and depending on prompt size. Business is therefore a slight premium to BYOK on raw economics but adds the convenience of not managing a separate API key and includes the IDE-side priority routing.

The org-buyer math is different. For a 10-person team where 3 developers regularly exhaust Pro's fast pool and 7 don't, paying $40/seat × 10 ($400/mo) instead of $20/seat × 10 ($200/mo) means $200/mo to unblock the 3 heavy users — typically worth it on developer-hour math (one engineer's hour of slow-pool waiting per week is $100+ of fully-loaded cost).

**Cursor also offers BYOK mode on all plans (including Pro)**. You paste your own Anthropic, OpenAI, or Google API key in settings; requests using your key do not count against any Cursor pool. This is the right answer for developers whose monthly consumption far exceeds what Pro or Business include, who already have negotiated upstream-provider pricing, or who need specific model versions Cursor's pool doesn't expose. BYOK requests still get Cursor's IDE features (Composer, Agent, Tab) — they just bill through your API account directly.

BYOK on Cursor: when to bring your own API key

Bring-your-own-key mode is the third pool option after fast and slow. Configure it under Cursor → Settings → Models, paste an Anthropic, OpenAI, Google, xAI, or Groq API key, and select per-model whether that model should use your key (BYOK) or Cursor's pool (fast/slow). The setting is granular: you can keep Sonnet 4.6 on Cursor's pool and route only Opus 4.7 through your own Anthropic key, for example.

BYOK breakeven math. On Anthropic, an average Composer task with Opus 4.7 (say 8k input + 1k output tokens fan-out averaged across 5 requests) costs roughly $0.20-0.40 via your own API key. The same task on Cursor's fast pool 'costs' you 5 fast requests out of 150/mo, which on $20/mo Pro pricing prorates to about $0.67. **For Opus specifically, BYOK is cheaper per request than Cursor's allocated quota implies — but you forfeit the rest of the Pro plan's unlimited-tab and Sonnet/Flash/Haiku pool by going entirely BYOK.**

The dominant BYOK pattern in mid-2026 among Pro users who hit limits: **keep Pro for the fast pool on Sonnet/Gemini/Flash/Haiku/Tab, set BYOK on Opus 4.7 only**. This costs $20/mo for Pro + your direct Anthropic spend on Opus requests, gives you uncapped Opus access at retail Anthropic pricing, and preserves all the bundled benefits for non-Opus models. Effective monthly cost for a heavy Opus user lands around $40-80/mo total — comparable to Business ($40/seat) but with no upper bound on Opus usage.

**BYOK does not unlock the slow pool's queue priority.** A BYOK request bypasses Cursor's pool entirely and goes direct to the upstream provider — meaning your request inherits whatever rate-limit ceilings exist on your own Anthropic/OpenAI/Google account. If your Anthropic Tier 1 ITPM is 30,000, that's the ceiling you'll hit, not Cursor's negotiated wholesale capacity. For high-volume BYOK on Anthropic, run yourself up to Tier 3-4 ($200-400 in credit purchases) so the per-minute ceiling stops being the binding constraint.

**BYOK does not break Cursor's data-handling promises** — your code still goes through Cursor's relay infrastructure (which sees the request payload) and Cursor's prompt engineering layer (which adds the Composer/Agent system prompts on top of your model selection). If your concern is keeping code entirely on-prem, BYOK does not solve that; you need Business + Zero Data Retention or Cursor Self-Hosted Enterprise (separate SKU).

Live-verify checklist before committing to a plan

Before paying for Pro vs Business vs going BYOK, run this five-point check against your own usage. **(1)** Open cursor.com/settings and look at your last 30 days of usage broken down by model. If the largest line item is Opus 4.7 and you regularly exhaust before day 30, Business or Opus-BYOK is your upgrade path. If the largest line is Sonnet 4.6 and you don't exhaust, Pro is plenty.

**(2)** Check your Composer-to-chat-message ratio. If most of your usage is chat or Cmd+K (single-call patterns), your effective request count tracks your sent-message count. If most of your usage is Composer or Agent (multi-call patterns), multiply your task count by 5-8 to get your actual fast-request burn. The mismatch between intuition and reality is what catches users by surprise.

**(3)** Verify the per-model fast budget on cursor.com/pricing before you assume it matches this page. Cursor moves the numbers approximately every 60-90 days; the indicative quotas above are accurate to ±15% as of fetch date but the live page is authoritative.

**(4)** Test the unlimited-fast models on a representative slice of your workload for one day. If Haiku 4.5 handles 70%+ of the tasks you'd reflexively route to Sonnet, switching your defaults extends your premium budget by 3-4x at zero cost. Most users skip this test and keep paying for premium models on tasks that didn't need them.

**(5)** Decide your auto-fallback preference before you exhaust your pool, not after. Under Settings → Models, the 'Allow model fallback when slow' toggle controls whether overflow goes to slow-pool-with-original-model or auto-downgrade-to-cheaper-model. Default off (queue in slow pool). Most users prefer queue, but if your work is latency-critical, on (downgrade) avoids the 30-90s waits at the cost of silently changing which model answers.

Sourcing and the meta-rate: how often Cursor changes the numbers

Primary sourcing for this page: cursor.com/pricing (the plan-and-quota summary) and cursor.com/docs/account/plans-and-usage (the mechanics page covering fast vs slow pool, Composer multipliers, and BYOK setup), both fetched 2026-06-21. Per-model quotas are cross-referenced against the Cursor changelog at cursor.com/changelog and against public Cursor team statements on community.cursor.com.

Quota drift cadence. Cursor has updated the per-model fast-request quotas roughly every 60-90 days since mid-2024 — the longest interval between quota changes in the past 18 months was ~110 days; the shortest was 21 days when a new model launched mid-quarter. The pattern: when upstream providers cut prices (the Q1-Q2 2026 price war is the most recent example), Cursor passes some of the cut through by raising fast-request quotas roughly 30 days later, after observing how usage shifts.

**This means: any specific number on this page is good to ±15% for 60-90 days from fetch date.** If you're reading this page more than 3 months after the fetch date in the header, treat the quota numbers as indicative-of-magnitude-not-exact and re-verify on cursor.com/pricing before committing to volume planning. The mechanics — fast vs slow pool, Composer multiplier, BYOK behavior, Pro vs Business gap — change much less often and have held substantially the same since the Cursor 1.0 launch in October 2025.

**Why this page exists.** ChatGPT, Perplexity, and Gemini routinely surface the question 'how many requests does Cursor Pro include' without giving the per-model breakdown that determines actual behavior. The answers in their training data are often 12+ months out of date or aggregate-only. This page is the canonical, dated, sourced version — designed for AI engine citation when developers ask the question. If you found this page via an AI engine recommendation, the GEO citation mechanism is doing its job; verify the numbers against the live source before committing dollars.

Step-by-step: how to maximize your Cursor Pro fast pool

1
Default Composer to Sonnet 4.6, not Opus 4.7
Sonnet 4.6 has ~3x Opus's fast-request budget on Pro (~500 vs ~150). Sonnet handles 90%+ of the tasks Pro users reach for Opus on. Set Composer's default model under Settings → Models → Composer to Sonnet 4.6 and only switch to Opus when the task is genuinely hard (deep multi-system reasoning, debugging a bug Sonnet visibly missed, novel architectural decisions). This single change typically doubles a Pro user's effective premium budget.
2
Set Cmd+K to Haiku 4.5 for inline edits
Cmd+K inline edits are most often small targeted rewrites that don't need premium reasoning. Defaulting Cmd+K to Haiku 4.5 (unlimited fast) instead of Sonnet/Opus saves dozens of fast requests per day on the typical workflow. Reserve Cmd+K's premium-model selection (right-click in the prompt box) for the occasional edit that genuinely needs reasoning depth.
3
Scope Composer and Agent tasks tightly
Composer and Agent fan out to 3-8 and 8-25 premium requests per task respectively. A vague prompt invites the model to read more files than necessary. Tell it which files to read explicitly ('only touch lib/auth.ts and its test'), set a stopping condition ('stop once the test passes'), and watch the live request counter — if it crosses your expectation by 50%, cancel and restart with tighter scope.
4
Use Gemini 2.5 Pro for long-context tasks
Gemini 2.5 Pro on Cursor has access to its full 1M-2M token context window — wider than Claude Opus's effective working context. For repository-scale tasks (large refactors, understanding cross-cutting changes, anything where the model needs to hold 100k+ tokens), Gemini Pro typically does better than Opus despite Opus's reasoning edge. Same ~500/mo fast budget as Sonnet — interchangeable for budget purposes.
5
BYOK only Opus 4.7 if you're an Opus power user
If you regularly exhaust Pro's ~150 Opus fast requests/mo, the lowest-friction upgrade is to BYOK Opus only. Keep Pro for Sonnet/Gemini/Flash/Haiku/Tab on Cursor's pool; configure your Anthropic API key for Opus 4.7 specifically under Settings → Models. Effective cost lands around $40-80/mo total (Pro + Opus retail) and you get uncapped Opus access without losing any other bundled benefit. Run yourself to Anthropic Tier 3 ($200 in credits) first so per-minute ITPM doesn't bind.

Digital Dashboard Hub

The prompt patterns above work 10x better when they live in a library you actually own — tunable to your niche, exportable to GPT-5, Claude, Gemini, Perplexity, Midjourney, Llama. Stop pasting across 6 tools.

Try DDH's AI Prompt Builder — free 14 days, no card. →

Related calculators

OpenAI Pricing Calculator →GPT-5.5, 5.4, mini, nano — full per-call cost in one input.Claude Pricing Calculator →Opus 4.8, Sonnet 4.6, Haiku 4.5, Fable 5 — input + output combined.Context Window Comparison →Max input length and price per 1M for every current model.

Related prompt tools

Cursor vs Copilot cost calculator→Copilot vs Cursor vs Windsurf→Cursor vs Windsurf vs Cline→Cursor vs Copilot TCO 2026→Claude API rate limits 2026→

Frequently Asked Questions

How many included fast requests does Cursor Pro give you?

Cursor Pro ($20/mo) includes approximately 500 fast requests per month each on GPT-5.5, Claude Sonnet 4.6, and Gemini 2.5 Pro; approximately 150 fast requests per month on Claude Opus 4.7; and unlimited fast requests on Haiku 4.5, Gemini 2.5 Flash, and GPT-5.4-mini. Tab autocomplete is unlimited and free across all models. Numbers are accurate to ±15% as of June 2026 and have moved approximately every 60-90 days — verify live values at cursor.com/pricing.

What happens when my Cursor Pro fast pool exhausts?

Requests fall through to the slow pool by default — your request still runs on the same model you selected, but waits in a queue for typically 30-90 seconds (sometimes longer at peak) before first token. There is no overage billing. Alternatively, if you toggle on 'Allow model fallback when slow' under Settings → Models, Cursor auto-downgrades to an unlimited-fast model (Haiku for Opus/Sonnet, Flash for Gemini Pro, gpt-5.4-mini for GPT-5.5) instead of queuing.

Why does one Composer task burn 5+ fast requests?

Composer is a multi-step orchestrated loop, not a single model call. A typical Composer task fans out to 3-8 premium fast requests as the model reads files, plans the edit, generates the patch, optionally verifies, and emits the apply. Multi-file or complex tasks routinely hit 8-12 requests. This is the single biggest reason Pro users misforecast their fast-pool consumption. Scope tasks tightly and watch the live request counter in the Composer panel.

What's the difference between Cursor Pro ($20) and Business ($40/seat)?

Business gives you roughly 50% more fast-request budget per premium model (Opus ~250 vs ~150, Sonnet/GPT-5.5/Gemini Pro ~750 vs ~500), plus SSO/SAML, admin dashboard, audit logs, zero-data-retention mode, central billing, and org-level enforcement policies. The 2-seat minimum means Business is effectively $80/mo entry point. For a solo developer hitting Pro limits primarily on Opus, BYOK on Opus alone is typically cheaper than upgrading to Business.

Do Cursor Tab completions count against my fast request quota?

No. Tab autocomplete is unlimited on all Cursor plans including Pro and runs on Cursor's proprietary completion engine separately from the chat/Composer/Agent fast-request pool. You can leave Tab on all day and it does not affect your monthly budget. This is the most-used feature in Cursor and the one with the most aggressive pricing — the free-unlimited tier is unchanged since Cursor 1.0.

Can I bring my own Anthropic or OpenAI API key on Cursor Pro?

Yes. BYOK is available on all Cursor plans including Pro. Configure under Settings → Models — paste your Anthropic, OpenAI, Google, xAI, or Groq key, and choose per-model whether that model uses your key or Cursor's pool. Common pattern: keep Pro's pool for Sonnet/Gemini/Flash/Haiku/Tab, BYOK only Opus 4.7 for uncapped Opus access. BYOK requests bill through your API account directly — make sure your upstream-provider tier supports the per-minute throughput you need.

How often does Cursor change the fast-request quotas?

Roughly every 60-90 days since mid-2024. The longest interval between quota changes in the past 18 months was about 110 days; the shortest was 21 days when a new flagship model launched. The pattern: when upstream providers cut prices (the Q1-Q2 2026 price war is the most recent example), Cursor raises fast-request quotas roughly 30 days later. Treat any specific number as good to ±15% for 60-90 days from a dated source; re-verify on cursor.com/pricing for any budgeting that crosses a quarter boundary.

Is Cursor Pro worth $20 vs Copilot Pro at $10?

Depends on workflow. If you primarily use AI for inline completion and single-call chat, Copilot Pro at $10 is roughly equivalent value — both have unlimited completion and generous chat. If you use multi-file orchestration (Composer/Agent equivalents), Cursor Pro at $20 with its larger premium-model budgets and more polished orchestration is the better value. The full cost comparison including overage and Pro+/Max tiers is at /calc/cursor-vs-copilot-cost; the 3-way Cursor vs Copilot vs Windsurf comparison is at /vs/copilot-vs-cursor-vs-windsurf.

Cursor's pool sets the ceiling. Prompt quality decides whether you hit it efficiently.

Tight, model-tuned prompts cut Composer fan-out, reduce Agent loop iterations, and stretch your fast budget further than any plan upgrade. Our AI Prompt Generator writes Cursor-optimized prompts (front-loaded for cache, scoped for Composer, explicit stop conditions for Agent) tuned to your stack. 14-day free trial, no card.

Browse all prompt tools →