Skip to contentNew: Does ChatGPT recommend your brand? Free 60-second AI visibility check →
By The DDH Team · Digital Dashboard Hub

AI Data Residency by Provider 2026: Where OpenAI, Azure OpenAI, AWS Bedrock, Google Vertex AI, Anthropic, Cohere, Mistral, and Together AI Actually Run Your Inference

Eight providers, eight different residency stories. OpenAI direct serves inference from US data centers. Azure OpenAI puts the same models in 20-plus Azure regions including Frankfurt, Paris, and Zurich. AWS Bedrock varies region-by-region per model. Google Vertex AI offers region-locked plus multi-region endpoints. Anthropic added EU inference in 2024. Cohere serves US, EU, and Japan. Mistral runs FR-sovereign. Together AI is US-first. Sources cited inline, June 2026.

By DDH Research Team at Digital Dashboard HubUpdated

Procurement asks one question first in 2026: where will the prompt actually be processed. The answer is rarely on the vendor's marketing page. "EU data residency" can mean any of four different things — the customer account is billed from an EU entity, customer metadata is stored in an EU database, model weights are mirrored in an EU region, or the actual GPU that processes the prompt sits in an EU data center. Only the last one is what regulators, hospital CIOs, and German works councils actually mean. Before you sign anything, run your spend through the enterprise LLM compliance comparison and the AI tools GDPR-compliant 2026 guide so the residency story holds up under audit.

**OpenAI direct API** runs inference from US data centers and adds EU data residency for ChatGPT Enterprise and the API per https://openai.com/enterprise-privacy/. **Azure OpenAI** ships the same GPT-5, GPT-5-mini, and o-series models across Azure regions documented at https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models — with EU, UK, AU, JP, and Canada options. **AWS Bedrock** model availability is region-specific and tracked at https://aws.amazon.com/bedrock/ and the Bedrock region matrix. **Google Vertex AI** publishes its endpoint locations at https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations including a multi-region endpoint that explicitly stays inside the EU. **Anthropic direct** documents its supported regions at https://docs.anthropic.com/en/docs/about-claude/regions and now serves both US and EU inference. **Cohere** publishes its US, EU (Frankfurt), and Japan deployments at https://cohere.com/security. **Mistral** runs FR-sovereign infrastructure documented at https://mistral.ai/security. All numbers and region lists in this guide are sourced from vendor pages as of June 2026.

The rest of this article breaks down the difference between data-in-transit, at-rest, and in-use; what training-data residency actually controls versus inference residency; which sovereign cloud variants (AWS GovCloud, Azure Government, Google Sovereign Cloud) actually carry these models; and an opinionated decision matrix for the four most common buyer profiles — EU-only SaaS, US federal contractor, Japanese financial services, and Australian healthcare. We also link to the AI cloud region compliance matrix and to the zero-data-retention LLM options for the ZDR-specific procurement language you will need.

Digital Dashboard Hub

Compliance reviews ask for prompt receipts. DDH's Saved Prompt Library has them — every version, every branch, exportable to JSON. Built by indie operators who hate spreadsheet evidence too.

Start free 14-day trial — AICHAT30 = 30% off Pro for 3 months.

OpenAI direct, Azure OpenAI (EU), AWS Bedrock (EU), Google Vertex (EU), Anthropic direct, Mistral (FR) — residency matrix, June 2026

Feature
OpenAI direct
Azure OpenAI (EU)
AWS Bedrock (EU)
Google Vertex (EU)
Anthropic direct
Mistral (FR)
Inference regions advertisedUS (primary); EU residency for ChatGPT Enterprise + API per openai.com/enterprise-privacy20+ Azure regions globally; EU: France Central, Sweden Central, Switzerland North, West Europe (Netherlands), Germany West Central10+ commercial regions including Frankfurt (eu-central-1), Ireland (eu-west-1), Paris (eu-west-3), Stockholm (eu-north-1), London (eu-west-2)20+ Vertex locations; EU: europe-west1 (Belgium), europe-west4 (Netherlands), europe-west9 (Paris), europe-west3 (Frankfurt), plus multi-region 'eu'US and EU (Ireland) inference per docs.anthropic.com/en/docs/about-claude/regionsFR-sovereign infrastructure; Mistral La Plateforme + on-prem options
Default inference regionUSCustomer-selected at resource creation; no defaultCustomer-selected per request via region endpointCustomer-selected per request; global endpoint defaults to USUS (must explicitly request EU routing)FR
In-transit encryptionTLS 1.2+TLS 1.2+TLS 1.2+ (default to 1.3 in new SDKs)TLS 1.2+TLS 1.2+TLS 1.2+
At-rest encryptionAES-256 (per OpenAI enterprise privacy)AES-256 with platform-managed keys defaultAES-256 via AWS KMSAES-256 via Google-managed keys defaultAES-256AES-256
Customer-managed keys (BYOK / CMK)Not offered on direct APIYes — Azure Key Vault customer-managed keys for stored dataYes — AWS KMS CMK; also supported via Bedrock model invocation loggingYes — Cloud KMS CMEK on Vertex AINot offered on direct APIYes on enterprise tier (Mistral on-prem and dedicated)
In-use (confidential compute / TEE)Not offeredLimited — Azure Confidential Computing on select VM SKUs; not yet GA for Azure OpenAI inference itselfAWS Nitro Enclaves available on Bedrock-adjacent workloads; native Bedrock TEE inference is limitedConfidential VMs available on Vertex training; confidential inference is in preview for select modelsNot offered on direct API as of June 2026Available via Mistral on-prem deployments on confidential-VM hardware
Sovereign cloud variantAvailable through Azure OpenAI in Azure Government (US fed); no direct OpenAI sovereignAzure Government (US fed, IL5), Azure China (operated by 21Vianet), Microsoft Cloud for Sovereignty in EUAWS GovCloud (US) Bedrock availability is limited to specific models; AWS European Sovereign Cloud (Brandenburg) planned/launchingGoogle Sovereign Cloud partnerships (T-Systems in DE, Thales in FR, PSN in IT); Vertex AI availability varies by partnerNot offered; use Bedrock or Vertex for sovereign-cloud ClaudeFR sovereign by default; partnered with Bleu (Capgemini + Orange) for SecNumCloud-aligned hosting
EU-only routing guaranteedYes for ChatGPT Enterprise EU residency tier; API EU residency requires opt-in configurationYes — when resource is created in an EU region, inference + storage stay in that regionYes when calling the EU regional endpoint; cross-region inference must be explicitly disabledYes for region-locked endpoints and the 'eu' multi-region; global endpoint can route to USYes when EU routing is explicitly enabled on the workspaceYes — FR-default architecture
AU (Australia) inferenceNot offered as a direct residency optionYes — Australia East, Australia Southeast (model availability varies)Yes — ap-southeast-2 (Sydney) with limited model set; check Bedrock region matrixYes — australia-southeast1 (Sydney), australia-southeast2 (Melbourne)Not offered as a direct residency option; use Bedrock Sydney or Vertex SydneyNot offered as a direct residency option
JP (Japan) inferenceNot offered as a direct residency optionYes — Japan East (Tokyo), Japan West (Osaka)Yes — ap-northeast-1 (Tokyo) with limited model setYes — asia-northeast1 (Tokyo), asia-northeast2 (Osaka)Not offered as a direct residency option as of June 2026Not offered as a direct residency option
CA (Canada) inferenceNot offered as a direct residency optionYes — Canada East (Quebec City), Canada Central (Toronto)Yes — ca-central-1 (Montreal) with limited model setYes — northamerica-northeast1 (Montreal), northamerica-northeast2 (Toronto)Not offered as a direct residency optionNot offered as a direct residency option
Zero data retention (ZDR)Yes — opt-in for eligible API customers per openai.com/enterprise-privacyYes — abuse-monitoring opt-out available via Microsoft form for qualifying customersYes — Bedrock does not store prompt or completion data by default; CloudTrail logs metadata onlyYes — Vertex AI generative endpoints do not retain customer prompts by default; explicit opt-in required for cached loggingYes — zero retention is the default for direct API customers per anthropic.com/legalYes — Mistral La Plateforme defaults to no prompt retention for enterprise
Data-export / portability toolingWorkspace data export via admin portal; API has no persistent prompt store to exportStandard Azure data export (Storage, Cosmos DB) for logs you configure; model data is not storedBedrock model invocation logs export to S3 / CloudWatch if enabled by customerVertex AI logs to Cloud Logging / BigQuery if configured; prompts not stored by defaultConsole-based workspace export; no persistent prompt store on ZDR tierMistral console export; on-prem deployments are fully customer-controlled
Best fitUS-based teams wanting fastest model access and a single vendor relationshipMicrosoft-shop enterprises needing region-locked GPT in the EU, UK, AU, JP, or CAAWS-shop enterprises wanting multi-model choice (Claude, Llama, Mistral, Titan) under one IAMGoogle Cloud shops wanting Gemini plus third-party models with strong EU multi-region supportTeams using Claude as primary model that want vendor-direct billing + ZDR by defaultEU buyers with strict FR/EU-sovereignty requirements or open-weights flexibility

Sources as of June 2026 — verify directly: https://openai.com/enterprise-privacy/, https://docs.anthropic.com/en/docs/about-claude/regions, https://aws.amazon.com/bedrock/, https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models, https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations, https://cohere.com/security, https://mistral.ai/security. Region availability and model-by-region matrices change monthly — confirm in the vendor console and in your contract before any procurement decision.

What "data residency" actually means (and the marketing copy to ignore)

Procurement teams routinely conflate three things under the same phrase. **Billing residency** — your invoice comes from an EU entity, your contract is governed by Irish or Dutch law. This is the cheapest version to provide and the least meaningful to a regulator. **Storage residency** — customer metadata, account config, audit logs, and any cached prompts sit on disks physically located in a specified region. **Inference residency** — the GPU that runs the forward pass on your prompt sits in the specified region. Only the third matters for GDPR Article 44 transfer analysis, German BfDI guidance on US cloud providers, and FedRAMP boundary definitions.

**OpenAI direct** publishes its enterprise privacy commitments at https://openai.com/enterprise-privacy/ and now distinguishes between US default routing and the EU data residency option for ChatGPT Enterprise, ChatGPT Edu, and the API. Inference for the EU residency tier runs on EU-located infrastructure; without that opt-in, prompts and completions are processed in US data centers. The default is US, and most teams that signed standard OpenAI contracts before mid-2024 are still on US default routing whether they realize it or not.

**Azure OpenAI** does residency the simplest way: the resource you create is region-locked. A resource created in `swedencentral` runs inference in Sweden, stores associated metadata in Sweden, and returns errors if you accidentally call a global endpoint. Per https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models the per-region model availability matrix tells you which exact GPT, o-series, embedding, and image model you can call from each region — a moving target month-to-month.

**AWS Bedrock** also region-locks by endpoint — `bedrock.eu-central-1.amazonaws.com` runs inference in Frankfurt — but adds a twist with cross-region inference profiles introduced in 2024, where Bedrock can route requests across multiple regions for capacity. You have to explicitly disable cross-region routing if you need hard residency. Documentation at https://aws.amazon.com/bedrock/ and the AWS regional services list spell out which Claude, Llama, Mistral, and Titan models are GA in which Bedrock region.

**Google Vertex AI** offers three modes documented at https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations: region-locked endpoints (one specific region), multi-region endpoints (which stay within a continent — the `eu` multi-region stays in EU member states), and the global endpoint (which can route to any Google region including the US). For EU procurement, the `eu` multi-region is usually the right answer — it gets you capacity flexibility without crossing the Atlantic.

**Anthropic direct** documents its supported regions at https://docs.anthropic.com/en/docs/about-claude/regions and supports both US and EU inference for direct API customers. Workspaces are configured to a region at creation time and zero data retention is the default for direct API customers per https://www.anthropic.com/legal/commercial-terms. The practical upshot: Anthropic direct is now a credible EU vendor where it was US-only as recently as 2023.


In-transit vs at-rest vs in-use: the encryption story most vendors skip

Every vendor on this list uses TLS 1.2 or higher in transit and AES-256 at rest. That story is well-told. The story most enterprise buyers do not pressure-test is **in-use** encryption — what happens to your prompt while it is loaded into GPU memory during inference. By default, the prompt is plaintext in memory on the GPU. Anyone with administrative access to the hypervisor or host OS could, in principle, read it. For most workloads this is acceptable. For regulated health, finance, and government workloads it is increasingly not.

The remediation is **confidential computing** — running inference inside a hardware-attested Trusted Execution Environment (TEE) where memory is encrypted and the host operator cannot peek. As of June 2026, native confidential-compute inference is partial across the providers. **Azure** offers Azure Confidential Computing on AMD SEV-SNP and Intel TDX VMs, but Azure OpenAI inference itself does not yet GA on confidential VMs — verify status at https://learn.microsoft.com/en-us/azure/confidential-computing/. **AWS** Nitro Enclaves are mature but Bedrock native TEE inference is still limited; AWS Nitro System guarantees provide a different model of operator-isolation that some buyers accept in lieu of full TEE attestation.

**Google Vertex AI** has confidential VMs for training widely available and confidential inference in preview for select models per https://cloud.google.com/confidential-computing. The trajectory across all three hyperscalers is the same — confidential GPU inference is shipping incrementally on H100 confidential mode and Blackwell confidential mode hardware, but you cannot assume it for any specific model in any specific region today. Ask the vendor in writing.

**OpenAI direct** and **Anthropic direct** do not offer customer-controlled confidential-compute options on their public APIs as of June 2026. They run their own infrastructure and you trust their operator-isolation. For some buyers that is fine — the OpenAI SOC 2 Type II and Anthropic SOC 2 Type II reports cover those operator controls. For other buyers (financial services with prompt-injection-as-data-exfiltration threat models, defense contractors) it is not fine, and the answer is to use Azure OpenAI, AWS Bedrock, or Vertex with vendor-side confidential compute as it rolls out.

**Mistral** is the cleanest answer on confidential compute today — on-prem and dedicated deployments give you full physical control of the host hardware, and Mistral partnered with Bleu (the Capgemini + Orange joint venture) for SecNumCloud-aligned hosting in France. Per https://mistral.ai/security, the enterprise tier supports both customer-managed keys and customer-controlled deployment topology. This is the path for FR / EU sovereign workloads where the GDPR Article 44 transfer analysis cannot tolerate any US-headquartered processor.

Whatever your in-use story is, write it into the data processing agreement (DPA), not just the security one-pager. Encryption-in-transit, encryption-at-rest, KMS configuration, and confidential-compute commitments belong as contract obligations, not as marketing claims that change at the next platform release.


Training-data residency vs inference residency: do not conflate them

A common procurement mistake is asking "where is the model trained" when the question that matters is "where will my prompts and outputs be processed." Foundation models from OpenAI, Anthropic, Google, Meta, Mistral, and Cohere were trained on infrastructure the customer does not control and that has nothing to do with the residency of the customer's runtime data. Once a model exists as weights, it can be served from anywhere a GPU lives. Training residency does not constrain inference residency.

The relevant residency question for inference is therefore: which physical region runs the forward pass on my prompt, and is any version of my prompt or completion persisted anywhere outside that region. **Azure OpenAI** answers this cleanly per https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/data-privacy — for the abuse-monitoring opt-out tier, no prompt or completion data leaves your resource's region and none is persisted beyond the immediate inference call. **AWS Bedrock** answers similarly per https://aws.amazon.com/bedrock/ — prompts are not stored, completions are not stored, and model invocation logs only fire if the customer explicitly enables them.

**Google Vertex AI** documents its generative AI data governance at https://cloud.google.com/vertex-ai/generative-ai/docs/data-governance — prompts to Vertex Gemini and partner-model endpoints are not used to train models and are not persisted beyond the inference call when the customer has not opted into prompt caching or logging. The region commitment ties to the endpoint's region or multi-region.

**OpenAI direct** answers this in https://openai.com/enterprise-privacy/ — API inputs and outputs are not used to train models, and ZDR (zero data retention) is available for eligible customers. **Anthropic direct** answers it in https://www.anthropic.com/legal/commercial-terms — customer inputs and outputs are not used to train models and are not retained beyond the inference call on the direct API. **Cohere** answers it at https://cohere.com/security with similar commitments — and Cohere explicitly markets its data isolation as a differentiator, including the option to deploy in a customer's own cloud account through the Cohere Private Deployments program.

The thing none of the vendors will give you in writing is a guarantee that training data was geographically segregated by the user's jurisdiction. The training corpus is global. If your compliance framework requires that no training data ever originated outside a specific region, no commercial foundation model on this list satisfies that requirement and you should be looking at customer-trained or jurisdiction-trained open-weights models — Mistral, Llama, or a sovereign-cloud-hosted equivalent. **Mistral**'s European positioning makes this the cleanest fit for sovereign training-data requirements, though even Mistral does not publish a per-token geographic-origin manifest.

Practical advice: write the inference-residency commitment into the contract and write the training-data commitment as a separate clause. They are different facts and they need different language. Procurement teams that bundle them together end up with weaker contracts than teams that separate them.


Sovereign cloud variants: AWS GovCloud, Azure Government, Google Sovereign Cloud, Bleu

Sovereign cloud sits above ordinary region-locked commercial cloud. It adds operator nationality controls, separate authentication boundaries, and (in the US fed case) FedRAMP High and DoD IL4/IL5 accreditations. The catch with sovereign cloud and AI models is that model availability lags commercial regions by 6 to 18 months. The shiny new model you want is rarely available in the sovereign region you need.

**Azure Government** runs in dedicated US-fed regions with Azure OpenAI availability at https://learn.microsoft.com/en-us/azure/azure-government/documentation-government-cognitiveservices. As of June 2026, Azure Government has GPT-4-class models GA and newer GPT-5 models rolling in, lagging commercial Azure by a quarter or two. **Azure China** is operated by 21Vianet under separate Chinese data-protection law. **Microsoft Cloud for Sovereignty** layers sovereignty controls on standard Azure regions for EU public sector — verify OpenAI inclusion at https://www.microsoft.com/en-us/industry/sovereignty/cloud.

**AWS GovCloud (US)** runs Bedrock with a limited model subset per https://aws.amazon.com/govcloud-us/ — specific Claude, Llama, and Titan models, but not everything GA in commercial Bedrock. **AWS European Sovereign Cloud** is the dedicated EU sovereign instance with first region in Brandenburg, Germany, operated by AWS European personnel and EU-headquartered legal entities. Confirm Bedrock service inclusion and timeline at https://aws.amazon.com/blogs/security/.

**Google Sovereign Cloud** is structured as partner-operated cloud — T-Systems in Germany, Thales in France (S3NS), and PSN in Italy. These partners run Google Cloud services on EU-located infrastructure under EU operator control. Vertex AI availability varies by partner and by service — confirm at https://cloud.google.com/sovereign-cloud-and-trust-portal and ask the partner directly which Gemini and partner-model endpoints are live in the sovereign offering.

**Mistral** is in some sense the native sovereign answer for EU buyers — the company is French, the infrastructure is French, and the Bleu partnership with Capgemini and Orange targets SecNumCloud-aligned hosting. SecNumCloud is the French ANSSI certification that goes beyond standard ISO 27001 for sensitive public sector data. For French public sector and regulated industry, Mistral on Bleu is the cleanest single-vendor sovereign story available today.

What sovereign cloud cannot give you: the latest frontier model the moment it ships. If your buyer profile is "latest GPT-5 capabilities, EU sovereign, today," that combination does not exist as a single product in June 2026. Pick two of three: latest capability, EU sovereign, immediate availability. Most regulated buyers settle for a one-quarter-lag model in the sovereign region; some run a hybrid where exploratory work happens on commercial cloud and production on sovereign.


The four buyer profiles: which provider to actually pick

**Buyer profile one: EU-only SaaS startup with GDPR-conscious customers.** You want the simplest "customer data never leaves the EU" story. The cleanest answer is Azure OpenAI with a resource in `swedencentral` or `francecentral` plus abuse-monitoring opt-out. Per https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/data-privacy this gives region-locked inference, no prompt persistence, and the same GPT-5 family models. Second choice: Anthropic direct with EU routing, which gives Claude in the EU with ZDR by default per https://docs.anthropic.com/en/docs/about-claude/regions.

**Buyer profile two: US federal contractor or US healthcare with strict FedRAMP / HIPAA boundaries.** You want a sovereign-cloud answer with an audited compliance boundary. The default is Azure OpenAI in Azure Government for FedRAMP High and DoD IL4/IL5 workloads, or AWS Bedrock in AWS GovCloud (US) if you are already AWS-shop. For HIPAA-only workloads, commercial Azure OpenAI, commercial AWS Bedrock, and Google Vertex AI all sign HIPAA business associate agreements — verify the exact BAA language with vendor counsel.

**Buyer profile three: Japanese financial services or Japanese government.** You want Tokyo or Osaka inference under Japanese FSA expectations. Azure OpenAI in Japan East (Tokyo) or Japan West (Osaka) is the broadest model menu. AWS Bedrock in `ap-northeast-1` has a smaller model set but covers Claude and Llama. Google Vertex in `asia-northeast1` or `asia-northeast2` covers Gemini natively. OpenAI direct and Anthropic direct do not offer JP residency as a primary commitment as of June 2026 — you go through one of the three hyperscalers.

**Buyer profile four: Australian healthcare, financial services, or government.** You want Sydney or Melbourne inference under APRA CPS 234 and Australian Privacy Principles. Azure OpenAI in Australia East is the broadest. AWS Bedrock in `ap-southeast-2` and Google Vertex in `australia-southeast1` or `australia-southeast2` are the alternates. Same caveat as Japan — neither OpenAI direct nor Anthropic direct lists AU as a primary residency commitment, so you go through a hyperscaler.

**Cross-cutting note:** the vendor capability to weight more than any other is the per-region model availability matrix — it changes monthly. A region that does not have the specific GPT-5, Claude Sonnet 4.7, or Gemini 2.5 model you need is functionally useless even if residency is perfect. Bookmark the matrices at https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models, https://aws.amazon.com/bedrock/, and https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations and check before every model decision.

**Cohere** deserves a separate mention for the buyer who wants EU plus Japan natively from the model vendor rather than from a hyperscaler. Per https://cohere.com/security, Cohere offers US, EU (Frankfurt), and Japan deployments directly. If you are standardizing on Cohere Command and Embed for RAG plus retrieval, the multi-region native story is competitive with the hyperscaler-mediated alternatives and you avoid one layer of vendor relationship.


Procurement: what to ask and what to get in writing

The most useful single document in any AI vendor procurement is the data processing agreement (DPA). Get the latest version, get it reviewed by counsel, and verify three clauses. First: the precise list of regions where customer data — prompts, completions, embeddings, fine-tuning data, and metadata — may be processed and stored. Vague phrases like "OpenAI may process data in OpenAI's data centers" are not acceptable. Specify regions.

Second: the sub-processor list, with countries and the right to object to new sub-processors with reasonable notice. Every cloud-mediated AI service has a long sub-processor chain — the model vendor, the hyperscaler, any third-party safety filter, any logging or observability vendor. Audit the chain. Per https://openai.com/policies/subprocessor-list, https://www.anthropic.com/legal/subprocessors, https://aws.amazon.com/compliance/sub-processors/, https://www.microsoft.com/en-us/trust-center/privacy/data-processing-agreements, and https://cloud.google.com/terms/subprocessors, each vendor publishes a current list — those lists are the starting point, not the final word.

Third: the data residency commitment as a contractual obligation, not a marketing claim. Specifically, "Provider shall not process Customer Data outside of [region]" with named regions and a breach-notification clause. Vendors will push back on absolute language; settle on language that is specific enough to be enforceable. For Azure OpenAI, AWS Bedrock, and Google Vertex AI, the region-locked architecture makes this language easier to negotiate; for OpenAI direct and Anthropic direct, it requires explicit opt-in to a residency tier.

Ask for the latest **SOC 2 Type II** report (Type I is not enough — verify at vendor's trust center), the **ISO 27001** certificate with current validity date, the **ISO 27018** certificate where you handle EU personal data, **HIPAA BAA** if applicable, and **PCI DSS** attestation if you transmit cardholder data through prompts. For EU public sector, ask about **ENS** (Spain), **C5** (Germany), and **SecNumCloud** (France) where relevant. The vendor trust centers — https://trust.openai.com, https://trust.anthropic.com, https://aws.amazon.com/compliance/, https://servicetrust.microsoft.com, https://cloud.google.com/security/compliance/ — index the current set.

Verify **ZDR status** explicitly. Zero data retention is a contractual posture, not a checkbox. For OpenAI direct, ZDR requires customer eligibility approval and opt-in per https://openai.com/enterprise-privacy/. For Azure OpenAI, the analog is abuse-monitoring opt-out via a Microsoft form. For AWS Bedrock and Google Vertex AI, no-retention is closer to the default but customer-enabled logging will defeat it. For Anthropic direct, ZDR is the default for the API.

Finally, get the **incident notification** clause right. The DPA should commit the vendor to notify you within 72 hours of any confirmed data breach affecting your data — aligned with GDPR Article 33 timelines. Verify the notification channel (email to a security@ address you control, in-product banner, etc.) and the escalation contacts. The 72-hour clock is regulatory and you cannot meet it if the vendor does not meet a shorter clock.


Build vs. buy: when self-hosting open weights actually wins

For the strictest residency requirements — air-gapped government, classified workloads, certain financial-services data that contractually cannot transit any third-party cloud — the only credible answer is self-hosted open-weights models. Llama 3 and 4, Mistral Large 2 and the open Mixtral family, Qwen, and DeepSeek can be deployed entirely inside the customer's network on customer-owned GPUs. Download the model once; no prompt ever leaves the customer's data center.

Realistic cost of self-hosting in 2026: an 8x H100 or 8x H200 node lists at roughly $250,000 to $400,000 in capex per https://www.nvidia.com/en-us/data-center/h100/ and partner pricing, plus power, cooling, networking, and 24x7 ops headcount. For models above 70B parameters you typically need at least two nodes. Per-token economics only beat hyperscaler API pricing at 30 to 50 percent sustained GPU utilization across the year. Most teams that benchmark honestly find Bedrock or Vertex cheaper for their actual usage.

Where self-hosted open weights wins decisively: environments where the contract simply prohibits any third-party processing, regardless of region. Examples include EU national security agencies, US DoD classified workloads above IL5, and Swiss private banking. The choice is "self-host or do not use LLMs at all."

The hybrid pattern that increasingly works in 2026: use hyperscaler Bedrock or Azure OpenAI for the broad bulk of LLM workloads where the residency story is good-enough, and reserve self-hosted Mistral or Llama for the narrow set of workloads where regulation prohibits any vendor processing. This gets you the capability breadth of frontier models for 95 percent of workloads and the residency airtightness of self-hosted for the 5 percent that actually requires it. Use the RAG cost per query calculator and the vector DB cost per 1M embeddings calculator to model the per-workload economics honestly before you decide which side of the line each use case lands on.

**Cohere** sits interestingly in the middle of this build-vs-buy axis. Cohere Private Deployments per https://cohere.com/private-deployments deploy Cohere models inside the customer's own AWS, Azure, GCP, or Oracle Cloud account. The model weights stay under Cohere's commercial license but the inference runs entirely in customer-controlled infrastructure. For buyers who want frontier model quality with customer-controlled infrastructure — particularly common in EU financial services — this is a uniquely compelling middle path.

Whatever you decide, write the deployment model into a one-page architecture decision record (ADR) and circulate it through security and legal before procurement signs. The most expensive mistake in 2026 AI procurement is buying the wrong deployment model and being locked in for 24 months. The least expensive mistake is taking three extra weeks to verify the architecture against the actual compliance requirement.


The opinionated 2026 pick: what I would buy for each scenario

If I were CTO of a 200-employee EU SaaS company with German and French enterprise customers asking pointed GDPR questions, I would buy **Azure OpenAI** in `francecentral` or `swedencentral` with abuse-monitoring opt-out, plus **Anthropic direct** with EU routing. Azure gives me region-locked GPT-5 with a clean Microsoft DPA. Anthropic gives me Claude with EU inference and ZDR by default per https://docs.anthropic.com/en/docs/about-claude/regions. Two vendors, two clean residency stories.

If I were a US federal systems integrator working on a FedRAMP High program, I would buy **Azure OpenAI in Azure Government**, full stop. The accreditation paperwork is the value. Yes, the model set lags commercial Azure by a quarter or two; that lag is a feature, not a bug, because it means the FedRAMP boundary is stable. For models not yet in Azure Government, I would either wait or run a self-hosted Llama on GovCloud-equivalent infrastructure.

If I were the head of engineering at a Japanese megabank, I would buy **Azure OpenAI in Japan East** and **Google Vertex AI in `asia-northeast1`** as a two-vendor strategy. Azure for the GPT and o-series families, Vertex for Gemini and any partner-model needs. Both keep inference inside Japan under FSA-compatible operator controls. I would not route any prompts through OpenAI direct or Anthropic direct because neither offers JP residency as a primary commitment.

If I were running an Australian regulated workload, I would buy **AWS Bedrock in `ap-southeast-2`** because the AWS Sydney region has the deepest AU enterprise footprint and the Bedrock model menu in Sydney covers Claude, Llama, and Mistral. Azure Australia East is the credible alternate. Google Vertex in `australia-southeast1` works equally well if Gemini is the primary model. The pick is more about your existing hyperscaler relationship than about a residency differentiator at this point.

If I were a French defense or critical-infrastructure operator with strict ANSSI guidance, I would buy **Mistral on Bleu** and stop there. The vendor is French, the infrastructure is French, the operators are French, the legal entity is French. No GDPR Article 44 transfer analysis, no FISA 702 worry, no Schrems III hypothetical. Capability is slightly behind frontier OpenAI and Anthropic models, but the sovereignty story is the cleanest available in 2026.

The one thing I would not do in 2026 is route customer-regulated data through OpenAI direct on US default routing while telling my European customers that their data "stays in OpenAI's secure infrastructure." That sentence is not a residency commitment, and the moment a German works council, an Irish DPC investigation, or an enterprise customer's procurement team asks a pointed question, the answer will not survive. Pick a deployment that matches the residency commitment you are making, and write it into the contract.

How to pick the right AI residency architecture for your team

  1. 1

    Step 1: Write down the regulatory and contractual residency requirements that actually bind you

    Before you look at any vendor, write a one-page residency requirements document. List every regulatory framework you are subject to (GDPR, HIPAA, PIPEDA, APRA CPS 234, FSA Japan, FedRAMP, ITAR, EAR), every customer contractual residency commitment you have already made, and every internal policy that adds further restriction. For each, identify the specific clause that drives the requirement and the specific data categories it covers. Most teams discover at this step that their actual binding requirements are narrower than they assumed — only some workloads, only some data classes — and that materially expands their architecture options. Without this step, procurement defaults to the strictest interpretation and you pay sovereign-cloud prices for workloads that did not need it.

  2. 2

    Step 2: Pick the deployment model per workload, not per company

    Stop trying to choose a single provider for all AI workloads. Map each AI workload — customer-facing chatbot, internal coding assistant, RAG over engineering docs, marketing copy generation, financial analysis — against your residency requirements document from Step 1. For each, pick the cheapest deployment that satisfies the binding constraint. A marketing-copy workload almost certainly tolerates OpenAI direct in the US; a customer-data RAG workload over EU personal data probably needs Azure OpenAI in an EU region; a classified-government workload needs sovereign cloud or self-hosted. Document the deployment model per workload in an architecture decision record so the next engineer who joins does not relitigate. The enterprise LLM compliance comparison is useful here.

  3. 3

    Step 3: Verify per-model regional availability before you commit

    Open the regional model availability matrices for every vendor you have shortlisted — https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models for Azure OpenAI, https://aws.amazon.com/bedrock/ and the AWS regional services page for Bedrock, https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations for Vertex AI, https://docs.anthropic.com/en/docs/about-claude/regions for Anthropic direct, https://cohere.com/security for Cohere — and confirm that the exact models you need are GA in the exact regions you need. Models in preview do not count for production planning. Models in one region but not the other will force latency-sensitive cross-region calls. This step kills more shortlists than any other; doing it early saves weeks of contract negotiation on a deployment that cannot actually serve your workload.

  4. 4

    Step 4: Get the residency commitment into the DPA in writing, not in marketing copy

    When you negotiate the contract, push for three specific clauses in the data processing agreement. First, an enumerated list of regions where customer data may be processed and stored — vague language about "the provider's data centers" is unacceptable for any regulated workload. Second, a sub-processor list with countries plus a right to object to new sub-processors with reasonable notice. Third, a 72-hour breach notification clause aligned with GDPR Article 33. Vendors will resist absolute language; negotiate to specific-enough-to-enforce language. Get the latest SOC 2 Type II, ISO 27001, ISO 27018, HIPAA BAA (where applicable), and any sovereign-cloud-specific certifications (FedRAMP, IL5, C5, ENS, SecNumCloud) attached to the contract. The trust centers — https://trust.openai.com, https://trust.anthropic.com, https://servicetrust.microsoft.com, https://aws.amazon.com/compliance/, https://cloud.google.com/security/compliance/ — index the current state.

  5. 5

    Step 5: Build the monitoring to detect residency drift in production

    Residency is not a one-time procurement decision; it is an ongoing operational invariant that can drift the moment an engineer changes a config. Set up monitoring that flags any API call going to a non-approved region. For Azure OpenAI, enforce this with Azure Policy and Microsoft Defender for Cloud rules. For AWS Bedrock, use AWS Config rules and SCP guardrails at the AWS Organizations level to prevent calls to non-approved regions. For Google Vertex, use Organization Policy constraints on Vertex AI location. For OpenAI direct and Anthropic direct, instrument your SDK client to assert region in the request and alert on mismatches. Audit logs from the providers — OpenAI activity logs, Azure Monitor, AWS CloudTrail, Google Cloud Audit Logs — should feed your SIEM. The point is not to trust that the architecture stayed correct; the point is to know within minutes when it drifted.

Frequently Asked Questions

Does OpenAI direct API offer EU data residency for inference, or only US?

Both, but it requires explicit opt-in. Per https://openai.com/enterprise-privacy/, OpenAI offers EU data residency for ChatGPT Enterprise, ChatGPT Edu, and the API — when enabled, prompts and completions are processed on EU-located infrastructure rather than US. Without that opt-in, the default is US. Any contract signed before mid-2024 is almost certainly on US default routing, and any new EU-customer-facing deployment should explicitly request the EU residency option in writing. For workloads that need EU residency plus the broadest model menu, Azure OpenAI in a specific EU region per https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models is often the cleaner architecture.

What is the difference between Azure OpenAI's regions and OpenAI direct's regions?

Azure OpenAI runs the OpenAI models (GPT-5, GPT-5-mini, GPT-4o, o-series, embeddings, Whisper) inside Microsoft Azure data centers across 20-plus regions, including a deep EU footprint. OpenAI direct runs the same models from OpenAI's own infrastructure, which is primarily US with an opt-in EU residency tier. The contracts differ (Microsoft DPA vs. OpenAI DPA), the SLAs differ, and model availability lags slightly on Azure for the newest versions. Most enterprises pick based on existing hyperscaler relationship. Verify model-by-region availability at https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models before committing.

Can I get Claude in the EU with zero data retention?

Yes, two ways. **Anthropic direct** offers EU inference per https://docs.anthropic.com/en/docs/about-claude/regions, and ZDR is the default per https://www.anthropic.com/legal/commercial-terms. **AWS Bedrock** offers Claude in Frankfurt (eu-central-1), Paris (eu-west-3), and other EU regions per https://aws.amazon.com/bedrock/ — Bedrock does not retain prompts or completions by default, and logging only fires if you explicitly enable it. Pick Anthropic direct for vendor-direct billing and the simplest Claude-only architecture; pick Bedrock if you are already AWS-shop or want IAM-mediated access alongside Llama, Mistral, and Titan. Both give a defensible EU-residency-plus-ZDR posture.

Does AWS GovCloud actually have Bedrock?

Yes, but with a limited model menu compared to commercial AWS Bedrock. Bedrock in GovCloud (US) supports specific Anthropic Claude, Meta Llama, and Amazon Titan models as of June 2026 — verify the current list at https://aws.amazon.com/govcloud-us/. The model set lags commercial Bedrock by one to two quarters because each model must pass GovCloud's separate accreditation review. For FedRAMP High and DoD IL5, this is acceptable — boundary stability is the value. If you need a model not yet in GovCloud Bedrock, alternatives are Azure OpenAI in Azure Government, self-hosted open-weights on GovCloud GPU instances, or waiting for the model to clear accreditation.

How does Google Vertex AI's multi-region 'eu' endpoint actually work?

Per https://cloud.google.com/vertex-ai/generative-ai/docs/learn/locations, Vertex AI offers location-locked endpoints (one specific region), multi-region endpoints (a continent — the 'eu' multi-region stays inside EU member states), and the global endpoint (which can route anywhere including the US). The 'eu' multi-region is the right answer for most EU procurement: requests are routed to whichever EU region has capacity, and prompts and completions never leave EU member states. Per https://cloud.google.com/vertex-ai/generative-ai/docs/data-governance, prompts are not used to train models and are not persisted beyond the inference call. Avoid the global endpoint for any EU-residency-bound workload.

Is Mistral really a sovereign EU option or is that marketing?

It is meaningfully more sovereign than the alternatives, but verify the specific deployment topology. Mistral is a French legal entity with French infrastructure per https://mistral.ai/security, and the Bleu partnership (Capgemini and Orange) targets SecNumCloud-aligned hosting in France. For French public sector and regulated industry, this is the cleanest single-vendor sovereign answer in 2026 — no US-headquartered processor in the chain, no GDPR Article 44 transfer analysis. For open-weights models (Mistral 7B, Mixtral, Mistral Large 2 base), customers can self-host entirely inside their own infrastructure for maximum sovereignty. Capability is slightly behind frontier OpenAI and Anthropic models, which is the trade-off.

What does 'confidential computing' actually mean for LLM inference, and is it production-ready?

Confidential computing means running inference inside a hardware-attested Trusted Execution Environment (TEE) where GPU memory is encrypted and the host operator cannot read it. The relevant hardware is NVIDIA H100 confidential mode, H200, and Blackwell confidential mode; on CPUs, AMD SEV-SNP and Intel TDX. As of June 2026, confidential GPU inference is shipping incrementally across the hyperscalers — Azure Confidential Computing per https://learn.microsoft.com/en-us/azure/confidential-computing/, AWS Nitro Enclaves, and Google Confidential VMs per https://cloud.google.com/confidential-computing — but native confidential Azure OpenAI, Bedrock, or Vertex AI inference is not yet GA across all models in all regions. Ask the vendor in writing which specific model in which region runs on confidential-mode hardware.

How do I get zero data retention (ZDR) on each provider?

ZDR is a contractual posture, configured per provider. For **OpenAI direct**, ZDR requires eligibility approval and opt-in per https://openai.com/enterprise-privacy/. For **Azure OpenAI**, the equivalent is the abuse-monitoring opt-out via a Microsoft form per https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/data-privacy. For **AWS Bedrock**, no prompt or completion data is stored by default; logging only fires if explicitly enabled. For **Google Vertex AI**, prompts are not retained by default; cached logging requires opt-in. For **Anthropic direct**, ZDR is the default per https://www.anthropic.com/legal/commercial-terms. For **Cohere** and **Mistral**, enterprise tiers default to non-retention. Verify the current posture in writing as part of the DPA.

Is residency the same thing as data sovereignty?

No, and conflating the two leads to bad procurement. **Residency** means the data physically sits in a specific geographic region. **Sovereignty** is stronger — it adds that the operator entity is domiciled in the jurisdiction, is not subject to extraterritorial legal demands (notably the US CLOUD Act and FISA 702), and operator personnel are nationals of the jurisdiction. Standard Azure West Europe, AWS eu-central-1, and Google europe-west4 are residency, not sovereignty — the operators are US companies subject to US legal process even for EU-located data. True sovereignty requires Azure Government, Microsoft Cloud for Sovereignty, AWS European Sovereign Cloud, Google Sovereign Cloud partners, Mistral on Bleu, or fully self-hosted. Write the requirement as one or the other, not both interchangeably.

You now know where every major AI provider will actually run your prompts. Now make every prompt those providers run actually hit.

AI Prompt Generator builds production-ready system prompts that work across ChatGPT, Claude, Gemini, Azure OpenAI, AWS Bedrock, Google Vertex, Mistral, and every other compliance-conscious AI tool in this article — so your residency-locked deployments get sharper outputs, not generic AI fluff. Stop tweaking prompts by hand and start shipping prompts that drive measurable lift. 14-day free trial, no credit card required.

Browse all prompt tools →