By The DDH Team · Digital Dashboard Hub

AI Content Moderation API Cost by Provider: OpenAI, Perspective, Azure, AWS, Hive, Sightengine — Real Prices, Real Trade-offs (2026)

Six moderation APIs, six different theories of how to keep user-generated content safe. OpenAI Moderation is free with the omni-moderation-latest model. Perspective API from Jigsaw is free under a QPS quota. Azure AI Content Safety bills per 1,000 records across text and image. AWS pairs Comprehend for text with Rekognition Content Moderation for images and video. Hive sells deep enterprise category coverage. Sightengine is the indie-friendly per-call option. Prices below are sourced from vendor pricing pages, June 2026.

By DDH Research Team at Digital Dashboard Hub·Updated June 21, 2026

Browse all 40+ free prompt tools

Trust and safety teams in 2026 are no longer asking whether to ship a moderation API — they are asking which one to ship and how the bill scales when traffic doubles overnight. The category is split into roughly three sub-markets: free or near-free policy APIs from the model providers themselves (OpenAI Moderation, Perspective API), per-record cloud APIs from the hyperscalers (Azure AI Content Safety, AWS Comprehend plus Rekognition, Google Cloud Natural Language plus Vertex AI Safety), and specialist vendors with deeper category coverage (Hive, Sightengine). Pick wrong and you either spend $0.001 per call on a model that misses NSFW images, or $0.10 per call on enterprise moderation you do not need. Before you commit, run your traffic profile through the OpenAI API cost calculator so the per-million-token math survives contact with reality.

**OpenAI Moderation** is the free omni-moderation-latest endpoint that catches text and image policy violations across thirteen categories at https://platform.openai.com/docs/guides/moderation. **Perspective API** is Jigsaw's free toxicity scorer used by The New York Times, The Wall Street Journal, and Reddit, documented at https://perspectiveapi.com/. **Azure AI Content Safety** prices text and image moderation per 1,000 records at https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/. **AWS Comprehend** handles text classification while **Rekognition Content Moderation** handles image and video, priced at https://aws.amazon.com/comprehend/pricing/. **Google Cloud Natural Language** moderation plus Vertex AI safety filters appear at https://cloud.google.com/natural-language/pricing. **Hive Moderation** quotes per-call pricing at https://hivemoderation.com/pricing and **Sightengine** publishes a transparent tiered model at https://sightengine.com/pricing. All prices in this guide come from those vendor pages as of June 2026.

The rest of this guide breaks down what each API actually classifies, where they fail, what they cost at real traffic volumes, and which one to ship for which moderation surface. You get an opinionated decision matrix, a five-step rollout plan, and answers to the nine questions your legal team will ask. We also compare the two free options head-to-head in OpenAI Moderation API vs Perspective API and cover the adjacent jailbreak detection space in LLM jailbreak prevention 2026.

Digital Dashboard Hub

Writing good prompts for ONE AI is hard. Writing them for GPT-5, Claude, Gemini, Perplexity, Midjourney and 6 more is a full-time job. DDH's AI Prompt Builder writes once, runs everywhere — locked to your niche, voice, and brand tone.

Free 14 days, no card. →

OpenAI Moderation, Perspective API, Azure AI Content Safety, AWS Comprehend / Rekognition, Hive Moderation, Sightengine — pricing + capability overview, June 2026

Feature	OpenAI Moderation	Perspective API (Jigsaw)	Azure AI Content Safety	AWS Comprehend / Rekognition	Hive Moderation	Sightengine
Text price (per 1M characters)	Free (omni-moderation-latest)	Free under QPS quota	~$0.38 per 1k text records (≈$0.76 per 1M chars at 500 chars/record)	~$0.50 per 1M chars (Comprehend toxicity classification)	~$1.00 per 1k text calls (custom enterprise pricing)	~$0.45 per 1M chars at Standard tier
Image price (per 1k images)	Free for omni-moderation-latest image inputs	Not supported (text-only)	~$1.00 per 1k images (Standard)	~$1.00 per 1k images (Rekognition Content Moderation, first 1M)	~$1.50 per 1k images (custom enterprise)	~$0.96 per 1k images at Standard tier
Video moderation	Not natively supported (frame-extract + image endpoint)	Not supported	Preview: ~$2 per minute analyzed	~$0.10 per minute (Rekognition Video Content Moderation)	~$0.08-0.15 per minute (custom)	~$0.05 per minute at Standard tier
Languages supported (text)	40+ across omni-moderation-latest	18 fully supported (en, es, fr, de, pt, it, ru, etc.)	100+ via Azure auto-translation routing	12 native for Comprehend toxicity (en, es, fr, de, it, pt, ja, ko, ar, zh, hi, plus auto-translate)	50+ via Hive's multilingual classifiers	English, Spanish, French, German, Italian, Portuguese, plus 5+ via plug-in models
Categories detected	13 (sexual, sexual/minors, harassment, hate, self-harm, violence, illicit, illicit/violent, plus image variants)	6 production (toxicity, severe toxicity, identity attack, insult, profanity, threat)	4 harm categories (hate, sexual, violence, self-harm) at 4 severity levels each + Prompt Shields + Groundedness	Comprehend: 7 toxicity labels; Rekognition: 35+ moderation labels including weapons, drugs, gambling	30+ specialized models (NSFW, weapons, gore, drugs, hate symbols, demographic, celebrity, OCR, deepfake)	30+ classifiers (nudity, weapons, drugs, hate, gore, QR codes, deepfake, AI-generated, scam links, faces)
Median latency (text, p50)	~150-300ms via api.openai.com	~200-400ms (depends on region routing)	~80-200ms via regional Azure endpoint	~120-250ms via regional AWS endpoint	~250-500ms (custom endpoint)	~200-400ms via Sightengine global edge
SLA (uptime guarantee)	No formal SLA on moderation endpoint (best-effort)	No formal SLA (research/public-good service)	99.9% Azure Cognitive Services standard SLA	99.9% AWS service SLA (Comprehend + Rekognition)	99.9% on enterprise contracts only	99.9% on Pro tier and above
Free tier / quota	Unlimited free for omni-moderation-latest (subject to TPM rate limits)	Free up to 1 QPS by default; higher via application	5,000 text + 5,000 image records/month free for first 12 months	AWS Free Tier: 50k units Comprehend, 5k images Rekognition for first 12 months	No public free tier — demo + sales contact required	500 free operations per month, renewing monthly
Data residency / regions	US-based by default; ZDR available for Enterprise customers	Google US/EU data centers; no EU-only commitment	30+ Azure regions including EU, UK, AU, JP, BR	20+ AWS regions including EU (Frankfurt, Ireland), AP (Tokyo, Sydney)	US-only by default; EU on enterprise contract	EU (France) and US options; GDPR-aligned default
Best fit	OpenAI-stack builders shipping LLM apps that need free policy guardrails	News comments, forums, civic platforms tuning for nuanced toxicity	Enterprises already on Azure shipping multi-modal moderation + Prompt Shields	AWS-native teams handling video moderation and S3-resident UGC	Marketplaces, dating apps, social platforms needing deepfake + custom labels	Indie devs, SaaS startups, marketplaces wanting per-call simplicity
Public customers	OpenAI itself, Replit, Snap, every ChatGPT API customer using moderation	The New York Times, The Wall Street Journal, Reddit, El País, Disqus	Microsoft Copilot products, Bumble, X (for select surfaces)	Zynga, Slack, Pinterest, Spotify, AWS-hosted UGC platforms broadly	Reddit, Giphy, Bumble, Vimeo, Kik, NBCUniversal	Vimeo, Yubo, Sketchfab, OnlineSBI, plus 5,000+ indie SaaS apps
Custom labels / fine-tuning	Not currently exposed for moderation endpoint	Not supported (fixed attribute set)	Custom categories in Preview (Content Safety Custom)	Comprehend Custom Classification ~$3/hr training + inference	Custom model training included on enterprise contract	Custom thresholds + plug-in workflows; no model fine-tuning

Sources as of June 2026 — verify before procurement: https://platform.openai.com/docs/guides/moderation, https://perspectiveapi.com/, https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/, https://aws.amazon.com/comprehend/pricing/, https://aws.amazon.com/rekognition/pricing/, https://cloud.google.com/natural-language/pricing, https://hivemoderation.com/pricing, https://sightengine.com/pricing. API pricing changes frequently — confirm in writing before any production migration.

What each moderation API actually classifies (and the marketing copy to ignore)

**OpenAI Moderation** is the omni-moderation-latest endpoint at https://platform.openai.com/docs/guides/moderation. It accepts text and image inputs, returns a flag plus per-category scores across thirteen categories (sexual, sexual/minors, harassment, hate, illicit, self-harm, violence and their variants), and is free under standard OpenAI rate limits. The marketing claim to ignore: it does not detect spam, scam links, deepfakes, weapons brand names, or AI-generated images. It is a baseline LLM-policy filter, not a full trust and safety stack.

**Perspective API** from Jigsaw at https://perspectiveapi.com/ is the most-cited toxicity API on the planet, used by The New York Times comments, The Wall Street Journal, and Reddit's automod. It returns 0-1 scores across six production attributes (toxicity, severe toxicity, identity attack, insult, profanity, threat). Text-only, free under a 1 QPS quota, with higher quotas via dashboard application. 'Identity attack' was retrained in 2024 to reduce demographic bias but still over-flags reclaimed slurs in LGBTQ and Black English contexts. Tune thresholds per surface, not globally.

**Azure AI Content Safety** at https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/ covers text and image across four harm categories (hate, sexual, violence, self-harm) at four severity levels each, plus a separate Prompt Shields endpoint for jailbreak detection and a Groundedness Detection endpoint for hallucination flagging. Pricing runs roughly $0.38 per 1k text records and $1.00 per 1k images at Standard tier. Azure also ships Custom Categories in Preview — not GA in every region, so do not bet a launch on Preview SKUs.

**AWS Comprehend Toxicity Detection** at https://aws.amazon.com/comprehend/pricing/ handles text moderation across seven labels (graphic, harassment_or_abuse, hate_speech, insult, profanity, sexual, violence_or_threat) at roughly $0.0005 per 100 characters. For images and video, **Rekognition Content Moderation** at https://aws.amazon.com/rekognition/pricing/ bills roughly $0.001 per image and $0.10 per minute of video in the first million units. Rekognition's celebrity recognition feature was deprecated in 2023 for several use cases — verify which sub-features are GA in your region.

**Google Cloud Natural Language** moderation at https://cloud.google.com/natural-language/pricing offers a moderateText endpoint at roughly $1 per 1,000 calls, plus Vertex AI safety filters baked into every Gemini API call for free. The DLP API handles PII redaction at roughly $1 per 1,000 records. Coverage is narrower than Azure or AWS — best as an add-on for existing Vertex AI customers, not a primary choice.

**Hive Moderation** at https://hivemoderation.com/pricing and **Sightengine** at https://sightengine.com/pricing are the specialists. Hive sells 30+ classifier models including deepfake detection, weapons brand recognition, OCR-aware moderation, and demographic estimation — custom pricing lands roughly $1-2 per 1,000 calls. Sightengine ships 30+ classifiers including deepfake, AI-generated content, and scam-link analysis on a transparent tier ladder. Both win when your needs go past the four standard harm categories.

Architecture: how each moderation layer plugs into your stack

**OpenAI Moderation** is a single REST endpoint at api.openai.com/v1/moderations — POST text or an image, get a flag plus category scores in 150-300ms. Because it is free, the standard pattern is to call it as a pre-filter on every LLM input and a post-filter on every LLM output. The Cookbook at https://cookbook.openai.com/examples/how_to_use_moderation shows the canonical wrapper. The architectural risk: if api.openai.com has an outage, moderation fails open or closed depending on your wrapper. Wire a circuit breaker.

**Perspective API** is similarly a single REST endpoint at commentanalyzer.googleapis.com with a documented client library at https://developers.perspectiveapi.com/. Quota lives on a Google Cloud project, with higher-than-1-QPS access via dashboard request — most production deployments land between 10 and 100 QPS. The right architecture is to route inbound comments through Perspective before they hit your database, then store scores as columns so moderators can re-rank queues without re-calling.

**Azure AI Content Safety** lives behind a Cognitive Services resource and supports regional endpoints (eastus, westeurope, etc.) plus a Studio UI at https://contentsafety.cognitive.azure.com/ for tuning thresholds without code. Prompt Shields and Groundedness Detection endpoints are billed separately per https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/. The architectural win is co-location: if your app is already on Azure, calls stay inside the same region and add 80-200ms instead of 200-400ms.

**AWS Comprehend + Rekognition** is two services. Comprehend's DetectToxicContent runs on text and integrates cleanly with Lambda. Rekognition's DetectModerationLabels runs on S3 objects or base64 images and supports StartContentModeration for video streams per https://docs.aws.amazon.com/rekognition/latest/dg/moderation.html. The pattern is event-driven: S3 PutObject triggers a Lambda that calls Rekognition, results land in DynamoDB, moderators review via a custom queue.

**Hive Moderation** ships as REST endpoints per https://docs.thehive.ai/ with one endpoint per classifier (visual moderation, text, deepfake, demographics, OCR). Hive's recommended pattern is fan-out: send the same asset to every relevant classifier in parallel, aggregate scores, route to a moderator queue if any flags above threshold. Latency runs 250-500ms p50 for image classification — not a hot-path API for real-time chat, but the right shape for post-upload moderation on marketplaces, dating apps, and UGC platforms.

**Sightengine** at https://sightengine.com/docs is the easiest to integrate of the six — one endpoint with model selection via URL parameter, returning JSON scores across whichever models you requested. The workflow builder at https://dashboard.sightengine.com/ chains models with conditional logic, eliminating application-side glue code.

Pricing deep-dive: what you actually pay at real traffic volumes

List price is meaningful here because most of these APIs do not negotiate discounts under enterprise volume. **OpenAI Moderation** is the easy one: $0 across all volume, subject to OpenAI's standard rate limits at https://platform.openai.com/docs/guides/rate-limits. A platform doing 10 million text moderation calls per month pays zero — the only cost is engineering time for the circuit breaker.

**Perspective API** is similarly free but quota-constrained. The default 1 QPS limit covers comment sections under 100,000 per day; serious news platforms apply for higher QPS via the dashboard. New York Times-level volumes are documented in Jigsaw case studies, which suggests Google approves real production quotas. Budget zero dollars but apply for quota in the first week, not the night before launch.

**Azure AI Content Safety** pricing per https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/ runs roughly $0.38 per 1,000 text records and $1.00 per 1,000 images at Standard tier. A platform doing 5 million text moderation calls plus 500,000 image moderation calls per month lands at roughly $1,900 + $500 = $2,400 per month. Prompt Shields is billed separately at roughly $0.30 per 1,000 calls. Free tier covers 5,000 of each for the first 12 months — useful for evaluation, not for production.

**AWS Comprehend Toxicity Detection** prices at roughly $0.0005 per unit of 100 characters per https://aws.amazon.com/comprehend/pricing/, so 5 million text moderation calls averaging 500 characters each costs about $12,500. **Rekognition Content Moderation** prices at $0.001 per image for the first 1M images, dropping to $0.0008 after, plus $0.10 per minute of video. A platform moderating 500,000 images and 10,000 minutes of video pays about $500 + $1,000 = $1,500 per month. AWS becomes the cost winner past ~5M images per month due to volume discounts.

**Hive Moderation** does not publish a public price sheet — pricing per https://hivemoderation.com/pricing is custom and quoted based on volume, classifier mix, and whether you want a managed moderator queue on top. Reported customer benchmarks land at roughly $1-2 per 1,000 calls for text moderation and $1.50-3 per 1,000 calls for image moderation, with deepfake detection priced higher. Enterprise minimums typically start around $25,000 per year. Hive's value is not the price — it is the breadth of classifiers and the willingness to train custom models inside the contract.

**Sightengine** publishes a transparent ladder per https://sightengine.com/pricing: Free at 500 operations/month, Starter at $29/month for 5,000 operations, Standard at $89/month for 25,000, Pro at $399/month for 150,000, then enterprise custom. A SaaS startup at 100,000 image moderations per month lands at Pro — about $400 per month. The pricing is the easiest of the six to forecast. Note that operations are per-model — three models on one image counts as three operations.

Real use-case decision matrix: which API to ship for which surface

If you are building an LLM app on the OpenAI stack and you need policy guardrails on user prompts plus model outputs, ship **OpenAI Moderation** as your primary filter. It is free, the omni-moderation-latest model covers thirteen categories including image inputs, and the integration is a single REST call per request. The documented pattern at https://platform.openai.com/docs/guides/moderation handles the 95 percent case. Layer Perspective API on top only if you also have a comment-style surface where tonal nuance matters more than policy categories.

If you run a news or media platform with a comment section and your moderation problem is incivility and toxicity rather than illegal content, ship **Perspective API**. It is the right tool for the New York Times use case and Jigsaw publishes detailed scoring methodology at https://developers.perspectiveapi.com/. Combine Perspective with a human moderator queue tuned at toxicity thresholds of 0.6-0.8 — going lower over-flags reclaimed slurs and political speech, going higher misses borderline harassment.

If your company is already on Azure and you need multi-modal moderation plus Prompt Shields for an enterprise GenAI rollout, ship **Azure AI Content Safety**. The co-location wins on latency, the SLA is real, and the four-severity-level model maps cleanly to existing enterprise risk frameworks. Pricing at https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/ is mid-pack — not the cheapest, but procurement is one PO with your existing Azure spend instead of a new vendor.

If you are AWS-native and you handle a lot of video moderation — short-form video platforms, livestreaming, video UGC — ship **AWS Rekognition Content Moderation**. The async StartContentModeration API for video is the most mature in the category and the per-minute pricing scales reasonably. Pair it with Comprehend for the text moderation side. The architecture is single-vendor and S3-native, which simplifies your IAM and data-residency story.

If you run a marketplace, dating app, or social platform where deepfake detection, OCR-aware moderation, and custom enterprise labels matter, ship **Hive Moderation**. The classifier breadth at https://hivemoderation.com/pricing is unmatched — weapons brand recognition, demographic estimation, AI-generated content detection, celebrity recognition, and OCR for embedded text in images. Hive is the only vendor on this list that will train a custom classifier inside your contract on a six-week turnaround.

If you are an indie developer, an early-stage SaaS, or a marketplace at under series-B scale that needs serious moderation without an enterprise procurement cycle, ship **Sightengine**. Pricing per https://sightengine.com/pricing is the easiest to forecast, the API is the easiest to integrate, and the model coverage (NSFW, weapons, drugs, deepfake, AI-generated, scam links) genuinely competes with Hive at a fraction of the contract complexity. The 500 free operations per month is enough to validate the API before swiping a card.

Latency, SLA, and reliability: what to verify before production

Median latency matters because moderation calls are on the critical path. **OpenAI Moderation** typically runs 150-300ms p50 from US East, longer from APAC because requests hit US-based infrastructure. There is no published SLA on the moderation endpoint specifically — it inherits the general OpenAI API reliability posture at https://status.openai.com/. Plan for occasional minute-scale outages and fail closed on high-risk surfaces, fail open with logging on low-risk surfaces.

**Perspective API** runs 200-400ms p50 depending on Google data center routing. No formal SLA because Jigsaw operates it as a public-good service, not a commercial cloud product. Multi-year uptime is in the 99.9+ range per community monitoring, but do not contractually rely on Perspective for safety-critical surfaces. Use it for queue ranking, not hard-block decisions on illegal content.

**Azure AI Content Safety** carries the standard Azure Cognitive Services 99.9% uptime SLA. Regional-endpoint latency lands 80-200ms p50 because Microsoft co-locates the model with the regional frontend — the lowest-latency option on the list when you are deployed in the same Azure region, and the gap matters if you are gating every keystroke in a chat product.

**AWS Comprehend and Rekognition** both carry the standard AWS 99.9% service SLA at https://aws.amazon.com/service-terms/. Regional-endpoint latency is 120-250ms p50 for Comprehend text and slightly longer for Rekognition image. AWS observability tooling (CloudWatch, X-Ray) makes these the easiest to monitor in production. Trade-off: Lambda cold-start adds 200-400ms on first invocation.

**Hive Moderation** quotes 99.9% uptime on enterprise contracts only. Median image latency runs 250-500ms p50 because each classifier is a separate inference. Wrong shape for real-time chat; right shape for post-upload moderation queues where breadth of classifiers wins over latency.

**Sightengine** offers a 99.9% uptime SLA on Pro tier and above per https://sightengine.com/pricing. Latency runs 200-400ms p50 via the global edge. The caveat: calling multiple models per asset stacks latency roughly linearly.

Build vs. buy: when to use the model's native moderation instead

A growing number of LLM platforms ship moderation baked into the inference call itself. **Vertex AI safety filters** at https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/configure-safety-attributes apply to every Gemini API call at no extra charge across four harm categories with configurable thresholds. **Anthropic Claude** has a constitutional AI safety layer built into every API call documented at https://docs.anthropic.com/en/docs/test-and-evaluate/strengthen-guardrails/reduce-harm. **OpenAI's** chat completions endpoint runs internal safety filtering before generation. For many LLM-only use cases, that is the only moderation layer you need.

Where the model's native moderation falls short: user-generated content that never touches an LLM. If you run an image-upload feature, a video-comment thread, a marketplace listing form, or any surface where users submit content that is rendered to other users without going through generation, the LLM's built-in safety does nothing for you. That is the entire reason this category exists. The default failure mode is shipping a beautiful LLM safety story while your image-upload endpoint accepts CSAM unfiltered.

Where build-your-own does work: classification problems specific enough that no off-the-shelf moderation API maps to them. If you need to detect, say, a specific list of regulated medical device brand names in user reviews for compliance reasons, training a small custom classifier on labeled data with a model like a fine-tuned BERT or a few-shot Claude call can outperform any of the six APIs on this list. Document the niche workflow, scope the build honestly, and compare the inference cost to the per-call price of the closest API.

The hybrid pattern that works in 2026: ship the model's native safety filter as the first line of defense, plus **OpenAI Moderation** as a free policy filter at the application layer, plus a specialist like **Hive** or **Sightengine** for image and video upload paths that the LLM never sees. Total cost lands somewhere between $0 and a few thousand dollars per month at series-A scale, and the coverage spans LLM hallucination, prompt injection, text policy, and visual content all at once.

If you go pure custom — training your own toxicity model on top of OpenAI embeddings or Anthropic Claude — the hidden cost is the dataset labeling. A defensible custom toxicity classifier needs at minimum 5,000 labeled examples per category, ideally 20,000+, and the labeling cost dominates the inference cost. Most teams that try this end up shipping a worse Perspective API eighteen months later. Budget for the labeling pipeline before you commit to building.

Run the math via the OpenAI API cost calculator before committing. The most common mistake is forgetting that moderation runs on every input, every output, every upload — LLM-as-judge token costs compound faster than the per-call price of a dedicated moderation API.

Implementation timeline: what the first 60 days look like

**OpenAI Moderation** is the fastest to ship: a single afternoon for a competent backend engineer. The integration is a wrapper around the moderations endpoint plus a category-to-action mapping (block, flag for review, log, allow). The remaining work is policy: deciding which categories at which scores trigger which actions, which is a trust and safety conversation, not an engineering one. Plan one engineering day plus one to two weeks of policy iteration before turning it on in production.

**Perspective API** also ships fast — a day for the integration, a week for quota approval if you need above 1 QPS, then two to four weeks of threshold tuning per surface. The hard part is not the integration; it is calibrating your toxicity threshold against your moderator capacity. Start at 0.7, monitor the false positive rate against human review for two weeks, then adjust per surface (a news comment section tolerates more than a children's app).

**Azure AI Content Safety** rollouts take two to four weeks for a production-grade deployment: one week for resource provisioning, IAM, and regional configuration; one week for threshold tuning in the Content Safety Studio at https://contentsafety.cognitive.azure.com/; one to two weeks for moderator workflow integration and alerting. The Studio UI accelerates the threshold tuning meaningfully — your trust and safety team can re-tune without filing engineering tickets.

**AWS Comprehend plus Rekognition** rollouts run three to six weeks. The text side is fast; the video side is where the time goes. Rekognition's async video moderation API needs an S3 event pipeline, an SNS notification flow, a DynamoDB store for moderation results, and a moderator review UI. AWS publishes a reference architecture at https://docs.aws.amazon.com/rekognition/latest/dg/video.html that covers most of the wiring, but your team will still spend a week tuning the moderation label thresholds for your specific content profile.

**Hive Moderation** rollouts take four to eight weeks because most Hive customers buy custom classifier training inside the contract — adding three to six weeks of labeling and training on top of the integration work. The timeline reflects the depth of customization, not the difficulty of the API itself.

**Sightengine** matches OpenAI Moderation on speed: one to two days for a basic integration plus one to two weeks of threshold tuning per model. The workflow builder at https://dashboard.sightengine.com/ lets non-engineers chain models together with conditional logic — the fastest path from zero to production-grade visual moderation in the category.

The opinionated 2026 pick: what I would ship

If I were shipping a new LLM-powered SaaS today, I would wire **OpenAI Moderation** as the free baseline filter on every text and image input, then add **Sightengine** for user-uploaded media that does not go through an LLM. Combined cost at series-A scale is roughly $400/month — Sightengine Pro covers 150,000 image operations and OpenAI Moderation is free. Verify at https://platform.openai.com/docs/guides/moderation and https://sightengine.com/pricing.

If I were running a news platform with a serious comment section, I would ship **Perspective API** tuned at 0.7 toxicity with a human moderator queue for borderline scores, and layer **OpenAI Moderation** for hard-block illegal-content categories. Total cost is zero plus moderator salaries. This is the architecture the New York Times has documented publicly and it remains the right answer in 2026.

If I were already on Azure and rolling out enterprise GenAI to a regulated industry, I would ship **Azure AI Content Safety** plus **Prompt Shields**. Single-vendor procurement, real SLA, and the four-severity-level harm taxonomy maps cleanly to regulated-industry risk frameworks. Cost at 5M text plus 500k image moderations per month is around $2,400 — rounding error on a real enterprise GenAI budget.

If I were running a video-heavy UGC platform on AWS-native infrastructure, I would ship **AWS Rekognition Content Moderation** for video and images plus **AWS Comprehend** for text. Volume pricing at scale beats alternatives past about 5M images per month, and the S3-event-driven architecture is the best-tested in the category. Verify at https://aws.amazon.com/rekognition/pricing/.

If I were running a marketplace or dating app where deepfake detection or custom classifier training mattered, I would ship **Hive Moderation** on an enterprise contract. The classifier breadth is unique and the willingness to train custom models inside the contract is real. Budget $25k to $100k per year. Verify at https://hivemoderation.com/pricing.

The one thing I would not do in 2026 is ship a single-vendor stack and call it done. Every API on this list misses something — OpenAI misses spam and deepfakes, Perspective misses images entirely, Azure and AWS miss niche custom labels, Hive and Sightengine cost real money for categories OpenAI covers free. Layer two APIs, not one, and route surfaces to the API that handles each best. The cost of one breach a single-vendor stack missed dwarfs the cost of a second API call.

How to pick and roll out the right content moderation API for your team

1
Step 1: Inventory every surface that needs moderation
Before you take a single vendor demo, write down every UGC surface in your product: chat messages, profile bios, profile photos, uploaded images and videos, comments, reviews, marketplace listings, support tickets, AI-generated outputs shown to other users. For each, note modality (text, image, video), monthly volume, latency sensitivity (real-time chat vs. async upload), and legal risk tier (illegal vs. rude). That inventory is the single most important artifact in this decision — it determines whether you need one API or three, and whether free policy filters suffice or you need a specialist like Hive. If you cannot fill out this sheet, you are not ready to pick a vendor.
2
Step 2: Model your monthly cost at 1x, 3x, and 10x current traffic
Build a one-page cost model for each finalist at current traffic, 3x (one viral moment), and 10x (one year of growth). For OpenAI Moderation and Perspective, the cost is zero in all three cases — assuming you stay inside rate limits, which is the real constraint. For Azure, AWS, Hive, and Sightengine, use the pricing pages cited above. Most teams underestimate image and video moderation cost by 3-5x because they forget every upload variant (thumbnail, mid-res, full-res) often needs separate moderation. Get this model in front of finance before committing.
3
Step 3: Run a parallel-shadow test on real production traffic
Pick the top two finalists and run them in shadow mode against 30 days of real production content. Send every moderation call to both APIs in parallel, log both scores plus the human moderator decision, and measure: precision (of items flagged, how many should have been), recall (of items that should have been flagged, how many were), and disagreement rate between the APIs. The vendor with the better precision-recall tradeoff on your specific content wins. Vendor benchmark claims are universally optimistic — the right API for one platform is often wrong for another. Two weeks of shadow data is worth more than two months of demos.
4
Step 4: Verify legal, data residency, and SLA terms in writing
Get the data processing agreement, data residency commitments, and SLA terms in writing before signing. For Azure, verify your regional endpoint commitment — Cognitive Services has occasionally moved workloads across regions during incidents. For AWS, verify whether moderation traffic stays inside your specified region. For Hive, verify what data is retained and whether it trains their models. For OpenAI and Perspective, verify data-use terms — both have changed for enterprise versus standard tiers. Get a CCPA and GDPR data-subject-request workflow in writing if you handle EU or California users.
5
Step 5: Ship with circuit breakers, dashboards, and human escalation paths
Wire every moderation call behind a circuit breaker that fails closed for high-risk content and fails open with logging for low-risk. Stand up a dashboard showing API latency p50/p95/p99, error rate, and category-flag rate per surface — the day a viral post 10x'es your flag rate is the day you find out whether your moderator queue can handle it. Alert on flag-rate anomalies (sudden spike means either an attack or a model regression on the vendor side). Document the human escalation path: every automated decision needs a user appeal route and a moderator override. Skipping any of these turns moderation from an asset into a liability the first time something goes wrong.

Digital Dashboard Hub

The prompt patterns above work 10x better when they live in a library you actually own — tunable to your niche, exportable to GPT-5, Claude, Gemini, Perplexity, Midjourney, Llama. Stop pasting across 6 tools.

Try DDH's AI Prompt Builder — free 14 days, no card. →

Related prompt tools

OpenAI Moderation API vs Perspective API: head-to-head→LLM toxicity detection tools compared (2026)→LLM jailbreak prevention: tools and tactics (2026)→OpenAI API cost calculator→

Frequently Asked Questions

Is OpenAI Moderation really free, and if so, what is the catch?

Yes, OpenAI's omni-moderation-latest model at https://platform.openai.com/docs/guides/moderation is genuinely free under standard OpenAI API rate limits — no per-call charge, no monthly fee. The catch is rate limits, not dollars: the moderation endpoint shares organizational TPM and RPM quotas with the rest of your usage, so a high-volume app can hit ceilings on busy days. The other catch is coverage: it does not detect spam, scam links, deepfakes, weapons brand names, or AI-generated images. Treat it as a free baseline filter, not a complete trust and safety stack.

Why does Perspective API only support 18 languages when newer APIs claim 100+?

Perspective at https://perspectiveapi.com/ is operated by Jigsaw as a public-good project — they publish formal toxicity models only for languages where native-speaker annotators validate label quality. The 18 production languages cover roughly 80 percent of global comment volume. Azure and AWS claim 100+ languages via auto-translation before classification, which adds latency and degrades nuance — a slur translated into English loses cultural context. If you care about non-English markets, validate per-language precision in your shadow test rather than trusting the vendor language count.

How does Azure AI Content Safety compare to AWS Rekognition for image moderation?

Azure and AWS price similarly at around $1 per 1,000 images at Standard tier per https://azure.microsoft.com/en-us/pricing/details/cognitive-services/content-safety/ and https://aws.amazon.com/rekognition/pricing/. Azure's four-severity model per harm category is cleaner for risk-tiering; AWS's 35+ labels give finer category coverage including weapons types, drugs, gambling, and alcohol brand recognition. AWS wins for S3-resident workflows and video; Azure wins for Microsoft-stack co-location and Prompt Shields. If vendor-neutral, run both on your real image set in shadow mode and pick on precision-recall.

Can I get away with just using a model's built-in safety filter instead of a separate moderation API?

For LLM-only outputs (chatbot responses, AI-generated text), yes — Vertex AI safety filters, OpenAI's internal chat-completion safety, and Anthropic's constitutional AI layer all provide reasonable baseline safety. For user-generated content that never touches an LLM (uploaded images, video, marketplace listings, profile photos), absolutely not — the model's safety filter does nothing for you because there is no model in the loop. The default failure mode is launching with a polished LLM safety story and an unprotected upload endpoint. Audit every UGC surface separately.

What is the cheapest credible content moderation stack for an indie SaaS startup?

If you are budget-constrained: **OpenAI Moderation** (free, https://platform.openai.com/docs/guides/moderation) for all text plus **Sightengine** Starter at $29/month per https://sightengine.com/pricing for 5,000 image operations. Total cost about $29/month, scaling to $89 at Standard (25,000 ops) and $399 at Pro (150,000 ops). This covers thirteen text policy categories plus NSFW, weapons, drugs, gore, and deepfake detection. You will not get Hive-level deepfake quality or AWS-level video moderation, but you get a defensible production moderation layer that scales to Series-A volume.

How accurate is deepfake detection from Hive and Sightengine in 2026?

Accurate enough to flag obvious AI-generated images and deepfake videos from major generation tools (Sora, Veo, Midjourney, Stable Diffusion), but not accurate enough to be the sole basis for removal on borderline cases. Both Hive (https://hivemoderation.com/ai-generated-content-detection) and Sightengine (https://sightengine.com/detect-ai-generated-images) report 90+ percent precision on public benchmarks, but adversarial deepfakes designed to evade detection cut accuracy by 20-30 points. Treat deepfake scores as a moderator-queue signal, not an automated hard-block — especially on content with legal or political stakes.

Do any of these APIs offer EU data residency for GDPR compliance?

Yes, with caveats. Azure AI Content Safety supports 30+ regions including West Europe and North Europe. AWS Comprehend and Rekognition support EU (Frankfurt) and EU (Ireland). Sightengine offers a France data residency option per https://sightengine.com/security. OpenAI offers ZDR (Zero Data Retention) for Enterprise customers but not EU-specific residency as of June 2026. Perspective runs on Google US/EU infrastructure without an EU-only commitment. Hive offers EU residency only on enterprise contracts. If GDPR data residency is a hard requirement, verify the specific regional commitment in writing — vendor marketing pages and contractual commitments are not always the same thing.

Will any of these APIs work for languages outside the major Western set (Japanese, Korean, Arabic, Hindi)?

Yes, but with variable quality. AWS Comprehend supports Japanese, Korean, Arabic, and Hindi natively per https://aws.amazon.com/comprehend/. Azure claims 100+ languages via auto-translation — useful in practice, lossy on slang. OpenAI's omni-moderation-latest handles 40+ languages at varying quality (English, Spanish, French, German, Portuguese, Japanese, Chinese best). Perspective formally supports 18 languages including Japanese, Korean, Arabic, and Hindi at production quality. Hive covers 50+ languages with native-speaker validation. For any non-English market, validate per-language precision in your shadow test — vendor claims and on-the-ground quality diverge most in the long-tail languages.

What happens if my moderation API goes down — do I fail open or fail closed?

Depends on the surface. For high-risk surfaces (image uploads on a platform with CSAM exposure, chat in a children's product, regulated marketplace listings), fail closed: block submissions and show a retry message. For low-risk surfaces (comments on a tech blog, in-app chat for adults), fail open with logging and queue for retroactive moderation when the API recovers. The wrong answer is one global policy — every moderation call should have a documented failure mode per surface. Most production incidents in this category come from teams that failed open universally and then explained the incident to a regulator, or failed closed universally and lost days of UGC to a vendor outage. Build the circuit breaker right the first time.

You now know which content moderation API to ship. Now make every prompt your AI tools run actually hit.

AI Prompt Generator builds production-ready system prompts that work across ChatGPT, Claude, Gemini, OpenAI Moderation, Azure Content Safety, and every other AI tool in this article — so your trust and safety reviews get sharper signals, not generic AI fluff. Stop tweaking prompts by hand and start shipping prompts that drive measurable lift. 14-day free trial, no credit card required.

Browse all prompt tools →