What each platform actually does (and the marketing copy you should ignore)
**NVIDIA NeMo Guardrails** is fundamentally a dialog orchestration framework, not a classifier. You write rules in Colang — a small DSL that looks like a cross between Python and a state machine — to declare what the bot should do in named situations. The framework intercepts user messages and model outputs and routes them through input rails, dialog rails, retrieval rails, and output rails before anything reaches the user. The repo at https://github.com/NVIDIA/NeMo-Guardrails is Apache 2.0 and runs anywhere Python runs; NVIDIA AI Enterprise customers can deploy it as a NIM microservice for managed scaling, but the core is free.
**Guardrails AI** is a Python library that wraps your LLM calls and validates each input and output against a stack of composable validators. The Guardrails Hub at https://hub.guardrailsai.com/ ships dozens of pre-built validators — toxic language, PII, profanity, JSON schema, regex, custom Pydantic types — and you compose them per output field. The OSS library is free; the cloud control plane adds a dashboard, hosted validator endpoints, and team management, with pricing that scales per 1,000 validations (verify current pricing at https://www.guardrailsai.com/pricing).
**Lakera Guard** is the most product-shaped of the six. It is a hosted API you call before and after your LLM call, and the response tells you whether the input looks like a prompt injection, jailbreak, PII leak, or off-topic message. Lakera trained its own classifiers on a corpus of attacks they collected through the Gandalf jailbreak game and customer telemetry — see https://lakera.ai/ for the methodology. The free tier handles 10,000 requests per month, which is plenty for a proof of concept, and enterprise pricing starts around $999 per month for higher volumes plus SSO.
**Rebuff** is open-source by design and self-host only. The repo at https://github.com/protectai/rebuff (Protect AI took over stewardship in 2023) combines four detection layers: heuristics against known injection patterns, a dedicated LLM check, an embedding-based vector store of past attacks, and canary tokens — secret strings inserted into prompts that, if leaked back, prove an injection succeeded. It is a starter kit, not a complete platform. You bring the storage, the orchestration, and the operational tooling.
**Robust Intelligence** (Cisco acquired the company in 2024) is the enterprise-grade eval-time platform. Their AI Validation product runs algorithmic red teaming against your model and continuously generates new attacks based on the model's weaknesses; AI Firewall is the runtime proxy that blocks attacks in production. The pitch at https://www.robustintelligence.com/ is that you cannot defend what you have not tested, and they couple the two products tightly. Pricing is custom enterprise — expect $80,000 to $250,000-plus annually for a meaningful deployment.
**IBM watsonx.governance** (the rebranded Guardium AI Security plus OpenPages-derived governance tooling) is the IBM bet on AI risk management as a discipline, not just an inference-time filter. It catalogs models, tracks lineage, runs detectors at runtime, and produces the documentation regulated industries need for EU AI Act, NYDFS, and similar regimes. The product page at https://www.ibm.com/products/watsonx-governance positions it for existing IBM customers; the procurement and integration overhead make it a heavy choice for teams not already in the IBM ecosystem.