What each toolkit actually does (and the marketing copy to ignore)
**NVIDIA NeMo Guardrails** is a programmable dialog layer that sits between your application and an LLM. You write flow files in Colang — a small domain-specific language that looks like a cross between Python and a chatbot script — and the runtime executes them as a state machine on every turn. The shipping toolbox includes topical rails (keep the bot on-topic), moderation rails (block disallowed content), jailbreak detection, sensitive-data masking, and fact-checking rails that compare outputs against retrieved documents. The repo at https://github.com/NVIDIA/NeMo-Guardrails is Apache 2.0 and the canonical docs live at https://docs.nvidia.com/nemo/guardrails/.
**Guardrails AI** is a different shape of tool. It is a Python validator pipeline that wraps an LLM call. You declare a RAIL spec — either an XML schema or a Pydantic model — and Guardrails enforces the output against per-field validators (regex match, value within range, no PII, no profanity, no hallucination versus retrieved context, etc.). The OSS library at https://github.com/guardrails-ai/guardrails is MIT licensed and the public validator marketplace is at https://hub.guardrailsai.com/. There is no dialog state machine and no Colang. You compose validators per call.
The marketing for both projects overlaps heavily — both pitch themselves as 'the guardrails for LLMs' — but they are not interchangeable. NeMo Guardrails answers 'what should this assistant be allowed to talk about, and what should it do when a user tries to push it outside that.' Guardrails AI answers 'what shape should this output be, and what concrete properties does each field need to satisfy.' If your problem is dialog flow, NeMo. If your problem is per-call structured-output validation, Guardrails AI. Most production teams need both, layered.
**LangChain Guardrails** is much simpler than either: a callback layer in LangChain that runs a constitutional-AI critique or a moderation chain before returning the response. It is fine for a hackathon, undersized for production. **Llama Guard** is a fine-tuned Llama classifier — model card at https://ai.meta.com/llama/ — that takes a prompt or response and returns a safety label across 13 hazard categories. It is a building block, not a framework. **Lakera Guard** is the commercial pick: a hosted REST API with a tuned prompt-injection model, a fast PII detector, and a content moderation classifier, priced per request at https://www.lakera.ai/pricing.
If you only remember one thing about the category in 2026, remember this: the open-source toolkits are not competing with the paid platforms head-on. NeMo Guardrails and Guardrails AI are competing with each other and with internal builds. Lakera Guard and managed services like AWS Bedrock Guardrails (https://aws.amazon.com/bedrock/guardrails/) are competing for the teams that do not want to run any of this themselves. Decide which side of that line you are on before you compare features.
One useful frame: NeMo Guardrails is what you reach for when you are designing a new agent and want guardrails baked into the dialog graph from day one. Guardrails AI is what you reach for when you have an existing LLM call in production and you need to bolt on field-level validation without rewriting the agent. The former is greenfield architecture; the latter is retrofit. Most teams end up with both at different points in their stack.