← Back to glossary
Glossary

Guardrails

Reviewed 20 March 2026 Canonical definition

Guardrails are runtime constraints that limit what an AI agent can do, what data it can access, and how it can respond. They are designed to remain enforceable even if the agent reasons toward an unsafe action.