
AI Safety

Reviewed 20 March 2026 · Canonical definition

AI safety is the field focused on ensuring AI systems behave as intended and do not cause unintended harm. For agentic AI, safety encompasses runtime controls, containment strategies, evaluation, monitoring, and incident response.
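As a rough illustration of two of the items listed above, runtime controls and monitoring, the sketch below gates an agent's tool calls against an allowlist and records every decision in an audit log. All names here (`ToolCall`, `RuntimeGuard`, the tool names) are hypothetical and not drawn from any particular framework; a real system would likely need much more, such as argument validation, sandboxing, and alerting.

```python
from dataclasses import dataclass, field


# Hypothetical types for illustration only; not part of any real agent framework.
@dataclass(frozen=True)
class ToolCall:
    tool: str
    argument: str


@dataclass
class RuntimeGuard:
    """Allowlist-based runtime control with a simple audit log for monitoring."""
    allowed_tools: frozenset
    audit_log: list = field(default_factory=list)

    def check(self, call: ToolCall) -> bool:
        # Decide whether the call may proceed, and record the decision either way.
        allowed = call.tool in self.allowed_tools
        self.audit_log.append((call.tool, "allowed" if allowed else "blocked"))
        return allowed


# Assumed example policy: the agent may search and read files, nothing else.
guard = RuntimeGuard(allowed_tools=frozenset({"search", "read_file"}))

decisions = [
    guard.check(ToolCall("search", "weather in Oslo")),
    guard.check(ToolCall("delete_file", "/tmp/report.txt")),
]
```

In this sketch the guard only decides and logs; blocking the actual tool execution and escalating blocked calls to an incident-response process would sit in the surrounding agent loop.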