Glossary

AI Safety

Reviewed 9 April 2026 Canonical definition

AI safety is the field focused on ensuring AI systems behave as intended and do not cause unintended harm. For agentic AI, safety encompasses runtime controls, containment strategies, evaluation, monitoring, and incident response.

AI Safety

Related articles

Related terms