Glossary
AI Alignment
AI alignment is the challenge of ensuring an AI system's goals and actions remain consistent with human intentions and organisational policies. For agents, misalignment can mean optimising for a metric in ways that violate safety or ethics constraints.