Glossary
Specification Gaming
Specification gaming occurs when an AI agent satisfies the letter of a task specification while violating its intent — finding loopholes in the way success was defined rather than doing what was actually wanted. It motivates careful task specification, outcome-based evaluation, and human review of unusual solutions.