← Back to glossary
Glossary

Canary Evaluation

Reviewed 9 April 2026 Canonical definition

Canary evaluation routes a small fraction of production agent traffic to a new agent version and monitors quality, error rate, and policy compliance in real time before full rollout. It is a risk management technique that limits exposure to regressions while enabling empirical performance comparison on live data.