Glossary
Canary Evaluation
Canary evaluation routes a small fraction of production agent traffic to a new agent version and monitors quality, error rate, and policy compliance in real time before full rollout. It is a risk management technique that limits exposure to regressions while enabling empirical performance comparison on live data.