← Back to glossary
Glossary

Evaluation Pipeline

Reviewed 20 March 2026 Canonical definition

An evaluation pipeline is an automated workflow that benchmarks agent quality, accuracy, safety, and policy compliance before and after deployment. It replaces manual spot-checking with repeatable, data-driven assessment.