← Back to glossary
Glossary

Evaluation Pipeline

Reviewed 9 April 2026 Canonical definition

An evaluation pipeline is an automated workflow that benchmarks agent quality, accuracy, safety, and policy compliance before and after deployment. It replaces manual spot-checking with repeatable, data-driven assessment.