Glossary

Agent Evaluation

Reviewed 9 April 2026 Canonical definition

Agent evaluation is the systematic assessment of an agent's quality, accuracy, safety, and policy compliance across a representative set of tasks. It should be automated, repeatable, and run before every deployment.

See how every agent performs — and make it better

Prefactor helps teams observe, evaluate, and improve their AI agents in production — across every framework and provider.

Book a demo View docs

Agent Evaluation

Related articles

Related terms

See how every agent performs — and make it better