Glossary
Agent Evaluation
Agent evaluation is the systematic assessment of an agent's quality, accuracy, safety, and policy compliance across a representative set of tasks. It should be automated, repeatable, and run before every deployment.
Agent evaluation is the systematic assessment of an agent's quality, accuracy, safety, and policy compliance across a representative set of tasks. It should be automated, repeatable, and run before every deployment.
Prefactor helps teams observe, evaluate, and improve their AI agents in production — across every framework and provider.