← Back to glossary
Glossary

Confidence Score

Reviewed 20 March 2026 Canonical definition

A confidence score is a numeric value representing how certain a model or agent is about a particular output or decision. Low-confidence actions can be routed for human review or blocked by policy rules.