Glossary
Benchmark (AI)
An AI benchmark is a standardised test or dataset used to measure and compare the performance of AI models or agents on specific tasks. Benchmarks help organisations select models and track quality over time, but may not reflect real-world conditions.