← Back to glossary
Glossary

Agent Scaling

Reviewed 9 April 2026 Canonical definition

Agent scaling is the process of increasing or decreasing the compute resources allocated to AI agents based on demand. Scaling decisions affect cost, latency, and availability — and governance must ensure that scaling does not bypass security or policy controls.