Agent Scaling

Reviewed 20 March 2026 | Canonical definition

Agent scaling is the process of increasing or decreasing the compute resources allocated to AI agents in response to demand. Scaling decisions affect cost, latency, and availability, so governance must ensure that scaling never bypasses security or policy controls.
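
As an illustration only, the sketch below shows one way a demand-based scaling decision could be gated by a governance check. Everything in it is an assumption made for the example rather than part of the definition: the `ScalingPolicy` fields, the `desired_replicas` and `plan_scaling` functions, the replica ceiling, and the `controls_attached` flag (standing in for whatever security or policy controls a deployment requires).

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class ScalingPolicy:
    """Hypothetical policy; field names and defaults are illustrative."""
    min_replicas: int = 1
    max_replicas: int = 10                 # governance-imposed ceiling (cost control)
    target_requests_per_replica: int = 50  # demand each agent replica should absorb


def desired_replicas(pending_requests: int, policy: ScalingPolicy) -> int:
    """Compute a replica count from demand, clamped to the policy bounds."""
    raw = math.ceil(pending_requests / policy.target_requests_per_replica)
    return max(policy.min_replicas, min(policy.max_replicas, raw))


def plan_scaling(current: int, pending_requests: int,
                 policy: ScalingPolicy, controls_attached: bool) -> int:
    """Return the replica count to apply.

    Scale-up is refused (the current count is kept) unless the new
    replicas would carry the required security and policy controls.
    Scale-down is always allowed in this sketch.
    """
    target = desired_replicas(pending_requests, policy)
    if target > current and not controls_attached:
        return current
    return target


if __name__ == "__main__":
    policy = ScalingPolicy()
    print(plan_scaling(current=2, pending_requests=120,
                       policy=policy, controls_attached=True))
```

In this sketch the demand calculation and the governance gate are kept as separate functions, so the ceiling and the controls check apply no matter how the demand estimate is produced.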