Cloud Computing
Auto Scaling
Definition
Auto scaling is a cloud computing feature that allows you to automatically adjust the amount of computational resources in a server farm - typically measured by the number of active servers - up or down based on the load.
Why It Matters
Auto scaling is the mechanism that provides elasticity. It ensures that you have enough capacity to meet demand during peak times, and also saves money by automatically removing resources when they are no longer needed.
Contextual Example
An Auto Scaling Group for a website is configured to monitor the CPU utilization of its servers. If the average CPU load goes above 70%, it automatically launches a new server instance. If the load drops below 30%, it terminates an instance.
Common Misunderstandings
- Auto scaling is essential for building cost-effective and resilient applications in the cloud.
- It is a standard feature in all major cloud providers.