<ul data-eligibleForWebStory="true">Many teams face scaling issues despite having auto scaling, alarms, and metrics set up for their ECS tasks.The problem often lies in the mismatch between task scaling and available cluster capacity.Tasks can remain stuck in PENDING if there is not enough CPU or memory available on the cluster.ECS Cluster Auto Scaling (CAS) becomes crucial to address this capacity constraint.Understanding API behavior and estimating task size are essential for proper ECS scaling.Monitoring key metrics like CPUReservation and MemoryReservation is vital for effective scaling policies.ECS CAS utilizes a formula to calculate the needed EC2 capacity based on task resource demand.Daemon and Non-Daemon tasks play a significant role in determining EC2 scaling in ECS.ECS checks task placement every 15 seconds and can have up to 100 tasks in provisioning per cluster.By implementing CAS, linking services to capacity providers, and enabling managed scaling, tasks can scale efficiently without 502 errors.Estimating concurrency, understanding app behavior, and wisely choosing EC2 instances are key factors for early capacity planning.