Create and Configure the Auto Scaling Group in EC2
Last Updated :
17 Nov, 2023
Auto Scaling is an Amazon Web Service it allows instances to scale when traffic or CPU load increases. Auto-scaling is a service that monitors all instances that are configured into the Auto Scaling group and ensures that loads are balanced in all instances. Depending on the load scaling group, increase the instance according to the configuration. When we created the auto-scaling group, we configured the Desired capacity, Minimum capacity, maximum capacity, and CPU utilization. If CPU utilization increases by 60% in all instances, one more instance is created, and if CPU utilization decreases by 30% in all instances, one instance is terminated. These are totally up to us; what is our requirement. If any Instance fails due to any reason, then the Scaling group maintains the Desired capacity and starts another instance.
The auto-scaling group follows Horizontal Scaling. This service is very important for us nowadays because we do not need to create new instances manually and do not require manual monitoring.
AWS Auto Scaling
AWS auto scaling is used to scale up and scale down the EC2-instance by depending up on the incoming traffic. You can scale up and scale down the applications in few minutes based up on the traffic which will decrease the latency of the application to the end-users. You can integrate the AWS Auto Scaling with multiple services provided by the AWS like Amazon traffic,, Amazon DynamoDB, and Amazon Aurora. You can also decrease the cost of an application because of dynamic scaling. When there is traffic , only maximum resources are used other wise it will use minimum resources.

Benefits of Auto Scaling
- Dynamical scaling: AWS auto-scaling service doesn't required any type of manual intervention it will automatically scale the application down and up by depending up on the incoming traffic.
- Pay For You Use: Because of auto scaling the resource will be utilised in the optimised way where the demand is low the resource utilisation will be low and the demand will high the resource utilisation will increase so the AWS is going to charge you only for the amount of resources you really used.
- Automatic Performance Maintenance: AWS autoscaling maintains the optimal application performance with considering the workloads it will ensures that the application is running to desired level which will decrease the latency and also the capacity will be increased by based on your application
How AWS Auto Scaling Works?
AWS autoscaling will scale the application based on the load of application. Instead of scaling manually AWS auto scaling will scale the application automatically when the incoming traffic is high it will scale up the application and when the traffic is low it will scale down the application.

First you should choose which service or an application you want to scale then select the optimisation way like cost and performance and then keep track how the scaling is working.
Steps To create Auto Scaling Launch Template
Step 1: Click on the All Services.
Step 2: Click on the EC2(Elastic Cloud Computing).
Step 3: Scroll Down and click on the Launch Templates and click on the Create launch template
Step 4: Type the Template name.
Step 5: Select the Amazon Machine Image.
Step 6: Select the Instance Type and Key pair.
Step 7: Select the Security Group or Create the new one.
Step 8: Click on the Create Launch Template.
Step 9: Now you can see the template is created. Now, scroll down and click on the Auto Scaling Groups.
Create An Auto Scaling Group Using a Launch Template
Step 1: Click on the Create Auto Scaling group.
Step 2: Type the Auto Scaling group name.
Step 3: Select your Template.

Step 4: Select the VPC or go with the default VPC and also select the Availability zone.

Step 5: Configure the Group size and Scaling policies.
Select as per your requirement:
- Desired: 4
- Minimum: 4
- Maximum: 8

Step 6: Select the Target tracking scaling policy.

Step 7: Click on the Create Auto Scaling Group.

- Now you can see the Auto Scaling is creating and it is also creating the desired state of the EC2 Instance

- We selected the Desired state equal to 4 and you can see the 4 Instance is Running
Similar Reads
Creating Autoscaling And Autoscaling Group Using Terraform In modern cloud computing environments, the ability to dynamically scale resources in light of changing interests is essential for keeping up with execution, accessibility, and cost-effectiveness. Autoscaling and Autoscaling Groups are key parts that enable this unique scaling functionality in cloud
7 min read
What is Security Group in AWS and How to Create it? Cybersecurity has grown to be a crucial component of any business in the modern digital age. Access management is a fundamental element of cybersecurity. Controlling access includes deciding who has access to what resources and for what goals. The management of resource access in the cloud is done u
8 min read
Create and Configure: Mastering AWS Network Load Balancer Essentials In the domain of distributed computing, compelling administration of network traffic is fundamental for keeping up with the presentation, dependability, and versatility of utilizations. Amazon Web Services (AWS), a main cloud specialist organization, offers the Network Load Balancer (NLB) as an esse
14 min read
Create EC2 Instance in AWS (Amazon): Complete Tutorial Amazon EC2 (Elastic Compute Cloud) is a cloud computing service provided by AWS that allows users to rent virtual machines (VMs) to run applications on-demand. EC2 Offers a scalable, cost-efficient, and flexible computing environment without the need for users to manage physical hardware. ]Users can
11 min read
How to Create IAM roles for Amazon EC2? In this article, we will cover how we can easily create an IAM role use it with an EC2 instance, and provide the required permissions with the S3 policies. These IAM Roles are the identities that we are creating in our account so that we can provide specific permissions to the users. So these Roles
7 min read