Questions tagged [autoscaling]

Auto scaling is a feature of cloud computing services which allows adding more computational resources automatically if load increases or based on triggers/custom metrics.

Auto scaling is a feature of cloud computing services which allows almost infinite growth of computational power automatically as load increases on CPU, memory or disk storage.

In particular, such feature is available, for example, in Amazon EC2 service using ASG.

1836 questions
114
votes
10 answers

AWS EC2 Auto Scaling Groups: I get Min and Max, but what's Desired instances limit for?

When you setup an Auto Scaling groups in AWS EC2 Min and Max bounds seem to make sense: The minimum number of instances to scale down to based on policies The maximum number of instances to scale up to based on policies However, I've never been…
Ray
  • 40,256
  • 21
  • 101
  • 138
70
votes
5 answers

What is the difference between AWS ASG Launch Templates and Launch Configurations?

I'm getting confused when using Terraform to provision an auto-scaling group. Should I use launch configuration or launch template for EC2 properties, such as which AMI, instance types, ...? I don't know what the difference is between them, which we…
Dat Ho
  • 975
  • 1
  • 7
  • 9
36
votes
2 answers

AWS Auto Scaling Group - Application Load Balancer Request Count Per Target

On AWS, you can create an auto scaling policy which scales based on "Application Load Balancer Request Count Per Target". Like this: This has a min of 1 instance and a max of 5. It should aim to achieve 10 "Request count per target" for my…
ThePerson
  • 3,048
  • 8
  • 43
  • 69
33
votes
3 answers

Amazon EC2 autoscaling down with graceful shutdown?

We're looking at using EC2 autoscaling to deal with spikes in load. In our case we want to scale up instances based on an SQS queue size and then down scale with the queue size gets back under control. Each SQS message defines a potentially long…
danw
  • 1,528
  • 1
  • 17
  • 17
31
votes
5 answers

How to create variable number of EC2 instance resources in Cloudformation template?

How to create variable number of EC2 instance resources in Cloudformation template, according to a template parameter? The EC2 API and management tools allow launching multiple instances of the same AMI, but I can't find how to do this using…
29
votes
6 answers

What "desired instances" is needed for? AWS Amazon Webservices AutoScaling group

i am facing a strange situation on Amazon AWS. I don´t understand for what the desired Instances number is used for? I have an autoscaling group that contains scale up and scale down actions configured. I have a custom PHP file that run actions…
X.Otano
  • 2,079
  • 1
  • 22
  • 40
28
votes
4 answers

How do I update a Kubernetes autoscaler?

I have created a Kubernetes autoscaler, but I need to change its parameters. How do I update it? I've tried the following, but it fails: ✗ kubectl autoscale -f docker/production/web-controller.yaml --min=2 --max=6 Error from server:…
aknuds1
  • 65,625
  • 67
  • 195
  • 317
27
votes
6 answers

Elastic Load Balancing both internal and internet-facing

We are trying to use Elastic Load Balancing in AWS with auto-scaling so we can scale in and out as needed. Our application consists of several smaller applications, they are all on the same subnet and the same VPC. We want to put our ELB between one…
engma
  • 1,849
  • 2
  • 26
  • 55
26
votes
4 answers

Auto Scale Fargate Service Based On SQS ApproximateNumberOfMessagesVisible

I would like to scale out my aws fargate containers based on the size of the SQS queue. It appears that I can only scale based on the container's CPU or Memory usage. Is there a way to create a policy that would scale out and in based on queue size?…
24
votes
4 answers

Difference between API versions v2beta1 and v2beta2 in Horizontal Pod Autoscaler?

The Kubernetes Horizontal Pod Autoscaler walkthrough in https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale-walkthrough/ explains that we can perform autoscaling on custom metrics. What I didn't understand is when to use the…
Ajay Maity
  • 740
  • 2
  • 8
  • 17
24
votes
5 answers

In Kubernetes, how do I autoscale based on the size of a queue?

Suppose I have a RabbitMQ instance and a set of pods that pick messages from RabbitMQ and process them. How do I make Kubernetes increase the number of pods as the queue size increases? (I'm mentioning RabbitMQ, but that's just an example. Pick your…
Likk
  • 747
  • 3
  • 7
  • 8
24
votes
2 answers

What is the best way I can scale my nodejs app?

The basics Right now a few of my friends and I are trying to develope a browser game made in nodejs. It's a multiplayer top-down shooter, and most of both the client-side and server-side code is in javascript. We have a good general direction that…
Michael
  • 898
  • 1
  • 8
  • 14
22
votes
1 answer

AWS/ECS CPUUtilization average vs maximum

After reading AWS documentation I am still not clear about cloudwatch metrics statistics average and maximum, specifically for ECS CPUUtilization. I have a AWS ECS cluster fargate setup, a service with minimum count of 2 healthy task. I have enabled…
21
votes
6 answers

Prevent killing some pods when scaling down possible?

I need to scale a set of pods that run queue-based workers. Jobs for workers can run for a long time (hours) and should not get interrupted. The number of pods is based on the length of the worker queue. Scaling would be either using the horizontal…
Stragulus
  • 1,023
  • 11
  • 10
19
votes
2 answers

Auto-Scaling Groups Don't Update on Launch Configuration Change

I have an AWS Auto-Scaling Group, a Launch Configuration, and an Auto-Scaling Group Policy defined in Terraform like this: resource "aws_autoscaling_group" "default" { name = "..." health_check_type = "EC2" vpc_zone_identifier = ["${...}"] …
Naftuli Kay
  • 87,710
  • 93
  • 269
  • 411
1
2 3
99 100