Automatic Scaling Args
Automatic scaling is based on request rate, response latencies, and other application metrics.
Constructors
Properties
The time period that the Autoscaler (https://cloud.google.com/compute/docs/autoscaler/) should wait before it starts collecting information from a new instance. This prevents the autoscaler from collecting information when the instance is initializing, during which the collected usage would not be reliable. Only applicable in the App Engine flexible environment.
Target scaling by CPU usage.
Target scaling by user-provided metrics. Only applicable in the App Engine flexible environment.
Target scaling by disk usage.
Number of concurrent requests an automatic scaling instance can accept before the scheduler spawns a new instance.Defaults to a runtime-specific value.
Maximum number of idle instances that should be maintained for this version.
Maximum amount of time that a request should wait in the pending queue before starting a new instance to handle it.
Maximum number of instances that should be started to handle requests for this version.
Minimum number of idle instances that should be maintained for this version. Only applicable for the default version of a service.
Minimum amount of time a request should wait in the pending queue before starting a new instance to handle it.
Minimum number of running instances that should be maintained for this version.
Target scaling by network usage.
Target scaling by request utilization.
Scheduler settings for standard environment.