manual Scaling
Parameters
value
Manually select the number of nodes to use for serving the model. You should generally use auto_scaling
with an appropriate min_nodes
instead, but this option is available if you want more predictable billing. Beware that latency and error rates will increase if the traffic exceeds that capability of the system to serve it based on the selected number of nodes.
Parameters
argument
Manually select the number of nodes to use for serving the model. You should generally use auto_scaling
with an appropriate min_nodes
instead, but this option is available if you want more predictable billing. Beware that latency and error rates will increase if the traffic exceeds that capability of the system to serve it based on the selected number of nodes.