EndpointConfigurationAsyncInferenceConfigClientConfig

data class EndpointConfigurationAsyncInferenceConfigClientConfig(val maxConcurrentInvocationsPerInstance: Int? = null)

Constructors

constructor(maxConcurrentInvocationsPerInstance: Int? = null)

Types

object Companion

Properties


val maxConcurrentInvocationsPerInstance: Int? = null

The maximum number of concurrent requests sent by the SageMaker AI client to the model container. If no value is provided (the default, null), Amazon SageMaker AI chooses an optimal value for you.
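The nullable default can be illustrated with a minimal sketch. It declares a local stand-in data class with the same signature as the one documented here, so that it runs without the real library on the classpath; the `describe` helper is hypothetical and not part of the library.

```kotlin
// Local stand-in mirroring the documented signature (an assumption made so
// the snippet is self-contained; the real class comes from the library).
data class EndpointConfigurationAsyncInferenceConfigClientConfig(
    val maxConcurrentInvocationsPerInstance: Int? = null,
)

// Hypothetical helper, not part of the library: summarizes the effective setting.
fun describe(config: EndpointConfigurationAsyncInferenceConfigClientConfig): String =
    config.maxConcurrentInvocationsPerInstance
        ?.let { "at most $it concurrent invocations per instance" }
        ?: "concurrency limit chosen by SageMaker AI"

fun main() {
    // No argument: the property stays null and the service picks the value.
    val defaults = EndpointConfigurationAsyncInferenceConfigClientConfig()

    // copy() comes with every Kotlin data class; here it sets an explicit cap.
    val capped = defaults.copy(maxConcurrentInvocationsPerInstance = 4)

    println(describe(defaults))
    println(describe(capped))
}
```

Leaving the property unset is the simplest choice when no specific limit is needed; an explicit value is only required when the model container must be protected from more concurrent requests than it can handle.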