ModelContainerMultiModelConfigArgs

data class ModelContainerMultiModelConfigArgs(val modelCacheSetting: Output<String>? = null) : ConvertibleToJava<ModelContainerMultiModelConfigArgs>

Constructors

constructor(modelCacheSetting: Output<String>? = null)

Properties

val modelCacheSetting: Output<String>? = null

Whether to cache models for a multi-model endpoint. By default, multi-model endpoints cache models so that a model does not have to be loaded into memory each time it is invoked. Not every use case benefits from caching: an endpoint that hosts a large number of models, each invoked infrequently, might perform better with model caching disabled. To disable model caching, set this property to Disabled. Allowed values: Enabled, Disabled.
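Because the property is typed as a plain string, a typo in the value would not be caught at compile time. The following is a minimal sketch of one way to guard against that before the value reaches modelCacheSetting. ModelCacheSetting, fromWireValue, and requireValidCacheSetting are hypothetical names introduced here for illustration; they are not part of the SDK. The sketch also assumes that values must match the documented spelling exactly, including case.

```kotlin
// Hypothetical helper, NOT part of the SDK: models the two values that the
// property description lists as allowed for modelCacheSetting.
enum class ModelCacheSetting(val wireValue: String) {
    ENABLED("Enabled"),
    DISABLED("Disabled");

    companion object {
        // Returns the matching setting, or null if the string is not one of
        // the documented values. Matching is exact (assumption: case-sensitive).
        fun fromWireValue(value: String): ModelCacheSetting? =
            values().firstOrNull { it.wireValue == value }
    }
}

// Hypothetical guard: returns the string unchanged if it is an allowed value,
// otherwise throws IllegalArgumentException with a descriptive message.
fun requireValidCacheSetting(value: String): String {
    require(ModelCacheSetting.fromWireValue(value) != null) {
        "modelCacheSetting must be one of " +
            ModelCacheSetting.values().joinToString { it.wireValue } +
            ", got: $value"
    }
    return value
}
```

A validated string such as requireValidCacheSetting("Disabled") could then be wrapped in an Output (for example with Output.of) and passed as the modelCacheSetting constructor argument.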

Functions

open override fun toJava(): ModelContainerMultiModelConfigArgs