Inference Component Compute Resource Requirements Args
data class InferenceComponentComputeResourceRequirementsArgs(val maxMemoryRequiredInMb: Output<Int>? = null, val minMemoryRequiredInMb: Output<Int>? = null, val numberOfAcceleratorDevicesRequired: Output<Double>? = null, val numberOfCpuCoresRequired: Output<Double>? = null) : ConvertibleToJava<InferenceComponentComputeResourceRequirementsArgs>
Properties
val maxMemoryRequiredInMb: Output<Int>? = null
The maximum amount of memory, in MB, to allocate to run a model that you assign to an inference component.
val minMemoryRequiredInMb: Output<Int>? = null
The minimum amount of memory, in MB, to allocate to run a model that you assign to an inference component.
val numberOfAcceleratorDevicesRequired: Output<Double>? = null
The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and AWS Inferentia.
val numberOfCpuCoresRequired: Output<Double>? = null
The number of CPU cores to allocate to run a model that you assign to an inference component.
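A minimal sketch of constructing this class directly through the data-class constructor shown in the signature above. The import path for the args class is an assumption based on the usual Pulumi Kotlin package layout and may differ in your SDK version; the numeric values are purely illustrative, not sizing recommendations.

```kotlin
import com.pulumi.core.Output
// Assumed package; check it against the SDK version you have installed.
import com.pulumi.aws.sagemaker.kotlin.inputs.InferenceComponentComputeResourceRequirementsArgs

// Every parameter is optional and defaults to null, so only the
// requirements you care about need to be set.
val requirements = InferenceComponentComputeResourceRequirementsArgs(
    // Memory bounds are whole numbers of MB (Output<Int>).
    maxMemoryRequiredInMb = Output.of(2048),
    minMemoryRequiredInMb = Output.of(1024),
    // Accelerator and CPU counts are Output<Double>, so use Double literals.
    numberOfAcceleratorDevicesRequired = Output.of(1.0),
    numberOfCpuCoresRequired = Output.of(2.0),
)
```

Note that the two memory properties are typed `Output<Int>` while the accelerator and CPU core counts are typed `Output<Double>`, so passing an integer literal such as `2` for `numberOfCpuCoresRequired` will not type-check.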