The Kubernetes Engine config for Dataproc clusters deployed to Kubernetes. Setting this is considered mutually exclusive with Compute Engine-based options such as gce_cluster_config, master_config, worker_config, secondary_worker_config, and autoscaling_config.

initializationActions

val initializationActions: List<WorkflowTemplatePlacementManagedClusterConfigInitializationAction>? = null

Commands to execute on each node after config is completed. By default, executables are run on master and all worker nodes. You can test a node's role metadata to run an executable on a master or worker node, as shown below using curl (you can also use wget): ROLE=$(curl -H Metadata-Flavor:Google http://metadata/computeMetadata/v1/instance/attributes/dataproc-role) if ; then ... master specific actions ... else ... worker specific actions ... fi

lifecycleConfig

val lifecycleConfig: WorkflowTemplatePlacementManagedClusterConfigLifecycleConfig? = null

Lifecycle setting for the cluster.

masterConfig

val masterConfig: WorkflowTemplatePlacementManagedClusterConfigMasterConfig? = null

The Compute Engine config settings for additional worker instances in a cluster.

metastoreConfig

val metastoreConfig: WorkflowTemplatePlacementManagedClusterConfigMetastoreConfig? = null

Metastore configuration.

secondaryWorkerConfig

val secondaryWorkerConfig: WorkflowTemplatePlacementManagedClusterConfigSecondaryWorkerConfig? = null

The Compute Engine config settings for additional worker instances in a cluster.

securityConfig

val securityConfig: WorkflowTemplatePlacementManagedClusterConfigSecurityConfig? = null

Security settings for the cluster.

softwareConfig

val softwareConfig: WorkflowTemplatePlacementManagedClusterConfigSoftwareConfig? = null

The config settings for software inside the cluster.

stagingBucket

val stagingBucket: String? = null

A Cloud Storage bucket used to stage job dependencies, config files, and job driver console output. If you do not specify a staging bucket, Cloud Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's staging bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket (see (https://cloud.google.com/dataproc/docs/concepts/configuring-clusters/staging-bucket)).

tempBucket

val tempBucket: String? = null

A Cloud Storage bucket used to store ephemeral cluster and jobs data, such as Spark and MapReduce history files. If you do not specify a temp bucket, Dataproc will determine a Cloud Storage location (US, ASIA, or EU) for your cluster's temp bucket according to the Compute Engine zone where your cluster is deployed, and then create and manage this project-level, per-location bucket. The default bucket has a TTL of 90 days, but you can use any TTL (or none) if you specify a bucket.

workerConfig

val workerConfig: WorkflowTemplatePlacementManagedClusterConfigWorkerConfig? = null

The Compute Engine config settings for additional worker instances in a cluster.