PySparkBatchResponse

data class PySparkBatchResponse(val archiveUris: List<String>, val args: List<String>, val fileUris: List<String>, val jarFileUris: List<String>, val mainPythonFileUri: String, val pythonFileUris: List<String>)

A configuration for running an Apache PySpark (https://spark.apache.org/docs/latest/api/python/getting_started/quickstart.html) batch workload.

Constructors

fun PySparkBatchResponse(archiveUris: List<String>, args: List<String>, fileUris: List<String>, jarFileUris: List<String>, mainPythonFileUri: String, pythonFileUris: List<String>)
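
A minimal sketch of how the constructor parameters fit together. Instances of this type are normally returned by the provider rather than built by hand, so the example redeclares a local stand-in class with the same signature to stay self-contained. The bucket names and paths are hypothetical.

```kotlin
// Local stand-in mirroring the documented signature. In a real project the
// class comes from the SDK and is not redeclared.
data class PySparkBatchResponse(
    val archiveUris: List<String>,
    val args: List<String>,
    val fileUris: List<String>,
    val jarFileUris: List<String>,
    val mainPythonFileUri: String,
    val pythonFileUris: List<String>,
)

fun main() {
    // Every URI below is a made-up placeholder.
    val batch = PySparkBatchResponse(
        archiveUris = listOf("gs://example-bucket/deps/env.tar.gz"),
        args = listOf("--input", "gs://example-bucket/in"),
        fileUris = emptyList(),
        jarFileUris = emptyList(),
        mainPythonFileUri = "gs://example-bucket/jobs/main.py",
        pythonFileUris = listOf("gs://example-bucket/jobs/helpers.zip"),
    )

    // Read the fields back as a consumer of the response would.
    println(batch.mainPythonFileUri)
    println(batch.pythonFileUris)
}
```

Optional list properties are represented here as empty lists; whether the provider returns empty lists or omits them is not stated on this page.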

Types

object Companion

Properties

val archiveUris: List<String>

Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.

val args: List<String>

Optional. The arguments to pass to the driver. Do not include arguments that can be set as batch properties, such as --conf, since a collision can occur that causes an incorrect batch submission.
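
The collision warning above can be checked before submission. The sketch below assumes a hypothetical helper, `findPropertyLikeArgs`, which is not part of the SDK, and it only looks for `--conf`, the single flag named on this page.

```kotlin
// Hypothetical helper: returns driver arguments that look like the --conf
// flag, either as a separate token or in the --conf=key=value form.
fun findPropertyLikeArgs(args: List<String>): List<String> =
    args.filter { it == "--conf" || it.startsWith("--conf=") }

fun main() {
    // Example driver arguments; the values are placeholders.
    val args = listOf("--input", "gs://example-bucket/in", "--conf", "spark.executor.cores=4")

    val offending = findPropertyLikeArgs(args)
    if (offending.isNotEmpty()) {
        println("Set these as batch properties instead of args: $offending")
    }
}
```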

val fileUris: List<String>

Optional. HCFS URIs of files to be placed in the working directory of each executor.

val jarFileUris: List<String>

Optional. HCFS URIs of jar files to add to the classpath of the Spark driver and tasks.

val mainPythonFileUri: String

The HCFS URI of the main Python file to use as the Spark driver. Must be a .py file.

val pythonFileUris: List<String>

Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip.
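
The file-type constraints in the property descriptions can be expressed as a simple suffix check. This is a sketch only: `hasAllowedExtension` is a hypothetical helper, the comparison is case-sensitive by assumption, and the service may apply different or additional validation.

```kotlin
// Extension lists copied from the property descriptions on this page.
val archiveExtensions = listOf(".jar", ".tar", ".tar.gz", ".tgz", ".zip")
val pythonFileExtensions = listOf(".py", ".egg", ".zip")
val mainPythonFileExtensions = listOf(".py")

// Hypothetical helper: true when the URI ends with one of the allowed suffixes.
fun hasAllowedExtension(uri: String, allowed: List<String>): Boolean =
    allowed.any { uri.endsWith(it) }

fun main() {
    // Placeholder URIs used only to exercise the checks.
    val mainFile = "gs://example-bucket/jobs/main.py"
    val archive = "gs://example-bucket/deps/env.tar.gz"
    val extra = "gs://example-bucket/jobs/notes.txt"

    println(hasAllowedExtension(mainFile, mainPythonFileExtensions))
    println(hasAllowedExtension(archive, archiveExtensions))
    println(hasAllowedExtension(extra, pythonFileExtensions))
}
```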