pulumi-google-native-kotlin/com.pulumi.googlenative.bigquery.v2.kotlin.inputs/JobConfigurationLoadArgs

JobConfigurationLoadArgs

data class JobConfigurationLoadArgs(val allowJaggedRows: Output<Boolean>? = null, val allowQuotedNewlines: Output<Boolean>? = null, val autodetect: Output<Boolean>? = null, val clustering: Output<ClusteringArgs>? = null, val connectionProperties: Output<List<ConnectionPropertyArgs>>? = null, val createDisposition: Output<String>? = null, val createSession: Output<Boolean>? = null, val decimalTargetTypes: Output<List<String>>? = null, val destinationEncryptionConfiguration: Output<EncryptionConfigurationArgs>? = null, val destinationTable: Output<TableReferenceArgs>? = null, val destinationTableProperties: Output<DestinationTablePropertiesArgs>? = null, val encoding: Output<String>? = null, val fieldDelimiter: Output<String>? = null, val hivePartitioningOptions: Output<HivePartitioningOptionsArgs>? = null, val ignoreUnknownValues: Output<Boolean>? = null, val jsonExtension: Output<String>? = null, val maxBadRecords: Output<Int>? = null, val nullMarker: Output<String>? = null, val parquetOptions: Output<ParquetOptionsArgs>? = null, val preserveAsciiControlCharacters: Output<Boolean>? = null, val projectionFields: Output<List<String>>? = null, val quote: Output<String>? = null, val rangePartitioning: Output<RangePartitioningArgs>? = null, val referenceFileSchemaUri: Output<String>? = null, val schema: Output<TableSchemaArgs>? = null, val schemaInline: Output<String>? = null, val schemaInlineFormat: Output<String>? = null, val schemaUpdateOptions: Output<List<String>>? = null, val skipLeadingRows: Output<Int>? = null, val sourceFormat: Output<String>? = null, val sourceUris: Output<List<String>>? = null, val timePartitioning: Output<TimePartitioningArgs>? = null, val useAvroLogicalTypes: Output<Boolean>? = null, val writeDisposition: Output<String>? = null) : ConvertibleToJava<JobConfigurationLoadArgs>

Constructors

constructor(allowJaggedRows: Output<Boolean>? = null, allowQuotedNewlines: Output<Boolean>? = null, autodetect: Output<Boolean>? = null, clustering: Output<ClusteringArgs>? = null, connectionProperties: Output<List<ConnectionPropertyArgs>>? = null, createDisposition: Output<String>? = null, createSession: Output<Boolean>? = null, decimalTargetTypes: Output<List<String>>? = null, destinationEncryptionConfiguration: Output<EncryptionConfigurationArgs>? = null, destinationTable: Output<TableReferenceArgs>? = null, destinationTableProperties: Output<DestinationTablePropertiesArgs>? = null, encoding: Output<String>? = null, fieldDelimiter: Output<String>? = null, hivePartitioningOptions: Output<HivePartitioningOptionsArgs>? = null, ignoreUnknownValues: Output<Boolean>? = null, jsonExtension: Output<String>? = null, maxBadRecords: Output<Int>? = null, nullMarker: Output<String>? = null, parquetOptions: Output<ParquetOptionsArgs>? = null, preserveAsciiControlCharacters: Output<Boolean>? = null, projectionFields: Output<List<String>>? = null, quote: Output<String>? = null, rangePartitioning: Output<RangePartitioningArgs>? = null, referenceFileSchemaUri: Output<String>? = null, schema: Output<TableSchemaArgs>? = null, schemaInline: Output<String>? = null, schemaInlineFormat: Output<String>? = null, schemaUpdateOptions: Output<List<String>>? = null, skipLeadingRows: Output<Int>? = null, sourceFormat: Output<String>? = null, sourceUris: Output<List<String>>? = null, timePartitioning: Output<TimePartitioningArgs>? = null, useAvroLogicalTypes: Output<Boolean>? = null, writeDisposition: Output<String>? = null)

Properties

allowJaggedRows

val allowJaggedRows: Output<Boolean>? = null

Optional Accept rows that are missing trailing optional columns. The missing values are treated as nulls. If false, records with missing trailing columns are treated as bad records, and if there are too many bad records, an invalid error is returned in the job result. The default value is false. Only applicable to CSV, ignored for other formats.

allowQuotedNewlines

val allowQuotedNewlines: Output<Boolean>? = null

Indicates if BigQuery should allow quoted data sections that contain newline characters in a CSV file. The default value is false.

autodetect

val autodetect: Output<Boolean>? = null

Optional Indicates if we should automatically infer the options and schema for CSV and JSON sources.

clustering

val clustering: Output<ClusteringArgs>? = null

Beta Clustering specification for the destination table. Must be specified with time-based partitioning, data in the table will be first partitioned and subsequently clustered.

connectionProperties

val connectionProperties: Output<List<ConnectionPropertyArgs>>? = null

Connection properties.

createDisposition

val createDisposition: Output<String>? = null

Optional Specifies whether the job is allowed to create new tables. The following values are supported: CREATE_IF_NEEDED: If the table does not exist, BigQuery creates the table. CREATE_NEVER: The table must already exist. If it does not, a 'notFound' error is returned in the job result. The default value is CREATE_IF_NEEDED. Creation, truncation and append actions occur as one atomic update upon job completion.

createSession

val createSession: Output<Boolean>? = null

If true, creates a new session, where session id will be a server generated random id. If false, runs query with an existing session_id passed in ConnectionProperty, otherwise runs the load job in non-session mode.

decimalTargetTypes

val decimalTargetTypes: Output<List<String>>? = null

Optional Defines the list of possible SQL data types to which the source decimal values are converted. This list and the precision and the scale parameters of the decimal field determine the target type. In the order of NUMERIC, BIGNUMERIC, and STRING, a type is picked if it is in the specified list and if it supports the precision and the scale. STRING supports all precision and scale values. If none of the listed types supports the precision and the scale, the type supporting the widest range in the specified list is picked, and if a value exceeds the supported range when reading the data, an error will be thrown. Example: Suppose the value of this field is "NUMERIC", "BIGNUMERIC". If (precision,scale) is: (38,9) -> NUMERIC; (39,9) -> BIGNUMERIC (NUMERIC cannot hold 30 integer digits); (38,10) -> BIGNUMERIC (NUMERIC cannot hold 10 fractional digits); (76,38) -> BIGNUMERIC; (77,38) -> BIGNUMERIC (error if value exeeds supported range). This field cannot contain duplicate types. The order of the types in this field is ignored. For example, "BIGNUMERIC", "NUMERIC" is the same as "NUMERIC", "BIGNUMERIC" and NUMERIC always takes precedence over BIGNUMERIC. Defaults to "NUMERIC", "STRING" for ORC and "NUMERIC" for the other file formats.

destinationEncryptionConfiguration

val destinationEncryptionConfiguration: Output<EncryptionConfigurationArgs>? = null

Custom encryption configuration (e.g., Cloud KMS keys).

destinationTable

val destinationTable: Output<TableReferenceArgs>? = null

Required The destination table to load the data into.

destinationTableProperties

val destinationTableProperties: Output<DestinationTablePropertiesArgs>? = null

Optional Properties with which to create the destination table if it is new.

encoding

val encoding: Output<String>? = null

Optional The character encoding of the data. The supported values are UTF-8 or ISO-8859-1. The default value is UTF-8. BigQuery decodes the data after the raw, binary data has been split using the values of the quote and fieldDelimiter properties.

fieldDelimiter

val fieldDelimiter: Output<String>? = null

Optional The separator for fields in a CSV file. The separator can be any ISO-8859-1 single-byte character. To use a character in the range 128-255, you must encode the character as UTF8. BigQuery converts the string to ISO-8859-1 encoding, and then uses the first byte of the encoded string to split the data in its raw, binary state. BigQuery also supports the escape sequence "\t" to specify a tab separator. The default value is a comma (',').

hivePartitioningOptions

val hivePartitioningOptions: Output<HivePartitioningOptionsArgs>? = null

Optional Options to configure hive partitioning support.

ignoreUnknownValues

val ignoreUnknownValues: Output<Boolean>? = null

Optional Indicates if BigQuery should allow extra values that are not represented in the table schema. If true, the extra values are ignored. If false, records with extra columns are treated as bad records, and if there are too many bad records, an invalid error is returned in the job result. The default value is false. The sourceFormat property determines what BigQuery treats as an extra value: CSV: Trailing columns JSON: Named values that don't match any column names

jsonExtension

val jsonExtension: Output<String>? = null

Optional If sourceFormat is set to newline-delimited JSON, indicates whether it should be processed as a JSON variant such as GeoJSON. For a sourceFormat other than JSON, omit this field. If the sourceFormat is newline-delimited JSON: - for newline-delimited GeoJSON: set to GEOJSON.

maxBadRecords

val maxBadRecords: Output<Int>? = null

Optional The maximum number of bad records that BigQuery can ignore when running the job. If the number of bad records exceeds this value, an invalid error is returned in the job result. This is only valid for CSV and JSON. The default value is 0, which requires that all records are valid.

nullMarker

val nullMarker: Output<String>? = null

Optional Specifies a string that represents a null value in a CSV file. For example, if you specify "\N", BigQuery interprets "\N" as a null value when loading a CSV file. The default value is the empty string. If you set this property to a custom value, BigQuery throws an error if an empty string is present for all data types except for STRING and BYTE. For STRING and BYTE columns, BigQuery interprets the empty string as an empty value.

parquetOptions

val parquetOptions: Output<ParquetOptionsArgs>? = null

Optional Options to configure parquet support.

preserveAsciiControlCharacters

val preserveAsciiControlCharacters: Output<Boolean>? = null

Optional Preserves the embedded ASCII control characters (the first 32 characters in the ASCII-table, from '\x00' to '\x1F') when loading from CSV. Only applicable to CSV, ignored for other formats.

projectionFields

val projectionFields: Output<List<String>>? = null

If sourceFormat is set to "DATASTORE_BACKUP", indicates which entity properties to load into BigQuery from a Cloud Datastore backup. Property names are case sensitive and must be top-level properties. If no properties are specified, BigQuery loads all properties. If any named property isn't found in the Cloud Datastore backup, an invalid error is returned in the job result.

quote

val quote: Output<String>? = null

Optional The value that is used to quote data sections in a CSV file. BigQuery converts the string to ISO-8859-1 encoding, and then uses the first byte of the encoded string to split the data in its raw, binary state. The default value is a double-quote ('"'). If your data does not contain quoted sections, set the property value to an empty string. If your data contains quoted newline characters, you must also set the allowQuotedNewlines property to true.

rangePartitioning

val rangePartitioning: Output<RangePartitioningArgs>? = null

TrustedTester Range partitioning specification for this table. Only one of timePartitioning and rangePartitioning should be specified.

referenceFileSchemaUri

val referenceFileSchemaUri: Output<String>? = null

User provided referencing file with the expected reader schema, Available for the format: AVRO, PARQUET, ORC.

schema

val schema: Output<TableSchemaArgs>? = null

Optional The schema for the destination table. The schema can be omitted if the destination table already exists, or if you're loading data from Google Cloud Datastore.

schemaInline

val ~~schemaInline~~: Output<String>? = null

Deprecated The inline schema. For CSV schemas, specify as "Field1:Type1,Field2:Type2*". For example, "foo:STRING, bar:INTEGER, baz:FLOAT".

schemaInlineFormat

val ~~schemaInlineFormat~~: Output<String>? = null

Deprecated The format of the schemaInline property.

schemaUpdateOptions

val schemaUpdateOptions: Output<List<String>>? = null

Allows the schema of the destination table to be updated as a side effect of the load job if a schema is autodetected or supplied in the job configuration. Schema update options are supported in two cases: when writeDisposition is WRITE_APPEND; when writeDisposition is WRITE_TRUNCATE and the destination table is a partition of a table, specified by partition decorators. For normal tables, WRITE_TRUNCATE will always overwrite the schema. One or more of the following values are specified: ALLOW_FIELD_ADDITION: allow adding a nullable field to the schema. ALLOW_FIELD_RELAXATION: allow relaxing a required field in the original schema to nullable.

skipLeadingRows

val skipLeadingRows: Output<Int>? = null

Optional The number of rows at the top of a CSV file that BigQuery will skip when loading the data. The default value is 0. This property is useful if you have header rows in the file that should be skipped.

sourceFormat

val sourceFormat: Output<String>? = null

Optional The format of the data files. For CSV files, specify "CSV". For datastore backups, specify "DATASTORE_BACKUP". For newline-delimited JSON, specify "NEWLINE_DELIMITED_JSON". For Avro, specify "AVRO". For parquet, specify "PARQUET". For orc, specify "ORC". The default value is CSV.

sourceUris

val sourceUris: Output<List<String>>? = null

Required The fully-qualified URIs that point to your data in Google Cloud. For Google Cloud Storage URIs: Each URI can contain one '' wildcard character and it must come after the 'bucket' name. Size limits related to load jobs apply to external data sources. For Google Cloud Bigtable URIs: Exactly one URI can be specified and it has be a fully specified and valid HTTPS URL for a Google Cloud Bigtable table. For Google Cloud Datastore backups: Exactly one URI can be specified. Also, the '' wildcard character is not allowed.

timePartitioning

val timePartitioning: Output<TimePartitioningArgs>? = null

Time-based partitioning specification for the destination table. Only one of timePartitioning and rangePartitioning should be specified.

useAvroLogicalTypes

val useAvroLogicalTypes: Output<Boolean>? = null

Optional If sourceFormat is set to "AVRO", indicates whether to interpret logical types as the corresponding BigQuery data type (for example, TIMESTAMP), instead of using the raw type (for example, INTEGER).

writeDisposition

val writeDisposition: Output<String>? = null

Optional Specifies the action that occurs if the destination table already exists. The following values are supported: WRITE_TRUNCATE: If the table already exists, BigQuery overwrites the table data. WRITE_APPEND: If the table already exists, BigQuery appends the data to the table. WRITE_EMPTY: If the table already exists and contains data, a 'duplicate' error is returned in the job result. The default value is WRITE_APPEND. Each action is atomic and only occurs if BigQuery is able to complete the job successfully. Creation, truncation and append actions occur as one atomic update upon job completion.

Functions

toJava

open override fun toJava(): JobConfigurationLoadArgs