Package-level declarations

Types

enum DocumentClassifierAugmentedManifestsListItemSplit : Enum<DocumentClassifierAugmentedManifestsListItemSplit> , ConvertibleToJava<DocumentClassifierAugmentedManifestsListItemSplit>

The purpose of the data you've provided in the augmented manifest. You can either train or test this data. If you don't specify, the default is train. TRAIN - all of the documents in the manifest will be used for training. If no test documents are provided, Amazon Comprehend will automatically reserve a portion of the training documents for testing. TEST - all of the documents in the manifest will be used for testing.

enum DocumentClassifierDocumentReaderConfigDocumentReadAction : Enum<DocumentClassifierDocumentReaderConfigDocumentReadAction> , ConvertibleToJava<DocumentClassifierDocumentReaderConfigDocumentReadAction>

This field defines the Amazon Textract API operation that Amazon Comprehend uses to extract text from PDF files and image files. Enter one of the following values:

enum DocumentClassifierDocumentReaderConfigDocumentReadMode : Enum<DocumentClassifierDocumentReaderConfigDocumentReadMode> , ConvertibleToJava<DocumentClassifierDocumentReaderConfigDocumentReadMode>

Determines the text extraction actions for PDF files. Enter one of the following values:

enum DocumentClassifierDocumentReaderConfigFeatureTypesItem : Enum<DocumentClassifierDocumentReaderConfigFeatureTypesItem> , ConvertibleToJava<DocumentClassifierDocumentReaderConfigFeatureTypesItem>
enum DocumentClassifierInputDataConfigDataFormat : Enum<DocumentClassifierInputDataConfigDataFormat> , ConvertibleToJava<DocumentClassifierInputDataConfigDataFormat>

The format of your training data:

enum DocumentClassifierInputDataConfigDocumentType : Enum<DocumentClassifierInputDataConfigDocumentType> , ConvertibleToJava<DocumentClassifierInputDataConfigDocumentType>

The type of input documents for training the model. Provide plain-text documents to create a plain-text model, and provide semi-structured documents to create a native document model.

Link copied to clipboard
enum DocumentClassifierLanguageCode : Enum<DocumentClassifierLanguageCode> , ConvertibleToJava<DocumentClassifierLanguageCode>

The language of the input documents. You can specify any of the languages supported by Amazon Comprehend. All documents must be in the same language.

Link copied to clipboard
enum DocumentClassifierMode : Enum<DocumentClassifierMode> , ConvertibleToJava<DocumentClassifierMode>

Indicates the mode in which the classifier will be trained. The classifier can be trained in multi-class (single-label) mode or multi-label mode. Multi-class mode identifies a single class label for each document and multi-label mode identifies one or more class labels for each document. Multiple labels for an individual document are separated by a delimiter. The default delimiter between labels is a pipe (|).

Link copied to clipboard
enum FlywheelDocumentClassificationConfigMode : Enum<FlywheelDocumentClassificationConfigMode> , ConvertibleToJava<FlywheelDocumentClassificationConfigMode>

Classification mode indicates whether the documents are MULTI_CLASS or MULTI_LABEL .

Link copied to clipboard
enum FlywheelModelType : Enum<FlywheelModelType> , ConvertibleToJava<FlywheelModelType>

Model type of the flywheel's model.

Link copied to clipboard
enum FlywheelTaskConfigLanguageCode : Enum<FlywheelTaskConfigLanguageCode> , ConvertibleToJava<FlywheelTaskConfigLanguageCode>

Language code for the language that the model supports.