GoogleCloudDialogflowCxV3InputAudioConfigResponse

data class GoogleCloudDialogflowCxV3InputAudioConfigResponse(val audioEncoding: String, val enableWordInfo: Boolean, val model: String, val modelVariant: String, val phraseHints: List<String>, val sampleRateHertz: Int, val singleUtterance: Boolean)

Instructs the speech recognizer on how to process the audio content.

Constructors

fun GoogleCloudDialogflowCxV3InputAudioConfigResponse(audioEncoding: String, enableWordInfo: Boolean, model: String, modelVariant: String, phraseHints: List<String>, sampleRateHertz: Int, singleUtterance: Boolean)

Types

Link copied to clipboard
object Companion

Properties

Link copied to clipboard

Audio encoding of the audio content to process.

Link copied to clipboard

Optional. If true, Dialogflow returns SpeechWordInfo in StreamingRecognitionResult with information about the recognized speech words, e.g. start and end time offsets. If false or unspecified, Speech doesn't return any word-level information.

Link copied to clipboard

Optional. Which Speech model to select for the given request. Select the model best suited to your domain to get best results. If a model is not explicitly specified, then we auto-select a model based on the parameters in the InputAudioConfig. If enhanced speech model is enabled for the agent and an enhanced version of the specified model for the language does not exist, then the speech is recognized using the standard version of the specified model. Refer to Cloud Speech API documentation for more details. If you specify a model, the following models typically have the best performance: - phone_call (best for Agent Assist and telephony) - latest_short (best for Dialogflow non-telephony) - command_and_search (best for very short utterances and commands)

Link copied to clipboard

Optional. Which variant of the Speech model to use.

Link copied to clipboard

Optional. A list of strings containing words and phrases that the speech recognizer should recognize with higher likelihood. See the Cloud Speech documentation for more details.

Link copied to clipboard

Sample rate (in Hertz) of the audio content sent in the query. Refer to Cloud Speech API documentation for more details.

Link copied to clipboard

Optional. If false (default), recognition does not cease until the client closes the stream. If true, the recognizer will detect a single spoken utterance in input audio. Recognition ceases when it detects the audio's voice has stopped or paused. In this case, once a detected intent is received, the client should close the stream and start a new request with a new stream as needed. Note: This setting is relevant only for streaming methods.