Performs all operations included in Basic mode, and additionally performs language detection and speaker diarization.