Create Audio Transcription
Transcribe audio files to text using Mistral AI's Voxtral models. Use this action to convert speech in audio files to written text. Supports multiple input methods: file upload, file_id from previously uploaded files, or file_url for publicly accessible audio. The transcription supports 13 languages with automatic language detection, speaker diarization, and configurable timestamp granularities (segment or word level).

Key features:
- Multi-language support (English, Chinese, Hindi, Spanish, Arabic, French, Portuguese, Russian, German, Japanese, Korean, Italian, Dutch)
- Speaker diarization to identify different speakers
- Word and segment-level timestamps
- Context biasing for domain-specific terminology
- Supports common audio formats (mp3, wav, m4a, etc.)

Note: Exactly one of file, file_id, or file_url must be provided. The action does not support streaming mode.
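
The "exactly one of file, file_id, or file_url" rule can be checked on the client before calling the action. The following is a minimal sketch, not the action's actual schema: the helper name build_transcription_params and the parameter names model, language, diarize, and timestamp_granularities are assumptions made for illustration. Only file, file_id, file_url, and the segment/word granularities come from the description above.

```python
from typing import Any, Dict, List, Optional

# Granularity levels named in the description above.
SUPPORTED_GRANULARITIES = {"segment", "word"}


def build_transcription_params(
    model: str,
    file: Optional[bytes] = None,
    file_id: Optional[str] = None,
    file_url: Optional[str] = None,
    language: Optional[str] = None,                       # assumed parameter name
    diarize: bool = False,                                # assumed parameter name
    timestamp_granularities: Optional[List[str]] = None,  # assumed parameter name
) -> Dict[str, Any]:
    """Validate inputs and return a parameter dict (hypothetical helper)."""
    sources = {"file": file, "file_id": file_id, "file_url": file_url}
    provided = [name for name, value in sources.items() if value is not None]

    # Enforce the documented rule: exactly one audio source.
    if len(provided) != 1:
        raise ValueError(
            "Exactly one of file, file_id, or file_url must be provided; "
            f"got {provided or 'none'}"
        )

    params: Dict[str, Any] = {"model": model, provided[0]: sources[provided[0]]}

    # Omit language to rely on automatic language detection.
    if language is not None:
        params["language"] = language
    if diarize:
        params["diarize"] = True

    if timestamp_granularities:
        unknown = set(timestamp_granularities) - SUPPORTED_GRANULARITIES
        if unknown:
            raise ValueError(f"Unsupported timestamp granularities: {sorted(unknown)}")
        params["timestamp_granularities"] = list(timestamp_granularities)

    return params


# Usage: a publicly accessible URL with word-level timestamps.
# The model string is a placeholder, not a confirmed model identifier.
example = build_transcription_params(
    model="placeholder-voxtral-model",
    file_url="https://example.com/meeting.mp3",
    timestamp_granularities=["word"],
)
```

Validating before the call surfaces a missing or duplicated audio source as a local ValueError rather than as a rejected request.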