Mistral Voxtral
Voxtral is a family of open-weight audio AI models developed by Paris-based Mistral AI, covering speech-to-text (automatic speech recognition, ASR), speech understanding, and text-to-speech (TTS). First released in July 2025 as Mistral's debut audio model line, the family has since expanded to include Voxtral Transcribe 2 (February 2026) and Voxtral TTS (March 2026), and is positioned as a lower-cost open alternative to offerings such as OpenAI Whisper and ElevenLabs.
Voxtral launched on July 15, 2025 with Voxtral Small (24B parameters) and Voxtral Mini (3B parameters), both under the Apache 2.0 license. Voxtral Transcribe 2 (Mini Transcribe V2 and Realtime) followed on February 4, 2026, expanding language support to 13 languages and adding speaker diarization, context biasing, and sub-200 ms streaming latency. Voxtral TTS launched on March 26, 2026 as a 4B-parameter multilingual text-to-speech model under CC BY-NC 4.0. Parent company Mistral AI raised a €1.7B Series C in September 2025, led by ASML (€1.3B for a stake of about 11%), at a valuation of €11.7B ($13.8B), and added $830M in debt financing in March 2026. CMA CGM is a notable customer, using Voxtral for media workflows.