Model

Voxtral Small

Family
Voxtral
Context window
32,000 tokens
Open weights
Yes
Release date
2025-07-15

Benchmark notes

24B-parameter speech-understanding model built on the Mistral Small 3.1 backbone; handles up to 30 minutes of audio for transcription and up to 40 minutes for understanding; reported to match ElevenLabs Scribe and to outperform Whisper large-v3 at lower cost.
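
The audio limits in the notes above can be turned into a simple pre-flight check. The following is a minimal sketch, not part of any Voxtral or Mistral API: the names MAX_MINUTES, fits_audio_limit, and chunks_needed are hypothetical, and the splitting policy (cut the clip into pieces no longer than the limit) is an assumption for illustration only.

```python
import math

# Limits taken from the benchmark notes above (minutes of audio per request).
# The dictionary name and keys are hypothetical, chosen for this sketch.
MAX_MINUTES = {"transcription": 30, "understanding": 40}


def fits_audio_limit(duration_seconds: float, task: str) -> bool:
    """Return True if a clip of this length is within the stated limit for the task."""
    if task not in MAX_MINUTES:
        raise ValueError(f"unknown task: {task!r}")
    if duration_seconds < 0:
        raise ValueError("duration must be non-negative")
    return duration_seconds <= MAX_MINUTES[task] * 60


def chunks_needed(duration_seconds: float, task: str) -> int:
    """Number of pieces required so that each piece fits the limit.

    Assumes a naive policy of splitting into pieces no longer than the limit;
    a real pipeline would probably split on silence or speaker turns instead.
    """
    if not fits_audio_limit(0, task):  # validates the task name
        raise ValueError(f"unknown task: {task!r}")
    limit_seconds = MAX_MINUTES[task] * 60
    return max(1, math.ceil(duration_seconds / limit_seconds))
```

For example, a 35-minute recording passes the check for understanding but would need two pieces for transcription under this assumed policy.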