<ul><li>ElevenLabs has launched Scribe v1, a speech-to-text model with the highest accuracy rate so far, achieving 96.7% accuracy for English.</li><li>Scribe outperforms Google's Gemini 2.0 Flash, OpenAI's Whisper v3, and Deepgram Nova-3 in accurately converting spoken speech into text.</li><li>The model delivers state-of-the-art transcription accuracy in 99 languages and can distinguish and isolate up to 32 different speakers in the same audio file.</li><li>Scribe is available now through the ElevenLabs website and API, with pricing set at $0.40 per hour of input audio.</li></ul>

ElevenLabs’ new speech-to-text model Scribe is here with highest accuracy rate so far (96.7% for English)

Discover more