Description (Voxtral)

Voxtral is a family of open-source speech understanding models from Mistral AI, released under the Apache 2.0 license in two sizes: a 24B variant for production-scale use and a 3B variant for local and edge deployment. Beyond transcription, the models offer built-in semantic understanding: they handle long-form context of up to 32K tokens, answer questions about audio content, and produce structured summaries. They detect the spoken language automatically across a range of widely used languages and support function calling directly from voice, so spoken commands can trigger backend workflows. Retaining the text capabilities of the Mistral Small 3.1 architecture, Voxtral processes audio of up to 30 minutes for transcription and up to 40 minutes for understanding. It is reported to outperform both open-source and proprietary competitors on benchmarks such as LibriSpeech, Mozilla Common Voice, and FLEURS. Voxtral is available as a download from Hugging Face, through API endpoints, or as a private on-premises deployment, with options for domain-specific fine-tuning and additional enterprise features.

Description (Voxtral Transcribe 2)

Voxtral Transcribe 2 is Mistral AI's suite of speech-to-text models for fast, high-quality transcription and speaker identification across multiple languages. It comprises two models. Voxtral Mini Transcribe V2 targets batch transcription and offers word-level timestamps, context biasing, and support for 13 languages. Voxtral Realtime targets live speech recognition, with configurable latency that can drop below 200 ms. Both are positioned as accurate, efficient, and cost-effective: Mini Transcribe V2 is noted for its low error rates, while Realtime is released as open source under the Apache 2.0 license, so developers can deploy it on edge devices or in secure environments.

API Access

Has API

Integrations

Hugging Face
LazyTyper
Mistral AI

Pricing Details (Voxtral)

No price information available.
Free Trial
Free Version

Pricing Details (Voxtral Transcribe 2)

$14.99 per month
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details (Voxtral)

Company Name

Mistral AI

Founded

2023

Country

France

Website

mistral.ai/news/voxtral

Vendor Details (Voxtral Transcribe 2)

Company Name

Mistral AI

Founded

2023

Country

France

Website

mistral.ai/news/voxtral-transcribe-2

Product Features (Voxtral)

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Alternatives

Scribe (ElevenLabs)
Voxtral (Mistral AI)
Azure AI Speech (Microsoft)