Description

Voxtral is a family of open-source speech understanding models released under the Apache 2.0 license in two sizes: a 24B variant aimed at production-scale use and a 3B variant suited to local and edge deployments. Beyond accurate transcription, the models offer built-in semantic understanding: they handle long-form context of up to 32K tokens, answer questions about audio content, and produce structured summaries. They detect the spoken language automatically across a range of major languages and support direct function calling, so voice commands can trigger backend workflows. Voxtral retains the text capabilities of the Mistral Small 3.1 architecture and can process audio of up to 30 minutes for transcription and up to 40 minutes for understanding. It reportedly outperforms both open-source and proprietary competitors on benchmarks such as LibriSpeech, Mozilla Common Voice, and FLEURS. Voxtral is available as a download on Hugging Face, through API endpoints, or as a private on-premises deployment, with options for domain-specific fine-tuning and additional enterprise features.

Description

Yak is a voice-driven productivity tool designed to speed up how you interact with your computer. It combines accurate, fast transcription with AI auto-editing that removes filler words, false starts, and self-corrections, and it formats numbers and symbols automatically. It also builds a personal dictionary through auto-detection, offers context-sensitive styles, supports bring-your-own-key (BYOK) mode, and provides smart voice commands. Users can launch applications and perform tasks by voice, similar to Raycast but hands-free. Yak is designed for professionals who type extensively and for power users who rely on AI. According to the vendor, no data is retained on its servers.

API Access

Has API

Integrations

Hugging Face
LazyTyper
Mistral AI

Pricing Details (Voxtral)

No price information available.
Free Trial
Free Version

Pricing Details (Yak)

$12/month/user
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Mistral AI

Founded

2023

Country

France

Website

mistral.ai/news/voxtral

Vendor Details

Company Name

Yak

Founded

2026

Website

getyak.app/

Product Features

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Alternatives

Voxtral TTS
Mistral AI

Azure AI Speech
Microsoft