Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

Subanana is a cutting-edge web application designed for converting audio and video content into subtitles, transcripts, and meeting summaries, supporting over 80 languages with exceptional accuracy, particularly for Asian and mixed-language speech like Cantonese, Mandarin, Japanese, and Korean, which are often inadequately addressed by English-centric tools. Users can easily import files or links from platforms like YouTube, Instagram, or Facebook to create subtitles, which can be customized with a glossary and AI-driven corrections before being exported in various formats such as SRT, VTT, TXT, DOCX, bilingual subtitles, or as burned-in video. For transcripts, the app offers features like speaker identification, the elimination of filler words, and the automatic addition of punctuation and paragraph breaks for clarity. Additionally, it provides templates for meeting summaries that capture decisions and action items, along with a unique bot that integrates with Google Meet and Microsoft Teams to analyze recordings after meetings conclude. Furthermore, Subanana offers live captioning services that provide real-time translations during events, enhancing accessibility and understanding for diverse audiences.

Description

Voxtral models represent cutting-edge open-source systems designed for speech understanding, available in two sizes: a larger 24 B variant aimed at production-scale use and a smaller 3 B variant suitable for local and edge applications, both of which are provided under the Apache 2.0 license. These models excel in delivering precise transcription while featuring inherent semantic comprehension, accommodating long-form contexts of up to 32 K tokens and incorporating built-in question-and-answer capabilities along with structured summarization. They automatically detect languages across a range of major tongues and enable direct function-calling to activate backend workflows through voice commands. Retaining the textual strengths of their Mistral Small 3.1 architecture, Voxtral can process audio inputs of up to 30 minutes for transcription tasks and up to 40 minutes for comprehension, consistently surpassing both open-source and proprietary competitors in benchmarks like LibriSpeech, Mozilla Common Voice, and FLEURS. Users can access Voxtral through downloads on Hugging Face, API endpoints, or by utilizing private on-premises deployments, and the model also provides options for domain-specific fine-tuning along with advanced features tailored for enterprise needs, thus enhancing its applicability across various sectors.

API Access

Has API

API Access

Has API

Screenshots View All

No images available

Screenshots View All

Integrations

Hugging Face
LazyTyper
Mistral AI
Vision Agents

Integrations

Hugging Face
LazyTyper
Mistral AI
Vision Agents

Pricing Details

$9/month
Free Trial
Free Version

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Datax Limited

Country

Hong Kong

Website

subanana.com

Vendor Details

Company Name

Mistral AI

Founded

2023

Country

France

Website

mistral.ai/news/voxtral

Product Features

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Product Features

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Alternatives

Alternatives

Silkwave Voice Reviews

Silkwave Voice

Silkwave
Azure AI Speech Reviews

Azure AI Speech

Microsoft
Utterly Reviews

Utterly

Semantic Bridge LLC