Description

ElevenLabs has released Scribe, an Automatic Speech Recognition (ASR) model that transcribes speech in 99 languages. It is built to handle a wide range of real-world audio and provides word-level timestamps, speaker identification, and audio-event tagging. In benchmark evaluations such as FLEURS and Common Voice, Scribe is reported to outperform leading models, including Gemini 2.0 Flash, Whisper Large V3, and Deepgram Nova-3, reaching accuracy rates of 98.7% for Italian and 96.7% for English (that is, word error rates of roughly 1.3% and 3.3%). Scribe also shows a significant reduction in errors for languages that have often been poorly served, such as Serbian, Cantonese, and Malayalam, where competing models frequently report error rates above 40%. Developers can incorporate Scribe into their applications via ElevenLabs' speech-to-text API, which returns structured JSON transcripts annotated with timestamps, speaker labels, and audio events.
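
As a rough illustration of working with a structured JSON transcript of this kind, the sketch below groups words into speaker turns and pulls out tagged audio events. The payload and its field names (`words`, `type`, `start`, `end`, `speaker_id`) are assumptions made for this example and are not taken from the listing; check the vendor's API reference for the actual response schema. No network call is made.

```python
import json

# Hypothetical transcript payload. The field names are assumptions for
# illustration only; the real API response may be shaped differently.
SAMPLE = """
{
  "language_code": "en",
  "text": "Hello there. (laughter) Hi!",
  "words": [
    {"text": "Hello", "start": 0.00, "end": 0.42, "type": "word", "speaker_id": "speaker_0"},
    {"text": " ", "start": 0.42, "end": 0.50, "type": "spacing", "speaker_id": "speaker_0"},
    {"text": "there.", "start": 0.50, "end": 0.91, "type": "word", "speaker_id": "speaker_0"},
    {"text": "(laughter)", "start": 1.10, "end": 1.80, "type": "audio_event", "speaker_id": "speaker_1"},
    {"text": "Hi!", "start": 2.00, "end": 2.30, "type": "word", "speaker_id": "speaker_1"}
  ]
}
"""


def speaker_turns(transcript):
    """Merge consecutive words from the same speaker into one turn."""
    turns = []
    for item in transcript["words"]:
        if item["type"] != "word":
            continue  # skip spacing and audio-event entries
        if turns and turns[-1]["speaker"] == item["speaker_id"]:
            # Same speaker as the previous word: extend the current turn.
            turns[-1]["words"].append(item["text"])
            turns[-1]["end"] = item["end"]
        else:
            turns.append({
                "speaker": item["speaker_id"],
                "start": item["start"],
                "end": item["end"],
                "words": [item["text"]],
            })
    return [
        {"speaker": t["speaker"], "start": t["start"], "end": t["end"],
         "text": " ".join(t["words"])}
        for t in turns
    ]


def audio_events(transcript):
    """Return (label, start, end) for every tagged non-speech event."""
    return [
        (w["text"], w["start"], w["end"])
        for w in transcript["words"]
        if w["type"] == "audio_event"
    ]


transcript = json.loads(SAMPLE)
turns = speaker_turns(transcript)
events = audio_events(transcript)

for turn in turns:
    print(f'{turn["speaker"]} [{turn["start"]:.2f}-{turn["end"]:.2f}s]: {turn["text"]}')
for label, start, end in events:
    print(f"event {label} at {start:.2f}-{end:.2f}s")
```

Keeping the word-level entries and deriving turns from them afterwards means the same transcript can feed subtitles (per-word timing) and meeting notes (per-speaker text) without a second transcription pass.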

Description

Utterly Voice is an application for highly customizable voice dictation and full computer control, enabling hands-free computing. Users can type, edit, execute keyboard shortcuts, manage windows, scroll through content, control the mouse, and create macros, all through voice commands. It runs on Windows 10 and 11 and currently supports English, with plans to add more languages. The application supports several speech recognizers and models, including Vosk, Microsoft Azure, Deepgram, Google Cloud Speech-to-Text V1, and Whisper, so users can choose the one that best fits their needs. Users can dictate individual characters, alphanumeric data, or code, and can customize the application extensively through text configuration files. Customizable options include mouse control techniques, voice commands, and speech recognition settings. The application aims to increase productivity and make computing more accessible.

API Access

Has API

API Access

Has API

Integrations

Deepgram
ElevenLabs
Google Cloud Speech-to-Text
JSON
MacWhisper
Microsoft Azure
OpenAI Whisper

Integrations

Deepgram
ElevenLabs
Google Cloud Speech-to-Text
JSON
MacWhisper
Microsoft Azure
OpenAI Whisper

Pricing Details

$5 per month
Free Trial
Free Version

Pricing Details

Free
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

ElevenLabs

Founded

2022

Country

United Kingdom

Website

elevenlabs.io/blog/meet-scribe

Vendor Details

Company Name

Utterly Voice

Country

United States

Website

utterlyvoice.com
