Whisper Description

We have developed and are releasing an open-source neural network named Whisper, which achieves levels of accuracy and resilience in English speech recognition that are comparable to human performance. This automatic speech recognition (ASR) system is trained on an extensive dataset comprising 680,000 hours of multilingual and multitask supervised information gathered from online sources. Our research demonstrates that leveraging such a comprehensive and varied dataset significantly enhances the system's capability to handle different accents, ambient noise, and specialized terminology. Additionally, Whisper facilitates transcription across various languages and provides translation into English from those languages. We are making available both the models and the inference code to support the development of practical applications and to encourage further exploration in the field of robust speech processing. The architecture of Whisper follows a straightforward end-to-end design, utilizing an encoder-decoder Transformer framework. The process begins with dividing the input audio into 30-second segments, which are then transformed into log-Mel spectrograms before being input into the encoder. By making this technology accessible, we aim to foster innovation in speech recognition technologies.

Integrations

API:
Yes, Whisper has an API

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
OpenAI
Headquarters:
United States
Website:
openai.com/blog/whisper/

Media

Whisper Screenshot 1
Recommended Products
Passwordless Authentication and Passwordless Security Icon
Passwordless Authentication and Passwordless Security

Identity is everything. Protect it with Duo.

It’s no secret — passwords can be a real headache, both for the people who use them and the people who manage them. Over time, we’ve created hundreds of passwords, it’s easy to lose track of them and they’re easily compromised. Fortunately, passwordless authentication is becoming a feasible reality for many businesses. Duo can help you get there.
Get a Free Trial

Product Details

Platforms
Web-Based
Types of Training
Training Docs
Webinars
Training Videos
Customer Support
Online Support

Whisper Features and Options

Speech Recognition Software

Audio Capture
Automatic Form Fill
Automatic Transcription
Call Analysis
Concatenated Speech
Continuous Speech
Customizable Macros
Multi-Languages
Specialty Vocabularies
Speech-to-Text Analysis
Variable Frequency
Voice Recognition

Transcription Software

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Whisper Lists

Whisper User Reviews

Write a Review
  • Previous
  • Next