Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
                    Rev.ai was created by top experts in speech recognition, leveraging millions of hours of precisely transcribed human content. Our journey began in 2011 with the inception of Rev.com, where we offered human transcription services. Now, we proudly stand as the largest transcription provider globally, employing over 35,000 contractors who collectively transcribe millions of audio minutes every month. In 2017, we expanded our offerings with the launch of Temi, an automated service for speech-to-text transcription and editing. Temi has successfully transcribed 20 million minutes of content and has been recognized as the best transcription service by Wirecutter. Today, our advanced speech engine, Rev.ai, is accessible to all, enabling businesses to maximize the usability of their audio and video content by enhancing searchability and accessibility. Through our innovative solutions, we continue to revolutionize how audio and video materials are managed and utilized.
                
            
        
            Description
                    We have developed and are releasing an open-source neural network named Whisper, which achieves levels of accuracy and resilience in English speech recognition that are comparable to human performance. This automatic speech recognition (ASR) system is trained on an extensive dataset comprising 680,000 hours of multilingual and multitask supervised information gathered from online sources. Our research demonstrates that leveraging such a comprehensive and varied dataset significantly enhances the system's capability to handle different accents, ambient noise, and specialized terminology. Additionally, Whisper facilitates transcription across various languages and provides translation into English from those languages. We are making available both the models and the inference code to support the development of practical applications and to encourage further exploration in the field of robust speech processing. The architecture of Whisper follows a straightforward end-to-end design, utilizing an encoder-decoder Transformer framework. The process begins with dividing the input audio into 30-second segments, which are then transformed into log-Mel spectrograms before being input into the encoder. By making this technology accessible, we aim to foster innovation in speech recognition technologies.
                
            
        
            API Access
            
                Has API
            
            
        
        
    
                API Access
            
                Has API
            
            
        
        
    
                Integrations
            
                
    AnotherWrapper
            
            
        
        
    
        
        
            
                
    Azure AI Speech
            
            
        
        
    
        
        
            
                
    Baseten
            
            
        
        
    
        
        
            
                
    Hyprnote
            
            
        
        
    
        
        
            
                
    Krater.ai
            
            
        
        
    
        
        
            
                
    LastMile AI
            
            
        
        
    
        
        
            
                
    MacWhisper
            
            
        
        
    
        
        
            
                
    Nekton.ai
            
            
        
        
    
        
        
            
                
    NoteVocal
            
            
        
        
    
        
        
            
                
    OpenAI
            
            
        
        
    
                
                    
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
                
            Integrations
            
                
    AnotherWrapper
            
            
        
        
    
        
        
            
                
    Azure AI Speech
            
            
        
        
    
        
        
            
                
    Baseten
            
            
        
        
    
        
        
            
                
    Hyprnote
            
            
        
        
    
        
        
            
                
    Krater.ai
            
            
        
        
    
        
        
            
                
    LastMile AI
            
            
        
        
    
        
        
            
                
    MacWhisper
            
            
        
        
    
        
        
            
                
    Nekton.ai
            
            
        
        
    
        
        
            
                
    NoteVocal
            
            
        
        
    
        
        
            
                
    OpenAI
            
            
        
        
    
                
                    
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
        
        
        
    
                
            Pricing Details
        No price information available.
        
        
    
    
    
        
        
            
                Free Trial
            
            
        
        
    
        
        
            
                Free Version
            
            
        
        
    
            Pricing Details
        No price information available.
        
        
    
    
    
        
        
            
                Free Trial
            
            
        
        
    
        
        
            
                Free Version
            
            
        
        
    
            Deployment
            
                Web-Based
            
            
        
        
    
        
        
            
                On-Premises
            
            
        
        
    
        
        
            
                iPhone App
            
            
        
        
    
        
        
            
                iPad App
            
            
        
        
    
        
        
            
                Android App
            
            
        
        
    
        
        
            
                Windows
            
            
        
        
    
        
        
            
                Mac
            
            
        
        
    
        
        
            
                Linux
            
            
        
        
    
        
        
            
                Chromebook
            
            
        
        
    
                Deployment
            
                Web-Based
            
            
        
        
    
        
        
            
                On-Premises
            
            
        
        
    
        
        
            
                iPhone App
            
            
        
        
    
        
        
            
                iPad App
            
            
        
        
    
        
        
            
                Android App
            
            
        
        
    
        
        
            
                Windows
            
            
        
        
    
        
        
            
                Mac
            
            
        
        
    
        
        
            
                Linux
            
            
        
        
    
        
        
            
                Chromebook
            
            
        
        
    
                Customer Support
            
                Business Hours
            
            
        
        
    
        
        
            
                Live Rep (24/7)
            
            
        
        
    
        
        
            
                Online Support
            
            
        
        
    
                Customer Support
            
                Business Hours
            
            
        
        
    
        
        
            
                Live Rep (24/7)
            
            
        
        
    
        
        
            
                Online Support
            
            
        
        
    
                Types of Training
            
                Training Docs
            
            
        
        
    
        
        
            
                Webinars
            
            
        
        
    
        
        
            
                Live Training (Online)
            
            
        
        
    
        
        
            
                In Person
            
            
        
        
    
                Types of Training
            
                Training Docs
            
            
        
        
    
        
        
            
                Webinars
            
            
        
        
    
        
        
            
                Live Training (Online)
            
            
        
        
    
        
        
            
                In Person
            
            
        
        
    
                Vendor Details
Company Name
Rev.ai
Website
www.rev.ai/
Vendor Details
Company Name
OpenAI
Country
United States
Website
openai.com/blog/whisper/
Product Features
Speech Recognition
                                        Audio Capture
                                        
                                    
                                    
                                    
                                        Automatic Form Fill
                                        
                                    
                                    
                                    
                                        Automatic Transcription
                                        
                                    
                                    
                                    
                                        Call Analysis
                                        
                                    
                                    
                                    
                                        Concatenated Speech
                                        
                                    
                                    
                                    
                                        Continuous Speech
                                        
                                    
                                    
                                    
                                        Customizable Macros
                                        
                                    
                                    
                                    
                                        Multi-Languages
                                        
                                    
                                    
                                    
                                        Specialty Vocabularies
                                        
                                    
                                    
                                    
                                        Speech-to-Text Analysis
                                        
                                    
                                    
                                    
                                        Variable Frequency
                                        
                                    
                                    
                                    
                                        Voice Recognition
                                        
                                    
                            
                        Product Features
Speech Recognition
                                        Audio Capture
                                        
                                    
                                    
                                    
                                        Automatic Form Fill
                                        
                                    
                                    
                                    
                                        Automatic Transcription
                                        
                                    
                                    
                                    
                                        Call Analysis
                                        
                                    
                                    
                                    
                                        Concatenated Speech
                                        
                                    
                                    
                                    
                                        Continuous Speech
                                        
                                    
                                    
                                    
                                        Customizable Macros
                                        
                                    
                                    
                                    
                                        Multi-Languages
                                        
                                    
                                    
                                    
                                        Specialty Vocabularies
                                        
                                    
                                    
                                    
                                        Speech-to-Text Analysis
                                        
                                    
                                    
                                    
                                        Variable Frequency
                                        
                                    
                                    
                                    
                                        Voice Recognition
                                        
                                    
                            
                        Transcription
                                        AI / Machine Learning
                                        
                                    
                                    
                                    
                                        Annotations
                                        
                                    
                                    
                                    
                                        Audio/Video File Upload
                                        
                                    
                                    
                                    
                                        Automatic Transcription
                                        
                                    
                                    
                                    
                                        Collaboration Tools
                                        
                                    
                                    
                                    
                                        File Sharing
                                        
                                    
                                    
                                    
                                        For Manual Transcription
                                        
                                    
                                    
                                    
                                        Full Text Search
                                        
                                    
                                    
                                    
                                        Multi-Language Support
                                        
                                    
                                    
                                    
                                        Natural Language Processing (NLP)
                                        
                                    
                                    
                                    
                                        Playback Controls
                                        
                                    
                                    
                                    
                                        Speech Recognition
                                        
                                    
                                    
                                    
                                        Subtitles
                                        
                                    
                                    
                                    
                                        Text Editor
                                        
                                    
                                    
                                    
                                        Timecoding
                                        
                                    
                            
                         
        