Average Ratings 1 Rating

Total
ease
features
design
support

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

This is how you make podcasts. Record. Transcribe. Edit. Mix. It's as easy as typing. Descript gives you complete control over your podcast. Edit text to edit audio. Drag and drop to add music or sound effects. The Timeline Editor allows you to fine-tune your music and volume by adding fades or editing the volume. Both automatic and human-powered transcriptions with industry-leading accuracy and powerful collaboration tools. Automatic transcription is the industry leader with unmatched accuracy. Fast turnaround and only pennies per minute

Description

We have developed and are releasing an open-source neural network named Whisper, which achieves levels of accuracy and resilience in English speech recognition that are comparable to human performance. This automatic speech recognition (ASR) system is trained on an extensive dataset comprising 680,000 hours of multilingual and multitask supervised information gathered from online sources. Our research demonstrates that leveraging such a comprehensive and varied dataset significantly enhances the system's capability to handle different accents, ambient noise, and specialized terminology. Additionally, Whisper facilitates transcription across various languages and provides translation into English from those languages. We are making available both the models and the inference code to support the development of practical applications and to encourage further exploration in the field of robust speech processing. The architecture of Whisper follows a straightforward end-to-end design, utilizing an encoder-decoder Transformer framework. The process begins with dividing the input audio into 30-second segments, which are then transformed into log-Mel spectrograms before being input into the encoder. By making this technology accessible, we aim to foster innovation in speech recognition technologies.

API Access

Has API

API Access

Has API

Screenshots View All

Screenshots View All

Integrations

Nekton.ai
AI Sparks Studio
AnotherWrapper
Baseten
Bolna
MacWhisper
Monster API
NoteVocal
Outline
ReByte
Shift
Shownotes
Spark NLP
Undrstnd
VESSL AI
Vocode
Whisper Notes
Wufoo
brancher.ai
eWebinar

Integrations

Nekton.ai
AI Sparks Studio
AnotherWrapper
Baseten
Bolna
MacWhisper
Monster API
NoteVocal
Outline
ReByte
Shift
Shownotes
Spark NLP
Undrstnd
VESSL AI
Vocode
Whisper Notes
Wufoo
brancher.ai
eWebinar

Pricing Details

$10 per user per month
Free Trial
Free Version

Pricing Details

No price information available.
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Descript

Founded

2017

Country

United States

Website

www.descript.com

Vendor Details

Company Name

OpenAI

Country

United States

Website

openai.com/blog/whisper/

Product Features

Audio Editing

Audio Effects
Batch Processing
Export Audio (Multiple File Types)
Record Live Audio
Record Multiple Simultaneous Tracks
Scrub, Search, Bookmark
Sound Editing Tools
Spectral Analysis / FFT
Speech Synthesis (TTS)
Swappable Patches
Virtual Instruments
Virtual Mixing
Voice Changer

Podcast

Audio Editing Tools
Audio Recording
Audio to Text Transcription
Brand Safety
Create Cover Art
Distribution Tools
Import / Export
Live Broadcasting
Market Intelligence
Monetization / Advertising Management
Podcast Web Hosting
Reporting / Analytics
Sounds Effects / Music
Subscriber Management
Supports Multiple Hosts/Guests
Video Support

Screen Recording

Annotations / Drawing
Audio Capture
Backup
Collaboration Tools
File Sharing
Multi-Screen Recording
Screen Capture
Speech-to-Text
Video Editing
YouTube Uploading

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Video Editing

3D Video Editing
Audio Tools
Brand Overlay
Collaboration
Media Library
Social Sharing
Speed Adjustment
Split / Merge
Supports HD Resolution
Text Overlay
Video Capture
Video Stabilization

Product Features

Speech Recognition

Audio Capture
Automatic Form Fill
Automatic Transcription
Call Analysis
Concatenated Speech
Continuous Speech
Customizable Macros
Multi-Languages
Specialty Vocabularies
Speech-to-Text Analysis
Variable Frequency
Voice Recognition

Transcription

AI / Machine Learning
Annotations
Audio/Video File Upload
Automatic Transcription
Collaboration Tools
File Sharing
For Manual Transcription
Full Text Search
Multi-Language Support
Natural Language Processing (NLP)
Playback Controls
Speech Recognition
Subtitles
Text Editor
Timecoding

Alternatives

Alternatives

VEED Reviews

VEED

VEED.IO
Transcribe Reviews

Transcribe

Wreally