Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Piper is a rapidly operating, localized neural text-to-speech (TTS) system that is particularly optimized for devices like the Raspberry Pi 4, aiming to provide top-notch speech synthesis capabilities without the dependence on cloud infrastructure. It employs neural network models developed with VITS and subsequently exported to ONNX Runtime, which facilitates both efficient and natural-sounding speech production. Supporting a diverse array of languages, Piper includes English (both US and UK dialects), Spanish (from Spain and Mexico), French, German, and many others, with downloadable voice options available. Users have the flexibility to operate Piper through command-line interfaces or integrate it seamlessly into Python applications via the piper-tts package. The system boasts features such as real-time audio streaming, JSON input for batch processing, and compatibility with multi-speaker models, enhancing its versatility. Additionally, Piper makes use of espeak-ng for phoneme generation, transforming text into phonemes before generating speech. It has found applications in various projects, including Home Assistant, Rhasspy 3, and NVDA, among others, illustrating its adaptability across different platforms and use cases. With its emphasis on local processing, Piper appeals to users looking for privacy and efficiency in their speech synthesis solutions.
Description
Vois is an innovative desktop AI voice studio designed for users to produce high-quality speech in 23 languages with a selection of over 63 lifelike voices, all seamlessly integrated into one application. This platform streamlines the entire process by merging scripting, voice generation, editing, arrangement, mastering, and exporting, thus removing the necessity for various tools or online services. Users can either write scripts or import them, assign distinct voices to different speakers, and generate dialogues featuring multiple speakers. They can also arrange audio clips on a multi-track timeline, utilizing features such as crossfades and timing adjustments to enhance their projects. The application comes equipped with advanced mastering tools, including LUFS normalization, de-essing, EQ, and limiting, while also providing export presets tailored for popular platforms like Spotify, YouTube, and audiobook distribution. Furthermore, it offers the capability of voice cloning from brief audio samples, empowering users to craft unique voices that can be utilized in various languages, ultimately expanding their creative possibilities. This comprehensive toolset makes Vois a valuable asset for anyone looking to elevate their audio production experience.
API Access
Has API
API Access
Has API
Pricing Details
Free
Free Trial
Free Version
Pricing Details
$29 per month
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Rhasspy
Country
United States
Website
github.com/rhasspy/piper
Vendor Details
Company Name
Vois
Country
United States
Website
vois.so/
Product Features
Text to Speech
API
Adjust Speaking Rate / Pitch
Audio Optimization
Custom Lexicons
Different Voice Choices
Multi-Language Support
Synchronize Speech
Product Features
Podcast
Audio Editing Tools
Audio Recording
Audio to Text Transcription
Brand Safety
Create Cover Art
Distribution Tools
Import / Export
Live Broadcasting
Market Intelligence
Monetization / Advertising Management
Podcast Web Hosting
Reporting / Analytics
Sounds Effects / Music
Subscriber Management
Supports Multiple Hosts/Guests
Video Support