Google Cloud Speech-to-Text
Speech-to-Text is an API powered by Google's AI technology that accurately converts speech into text. You can caption your content, improve the user experience of products that use voice commands, and gain insight from customer interactions to improve your service. It applies Google's most advanced deep learning neural network algorithms for automatic speech recognition (ASR). Speech-to-Text supports experimenting with, creating, managing, and customizing custom resources. You can deploy speech recognition wherever you need it, whether in the cloud using the API or on-premises using Speech-to-Text On-Prem. You can customize speech recognition to transcribe domain-specific terms or rare words, and spoken numbers are automatically converted into addresses, years, and currencies. The user interface makes it easy to experiment with your speech audio.
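As a rough illustration of customizing recognition for domain-specific terms, the sketch below builds a JSON request body in the shape used by the v1 REST `speech:recognize` method, passing phrase hints through `speechContexts`. The helper name `build_recognize_request`, the bucket URI, and the example phrases are assumptions made for illustration; the sketch only constructs the payload and does not call the service or handle authentication.

```python
import json


def build_recognize_request(gcs_uri, phrases, language_code="en-US"):
    """Return a request body shaped like a v1 `speech:recognize` call.

    `phrases` are hints for domain-specific terms or rare words.
    This helper is hypothetical and only builds the payload.
    """
    return {
        "config": {
            "languageCode": language_code,
            # Phrase hints bias recognition toward the listed terms.
            "speechContexts": [{"phrases": list(phrases)}],
        },
        # Audio is referenced by a Cloud Storage URI (placeholder value).
        "audio": {"uri": gcs_uri},
    }


if __name__ == "__main__":
    body = build_recognize_request(
        "gs://example-bucket/support-call.flac",  # assumed bucket and file
        ["myocardial infarction", "tachycardia"],
    )
    print(json.dumps(body, indent=2))
```

The body would then be sent as an authenticated POST to the recognize endpoint; check the current API reference for the required encoding and sample-rate fields for your audio format.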
Otter.ai
Otter is where conversations live. With Otter, your AI-powered assistant, you can create rich notes for interviews, meetings, lectures, and other important voice conversations. Organizations benefit as well: Otter is trusted by teams of all sizes to transcribe important conversations. Otter 2.0, our shiny new release, offers more functionality to enhance collaboration and productivity. The Teams plan is designed for small and medium-sized businesses as well as teams in larger companies. You can record and review your conversations in real time, and search, play, edit, organize, and share them on any device. Otter lets you record conversations on your smartphone or in a web browser, import or sync recordings from other services, and integrate with Zoom. Real-time streaming transcripts are available. Within minutes, Otter creates rich, searchable notes with text, audio, images, and speaker ID. To keep others informed and on the same page, you can share or export voice notes.
Rubidium
Rubidium empowers top companies to integrate voice commands and text-to-speech capabilities within their offerings. The Voice Trigger feature operates as an always-listening engine that activates upon hearing a specific "magic word." This identification process uses an advanced, compact Automatic Speech Recognition (ASR) engine that runs quietly in the background, distinguishing the trigger phrase from other sounds and speech. With ASR technology, users can effortlessly and securely manage a variety of functions via voice commands, including accepting or rejecting calls, setting up devices, and controlling music playback and selection. Currently, Rubidium's innovations are present in over 50 million consumer products, through partnerships with renowned global brands like RIM (Blackberry), GN Netcom (Jabra), Panasonic, Uniden, CSR, Mattel, General Motors, Electrolux, and numerous others. These partnerships have significantly expanded the reach and usability of voice-activated technology across diverse industries.
Twilio Voice
Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API that lets you create, receive, control, and monitor calls with just a few lines of code. Tailor the experience with a wide range of resources, such as our Voice SDK, speech recognition, Interactive Voice Response (IVR), and recording transcriptions.
Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice, such as our Twilio Runtime and Studio developer tools. Find docs, code samples, and helper libraries to start building today.
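To make the "few lines of code" idea concrete, here is a minimal sketch of a speech-driven IVR step expressed as TwiML, the XML that a webhook returns to tell Twilio how to handle a call. It uses only the Python standard library rather than Twilio's helper library. The function name `build_ivr_twiml`, the prompt wording, and the `/handle-speech` callback path are assumptions for illustration, not part of any Twilio sample.

```python
import xml.etree.ElementTree as ET


def build_ivr_twiml(prompt, action_url):
    """Return TwiML that speaks a prompt and gathers a spoken reply.

    Hypothetical helper: `action_url` is where the speech result
    would be posted once the caller answers.
    """
    response = ET.Element("Response")
    # <Gather input="speech"> collects the caller's spoken input.
    gather = ET.SubElement(
        response,
        "Gather",
        {"input": "speech", "action": action_url, "method": "POST"},
    )
    say = ET.SubElement(gather, "Say")
    say.text = prompt
    # Reached only if the caller says nothing before the gather ends.
    fallback = ET.SubElement(response, "Say")
    fallback.text = "We did not hear anything. Goodbye."
    return ET.tostring(response, encoding="unicode")


if __name__ == "__main__":
    print(build_ivr_twiml("Say sales or support.", "/handle-speech"))
```

A web application would return this string with an XML content type from the URL configured as the phone number's voice webhook; routing on the recognized speech would happen in the handler behind the assumed `/handle-speech` path.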