An API powered by Google's AI technology lets you accurately convert speech into text. You can caption your content, deliver a better user experience in products that use voice commands, and gain insight from customer interactions to improve your service. Google's deep learning neural network algorithms are among the most advanced in automatic speech recognition (ASR). Speech-to-Text lets you experiment with, create, manage, and customize custom resources. You can deploy speech recognition wherever you need it, whether in the cloud using the API or on-premises using Speech-to-Text On-Prem. You can customize speech recognition to transcribe domain-specific terms or rare words, and spoken numbers are automatically converted into addresses, years, and currencies. The user interface makes it easy to experiment with your speech audio.
Learn more
LM-Kit.NET is an enterprise-grade toolkit designed for seamlessly integrating generative AI into your .NET applications, fully supporting Windows, Linux, and macOS. Empower your C# and VB.NET projects with a flexible platform that simplifies the creation and orchestration of dynamic AI agents.
Leverage efficient Small Language Models for on‑device inference, reducing computational load, minimizing latency, and enhancing security by processing data locally. Experience the power of Retrieval‑Augmented Generation (RAG) to boost accuracy and relevance, while advanced AI agents simplify complex workflows and accelerate development.
Native SDKs ensure smooth integration and high performance across diverse platforms. With robust support for custom AI agent development and multi‑agent orchestration, LM‑Kit.NET streamlines prototyping, deployment, and scalability, enabling you to build smarter, faster, and more secure solutions trusted by professionals worldwide.
Learn more
Subanana
Subanana is a cutting-edge web application designed for converting audio and video content into subtitles, transcripts, and meeting summaries, supporting over 80 languages with exceptional accuracy, particularly for Asian and mixed-language speech like Cantonese, Mandarin, Japanese, and Korean, which are often inadequately addressed by English-centric tools. Users can easily import files or links from platforms like YouTube, Instagram, or Facebook to create subtitles, which can be customized with a glossary and AI-driven corrections before being exported in various formats such as SRT, VTT, TXT, DOCX, bilingual subtitles, or as burned-in video. For transcripts, the app offers features like speaker identification, the elimination of filler words, and the automatic addition of punctuation and paragraph breaks for clarity. Additionally, it provides templates for meeting summaries that capture decisions and action items, along with a unique bot that integrates with Google Meet and Microsoft Teams to analyze recordings after meetings conclude. Furthermore, Subanana offers live captioning services that provide real-time translations during events, enhancing accessibility and understanding for diverse audiences.
Learn more
Rev
Rev offers premium on-demand, manual, and automated transcription, closed captioning, and foreign subtitling services. Rev has 170,000+ clients, ranging from freelance journalists to global corporations. Rev processes more audio/video than any other provider and can scale to meet any customer's requirements. Pricing is straightforward, starting at $0.25 per audio/video minute for automated speech-to-text services and $1.25 per minute for manual transcription with 99% accuracy. Rev.ai is a speech recognition engine available to companies that request it.
Learn more