Best SpeechPulse Alternatives in 2025
Find the top alternatives to SpeechPulse currently available. Compare ratings, reviews, pricing, and features of SpeechPulse alternatives in 2025. Slashdot lists the best SpeechPulse alternatives on the market that offer competing products that are similar to SpeechPulse. Sort through SpeechPulse alternatives below to make the best choice for your needs
-
1
An API powered by Google's AI technology allows you to accurately convert speech into text. You can accurately caption your content, provide a better user experience with products using voice commands, and gain insight from customer interactions to improve your service. Google's deep learning neural network algorithms are the most advanced in automatic speech recognition (ASR). Speech-to-Text allows for experimentation, creation, management, and customization of custom resources. You can deploy speech recognition wherever you need it, whether it's in the cloud using the API or on-premises using Speech-to-Text O-Prem. You can customize speech recognition to translate domain-specific terms or rare words. Automated conversion of spoken numbers into addresses, years and currencies. Our user interface makes it easy to experiment with your speech audio.
-
2
Speechmatics
Speechmatics
$0 per monthSpeechmatics is the most accurate and inclusive speech-to-text API ever released. Speechmatics is the world’s leading expert in Speech Technology, combining the latest breakthroughs in AI and ML to unlock the business value in human speech. Businesses use Speechmatics worldwide to accurately understand and transcribe human-level speech into text regardless of demographic, age, gender, accent, dialect, or location in real-time and on recorded media. Combining these transcripts with the latest AI-driven speech capabilities, businesses build products that utilize summarization, topic detection, sentiment analysis, translation, and more. How is Speechmatics different? * The most accurate speech recognition on the market * 55 languages with vast accent and dialect coverage * Cloud-based or on-premises deployment options for data security * Real-time transcription with low latency and high accuracy * Real-time translation with 69 language pairs * Speech Understanding features such as Summaries, Sentiment, Topic Detection, Chapters, Audio Events * Fast and secure transcriptions for pre-recorded audio * Automatic translation and language identification * A culture of R&D in deep learning and speech recognition -
3
Rev
Rev
$1.25 per minuteRev offers premium on-demand, manual, and automated transcription, closed captioning, and foreign subtitling services. Rev has 170,000+ clients, ranging from freelance journalists to global corporations. Rev processes more audio/video than any other provider, and can scale to meet any customer's requirements. Pricing is straightforward, starting at $0.25 per audio/video min for automated speech-to text services and $1.25/min manual with 99% accuracy. Rev.ai is a speech recognition engine available to companies who request it. -
4
Twilio Voice
Twilio
$0.0085 per minCreate a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Customize your experience the way you want by using a wide range of customization resources, such as our Voice SDK, speech recognition, Interactive Voice Response (IVR), and recording transcriptions. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice, such as our Twilio Runtime and Studio developer tools. Find docs, code samples, and helper libraries to start building today. -
5
High-quality speech-to-text software that is highly accurate with an integrated advanced text editor. Translate in English and 50+ languages. Automated transcriptions, captions, and voiceovers make it easy to increase your online audience. Our video caption software can subtitle and caption your videos to make your message clearer. You can reach millions more people around the globe by automatically translating your videos into other languages. Maestra allows you to transcribe audio to text. Get started today! One study found that websites that include transcripts to videos had a 16% increase in revenue. Because search engines can crawl words more easily than videos, this allows more people to find your site online. Try Maestra as your new transcription service. You can easily edit your automatically generated transcripts. Bolded text will automatically be added to the current time.
-
6
Alibaba Cloud Intelligent Speech Interaction
Alibaba Cloud
$1.40 per hourIntelligent Speech Interaction is based on the most current technologies, including speech recognition, speech synthesizer, and natural language understanding. Intelligent Speech Interaction can be integrated into products by enterprises to allow them to listen, understand and converse with users. This provides a rich human-computer interaction experience. Intelligent Speech Interaction is available in Mandarin Chinese and Cantonese Chinese. It is also available in English, Japanese Korean, French, Indonesian, Korean, French, and Japanese. Please stay tuned for more languages. Intelligent Speech Interaction can be used in a variety of situations, including intelligent Q&A and intelligent quality inspection. It also allows for real-time subtitles for speeches and transcription of audio recordings. Intelligent Speech Interaction has been used in many industries, including finance, insurance, eCommerce, and smart home. -
7
Work by Speech
Mikołaj Magowski
FreeWork by Speech is the only application that allows you to work on a computer by speaking, without using a keyboard and mouse. Application Key Features: - Effective work on a computer using speech alone - Quiet speaking support - Application switching and opening via speech - Built-in speech commands to perform the most common actions - Advanced custom speech commands management - Macro recording - Separate dictation mode - Support for all mouse actions, quick and repeatable by speech - A customizable mousegrid that can also be moved using speech - Automatic mousegrid optimization for each used program - Very low system resources usage - Works with any microphone under Windows 10 and 11 - Available for the English language only - Updates are free -
8
Our automatic speech recognition engine can recognize many accents in English and can be localized to any language. The ASR engine is compatible with standard telephony, as well as web- and mobile applications. The Automatic Speech Recognition Engine by GoVivace can be used to recognize voice commands from electronic devices, such as smartphones, tablets, computers, and smartphones, using a microphone. This automatic speech recognition engine compares spoken input with a variety of pre-specified options and converts speech to text. The application's grammar is the entire list of pre-specified options. It powers the interface between the dialog-speaker (and the back-end processing). GoVivace's patent Automatic Speech Recognition solution requires only a very simple grammar to be processed. It can also handle very large grammars to support complex tasks.
-
9
zeemo
zeemo
$7.99 per hourUpload subtitle and video files to match text to video content. Upload raw transcript files and video without any timeline information. Transcripts will automatically be added with timestamps. You can edit it online and then download subtitle files or video with sub-titles directly. Original video language supports English and Spanish, Simplified Chinese and Traditional Chinese, Cantonese as well as Japanese, Korean, French (with subtitles), German, Italian, Vietnamese, Arabic, and Portuguese. A single line word limit is the maximum number words that can be included in a line subtitles. The system will make reasonable cuts based on the single-line word limit if a paragraph contains many words. This will ensure that the maximum number of words in a subtitle line does not exceed the limit. This improves the subtitle display and facilitates reading. -
10
Transcribe
Wreally
Transcribe saves thousands each month in transcription time for journalists and podcasters, students, and professional transcriptionists around the world. Converting audio notes, lectures and speeches, as well as podcasts, to text can increase productivity and save you time. Turn on your headphones and start speaking. It's as easy as that. Our dictation engine can convert your speech into text instantly. This is a lot faster than typing. We can speak English, Spanish, French and Hindi. -
11
SpeechText.AI
SpeechText.AI
$19 one-time paymentTranscribe audio and video to text with domain-specific speech recognition. How it works. SpeechText.AI is an artificial intelligence software that converts speech to text and allows audio transcription. Upload audio and video files. AI transcription software can transcribe speech to text in all file formats. Select domain. Select an industry domain and an audio type from predefined categories. This will improve the recognition accuracy for domain-specific words. Transcribe. Our speech transcription engine uses state of the art deep neural network models to convert audio to text with near human accuracy. Edit and Export Use interactive editing tools to search, modify, and verify audio transcriptions. Export your content in different formats. SpeechText.AI: Why SpeechText.AI A variety of features that will allow you to transcribe audio and video in just seconds. Speech recognition. Powerful speech to text technology. SpeechText.AI is fully GDPR compliant. All our physical servers are hosted in Europe (France) and we encrypt all your data sent between you and the service. SpeechText.AI is fully automated, hence your data is confidential and the process has no place for human-factor and other risks that manual transcription has. -
12
Trint
Trint
The easiest way to record, transcribe, and share your phone's audio right from your smartphone! Trint's mobile application lets you capture the important moments, wherever and whenever you want. Wired: "Amazing!" Google - "Rocket-fueling Innovation!" We know that work doesn't always take place in an office. So we created the mobile app to allow you to access Trint's AI transcription wherever you are. You can record live interviews and import files directly from your phone without any complicated equipment. All you need is the app! Record live conversations. Trint can import audio files from other apps. You can share transcripts and assign editing permissions in-app. Trint transcripts can be easily followed by an intuitive player. All files are saved to your device and to the cloud, so you don't have to worry about losing any. Download audio to your device. While you record, drop markers from your Apple Watch. You can capture in 28 languages right from your iPhone, including English, Spanish and Chinese Mandarin, Hindi, and many more. -
13
Txtplay
Txtplay
€0.25 per minTxtplay makes your audio and video accessible to everyone. It also extracts hidden power from your media: searchable metadata. This makes compliance, SEO, and archiving much easier. Upload your media and choose your language. Our speech recognition engine will do the rest and notify you when it's finished. While our AI does the work, you can continue to work. Our online text editor connects your media to the transcript. You can update, highlight and detect speakers, search through your text and scroll in your audio and video. We support more than 20 formats, including VTT, SRT, and.docx. You can fine-tune your export with details such as Timecode, Atlas format and speakers. Developer-friendly options are also available. -
14
iSpeech Translator
iSpeech
iSpeech Translator™ allows you to speak and translate any word or phrase, including email and text in multiple languages. iSpeech®, creator of DriveSafe.ly®, an award-winning leader in texting and driving apps, brings the app's speech recognition and text to speech capabilities. You can speak or type any phrase, and the app will translate it in your language. -
15
Whisper
OpenAI
We have developed and are open-sourcing Whisper, a neural network that approximates human-level robustness in English speech recognition. Whisper is an automated speech recognition (ASR), system that was trained using 680,000 hours of multilingual, multitask supervised data from the internet. The use of such a diverse dataset results in a better resistance to accents, background noise, technical language, and other linguistic issues. It also allows transcription in multiple languages and translation from these languages into English. We provide inference code and open-sourcing models to help you build useful applications and further research on robust speech processing. The Whisper architecture is an end-to-end, simple approach that can be used as an encoder/decoder Transformer. The input audio is divided into 30-second chunks and converted into a log Mel spectrogram. This then goes into an encoder. -
16
AccuSpeechMobile
AccuSpeechMobile
AccuSpeechMobile's robust, modern speech recognition technology is optimized for mobile devices in more than 40 languages. The industry workflow-friendly noise abatement technology is able to recognize speech in noisy environments with remarkable accuracy. The speaker-independent voice engine is available for all users right out of the box. It does not require any voice training or maintaining voice files. AccuSpeechMobile works on all devices. No middleware or voice server is required. There are no changes to the backend systems (WMS ERP, ERP, EAM, and CMMS). To fully utilize the functionality of device-based data gathering, you don't need a network connection or cloud. AccuSpeechMobile fully supports multimodal capabilities so users can both hear spoken information and use intelligent scanners to communicate their commands. In conjunction with text-to speech and text-to speech commands, the ability to refer to additional information on your device screen is also available. -
17
PowerSpeak
Saince
PowerSpeak by Saince is a powerful front-end medical speech recognition software. The solution includes over 30 medical language definitions, which allows you to use this technology regardless of your specialization. It is a great solution for clinical documentation and reporting. This software is ideal for radiologists as well as physicians of all specialties. PowerSpeak Medical speech recognition software is more flexible than other solutions on the market, which limit you to using it on one device. PowerSpeak's advanced speech recognition algorithms ensure that you receive 99% accuracy in the transcribed text every single time. This means that you can spend less time correcting errors and more time working. -
18
TextGears
TextGears
$4.90TextGears provides translation, paraphrasing and text checking services for hundreds companies around the globe. Free demo available online. API allows to integrate TextGears text analysis into any modern software product. On-premise installation will be the best options for those companies that cannot use any services our of the corporate network. Supported languages include: English, French, German, Portuguese, Russian, Italian, Arabic, Spanish, Japanese, Chinese and Greek. -
19
Dragon Professional
Nuance Communications
$699 one-time payment 1 RatingDragon Professional is an advanced speech recognition software designed to help professionals create accurate documentation quickly by converting speech to text with up to 99% accuracy. Optimized for Windows 11 and compatible with Windows 10, it caters to various industries such as finance, education, and healthcare. Users can dictate documents up to three times faster than typing, transcribe recorded audio, and customize workflows with personalized commands and vocabulary. The software also includes access to Dragon Anywhere Mobile, a cloud-based dictation tool for iOS and Android, enabling productivity from any location. -
20
EaseText Text to Speech Converter
EaseText Software
$3.95/month EaseText Text to Speech is a cutting-edge offline TTS program that seamlessly transforms text into natural and lifelike voice. EaseText Text to Speech converter is the best choice for anyone who wants to create content, teach, or simply want to get top-notch speech synthesis. Key Features 1 Offline Functionality Work seamlessly without internet connection. Access lifelike speech synthesis wherever you are. 2 Voice Variety Choose from over 1300 voices in a vast library. 3 Language Support Support for 30 languages including English, Spanish and Dutch, Italian, Chinese Russian, Portuguese, German and more. 4 Voice Cloning Use advanced AI-powered voice copying to duplicate and use your voice. Bulk Conversion 6 Real-Time Processor Privacy Assurance 7 Affordable Pricing 9 User-Friendly Interface -
21
Braina
Brainasoft
$29 per yearBraina (Brain Artificial), is an intelligent personal assistant, voice recognition, automation, and human language interface for Windows PC. Braina is an AI software that can interact with your computer via voice commands in almost all languages. Braina allows you to convert speech into text in over 100 languages around the world. Braina's artificial intelligence allows you to control your computer with natural language commands. This makes your life much easier. Braina is not a Siri/Cortana clone, but a powerful personal productivity software. It's not a chatbot. It's designed to be super functional and assist you in completing tasks. -
22
Virtual Speech Center
Virtual Speech Center
Virtual Speech Center provides innovative speech therapy software and apps for schools, private practices and independent speech pathologists. We offer a variety of mobile apps for speech therapy that are compatible with IPad and IPhone devices. Speech pathologists can use some of our apps at no cost. Virtual Speech Center is a leader in incorporating games into speech and language therapy apps. Our apps feature puzzles, board games, as well as games with carnival and sports themes. You can purchase our apps individually or in bundles. Virtual Speech Center's TheraPlatform speech technology software includes telepractice and documentation, billing, intake forms, and e-claim submission module. It is designed for speech and language pathologists. Virtual Speech Center offers innovative speech therapy apps to schools, parents, independent speech pathologists, and private practices. -
23
Dragon Legal
Nuance Communications
$799 one-time paymentDragon Legal v16 is an advanced speech recognition software designed specifically for legal professionals, featuring a specialized language model trained on over 400 million legal terms and documents. It enables attorneys to dictate contracts, briefs, and citations with up to 99% accuracy, significantly speeding up documentation compared to traditional typing. The software allows users to create custom voice commands for automating repetitive tasks and supports transcription of recorded audio, improving overall workflow efficiency. Optimized for Windows 11 and compatible with Windows 10, Dragon Legal v16 also includes accessibility features like audio playback of dictated text and customizable macros. Additionally, it integrates with Dragon Anywhere Mobile, a cloud-based dictation tool for iOS and Android, ensuring legal professionals can stay productive from anywhere. -
24
Echo Speech-to-Text
Echo Speech-to-Text
$5Voice typing Voice typing. Real-time voice transcription. Echo - Speech to Text is a cutting-edge voice typing tool. It works on most websites. Experience the highest level of accuracy in speech recognition. Key Features - Automatic Punctuation : Enjoy automatic punctuation to create polished, professional texts. - Voice Type Directly Into Textbox: No weird overlaid or copy-pasting. - Multi-language Support: Supports 50+ languages, including English, Spanish, German, French, etc. - Custom Vocabularies : Add specialized nouns or specialized vocabulary to improve transcription accuracy. - Keyboard shortcut: Start and stop voice recognition quickly using a simple keyboard short cut. Trusted and secure We respect your privacy and do not collect, store or share any of your data. We DO NOT store any dictation texts in our database. HIPAA Compliance In practice, we comply with HIPAA. Audio recordings are not stored. Transcriptions are not stored. -
25
aiOla
aiOla
aiOla is a deep tech Conversational, Voice, and Speech AI lab with an enterprise-level ASR foundation model and TTS technology. It’s designed to help enterprises and developers adapt speech technologies to any process, whether through seamless API integration or an intuitive in-house app – We specialize in speech-to-text and text-to-speech AI that deliver unmatched accuracy (95%), in any language, accent, jargon, vertical or acoustic environment. Our patented ASR technology, backed by world-renowned researchers, empowers enterprises to capture spoken data in real-time, structure it, and turn it into actionable insights through a centralized data platform. From empowering frontline workers with hands-free workflows to enabling voice AI agents with enterprise-grade ASR and TTS, aiOla seamlessly integrates into workflows, internal apps and products. With 120+ languages, robust privacy features, and real-time processing, we’re the trusted partner for enterprises looking to drive efficiency, collect more data and make smarter decisions through AI-driven conversational technology. -
26
Azure AI Speech
Microsoft
The Speech SDK makes it easy to create voice-enabled apps quickly and confidently. The Speech SDK can accurately transcribe speech to text, create natural-sounding text/speech voices, and translate spoken audio. It can also be used to recognize speaker during conversations. Speech studio allows you to create custom models that are tailored to your app. Speech studio offers state-of the-art speech-to-text, speech-to-text, and award-winning speaker recognition. Your speech input is not recorded during processing, so your data remains yours. You can create custom voices, add words to your base vocabulary, and build your own models. Speech can be run anywhere, in the cloud and at the edge in containers. Transcribe audio in more than 92 languages. Call center transcription can help you gain customer insight, improve customer experience with voice-enabled assistants and capture key discussions in meetings. Text to speech allows you to create apps and services that can speak conversationally using more than 215 voices and 60 languages. -
27
Deepgram
Deepgram
$0You can use accurate speech recognition at scale and continuously improve model performance by labeling data, training and labeling from one console. We provide state-of the-art speech recognition and understanding at large scale. We do this by offering cutting-edge model training, data-labeling, and flexible deployment options. Our platform recognizes multiple languages and accents. It dynamically adapts to your business' needs with each training session. Enterprise-specific speech transcription software that is fast, accurate, reliable, and scalable. ASR has been reinvented with 100% deep learning, which allows companies to improve their accuracy. Stop waiting for big tech companies to improve their software. Instead, force your developers to manually increase accuracy by using keywords in every API call. You can train your speech model now and reap the benefits in weeks, instead of months or even years. -
28
Knovvu Speech Recognition
Sestek
Automate customer processes, evaluate agent performance objectively, and ensure 100% efficiency in your operations. Many consumers are using connected appliances to interact with their everyday lives in new ways. Speech is becoming a natural and intuitive interface for human-machine interaction, despite the fact that connected devices often lack screens. This technology is revolutionizing how people interact with their devices. Machines and applications can now understand spoken commands with Knovvu Speech Recognition by Sestek. These devices can listen to and interpret spoken commands, allowing users to interact with them by speaking aloud and not using keystrokes and buttons. Our automatic speech recognition software is fully functional. Many companies use technology to provide simple and intuitive self-service solutions. -
29
Rubidium
Rubidium
Rubidium allows leading companies to embed voice commands or text to speech into their products. Voice Trigger is an engine that is "always on" and listens to your voice. It wakes up when the correct "magic word" is said. Voice Trigger uses a miniature footprint Automatic Speech Recognition engine to distinguish between the trigger phrase, the rest of speech, sounds, and noise. Automated Speech Recognition is able to control any function through voice commands. You can use it to accept and reject calls, set up and install devices (pairing, calibration and interconnection), and so on. Voice dialing, music streaming control, and music selection. Today, Rubidium technology can be found in more than 50 million consumer products. Customers and partners include leading global brands like RIM (Blackberry), GN Netcom(Jabra), Panasonic and Uniden. -
30
Voicepoint Cloud
Voicepoint
High-availability Voicepoint Cloud, with a Swiss data centre, offers a cost-effective solution for anyone who needs to prepare a lot. This cloud solution is sophisticated and high-performance. You can use integrated speech recognition from Dragon Legal Anywhere, Dragon Professional Anywhere, or Dragon Medical Direct. The result will be displayed as text in the target application. The Voicepoint Cloud also offers access to Winscribe, a dictation management tool that covers all aspects of speech-based documentation. The cloud-based Voicepoint speech recognition solution and dictation system supports documentation from anywhere, whether you're at work, at the clinic, at home, or out. -
31
VoxCommando
VoxCommando
VoxCommando allows you to control your multimedia Home Theatre PC (HTPC) using speech recognition and command utilities. VoxCommando is available locally without any privacy issues. Voice control can be added to your home automation. It can be used as an aid tool to speed up daily tasks, reduce your dependence on the keyboard and mouse, or simply for fun! VoxCommando is unique in that it can be customized to any speech recognition application. It can be used with a variety of home automation and multimedia programs, including favorites like MediaMonkey and Kodi. Because it knows what media is in your library, it can accurately recognize speech. -
32
CADopia
CADopia
CADopia is a powerful Computer-Aided-Design software for engineers, architects, designers and drafters -- virtually anyone who creates, edits, or views professional drawings. CADopia 19 can be downloaded in 12 languages: Chinese, Czech, English and German. CADopia Professional Services can help maximize the return on your investment in CAD technology. CADopia offers upfront consulting, custom application development, training for staff, technical support, and project outsourcing. Productivity-enhancing drafting tools like custom construction plane, entity snaps grids, entity, and polar tracking allow you to complete your drawings accurately and efficiently. -
33
Vocola 3
Vocola 3
Dictation with Windows Speech Recognition works well with "WSR-friendly applications" such as MS Word, Outlook, or PowerPoint. Dictated text can be inserted directly into text. Commands like "Delete hedgehog", for example, can refer to specific text. WSR dictation is less effective for "WSR-unfriendly applications" such as MS Excel, Gmail and most other programming environments. Dictation cannot be inserted directly into text and commands can't refer to text. Vocola makes this easier by allowing direct dictation for WSR unfriendly applications and by allowing modification and correction of the just-dictated phrase. Vocola and WSR share the same speech profile so any improvements made via training, correction or the speech dictionary will benefit both WSR dictation as well as Vocola dictation. Vista makes it difficult to dictate to WSR-unfriendly programs. Every word raised the correction panel. -
34
SpeechMotion
vChart
Document a patient's encounter using voice recognition or dictation. You can also document on the go with a solution that is tailored to your environment. To solve common documentation problems, such as lowering costs and integrating your workflows, you must choose a solution that meets your evolving needs. With a partner who is committed to your success, you can improve workflow efficiency and physician adoption. This will result in a rapid return on your investment. SpeechMotion, a leading national provider of US transcription, speech recognition and voice capture technologies, partners with healthcare facilities to develop a customized solution that supports both short- and long-term goals. SpeechMotion offers healthcare facilities the flexibility they need to document a patient's story quickly and efficiently, all under a single product and service umbrella. -
35
AppTek
AppTek
AppTek is a global leader for artificial intelligence (AI), machine learning (ML), and natural language understanding. AppTek's platform provides industry-leading streaming and batch technology solutions in cloud or on-premise to organizations in a wide range of markets, including media and entertainment, government, enterprise, and call centers. AppTek's solutions are developed by top scientists and researchers who are among the most respected in the world. They can be used in a variety of languages, dialects, channels, and languages. AppTek uses deep neural networks to understand and transcribe speech and text data, providing more precise and efficient tools. -
36
Happy Scribe
Happy Scribe
$9 per month 1 RatingHigh-tech A.I. Working side-by-side with the best language professionals. Our interactive editors are designed for subtitlers and transcribers. They will make it easier to interact with your subtitles and transcripts. Interactive editors offer endless possibilities. You can collaborate with all your stakeholders by sharing transcripts and subtitles in edit or view-only mode. Export in any format you can imagine. Our platform will prepare files for you that are ready to be uploaded to any platform. Upload files of any length and size. All formats are supported by our software. Translate your transcriptions and subtitles automatically in the most popular languages. Import public links and synchronize happy Scribe with your current workflow. You can create spaces to share files with your team. Integrate seamlessly with your favorite apps: Youtube, Zapier, and many more. All files are private and protected. Your subtitles will be protected. -
37
Scribe
Scribe Technology Solutions
$59.95/month/ user ScribeNow is now available! ScribeMobile's flagship product, ScribeMobile Speech Recognition, is now available in your palm. This is the future of medical documentation. ScribeNow! ScribeMobile's already strong set of documentation services is enhanced by ScribeNow! ScribeNow! ScribeNow allows providers to quickly and easily record encounters using speech recognition. Providers have the flexibility they need to improve productivity, profitability, patient care, and patient care. This easy-to-use solution has a wide variety of integration capabilities. Scribe TeleCare, an innovative solution that allows healthcare providers to continue servicing their clients and have completed documentation to support their care of their patients. It also facilitates reimbursement using one easy-to-use tool. You don't have to use an app not designed for healthcare to connect to your patients remotely. -
38
Augnito combines Speech Recognition AI power with mobility. With best-in-class accuracy, Augnito allows you to edit, format, or complete reports at the speed and ease of human speech. You can now access your personal templates and short forms from any computer, whether you're at work, at home, or on the road. This program is best suited for those who need to create detailed reports, such as radiology, histopathology, and surgical notes. You can also dictate your reports from anywhere around the world. Augnito can recognize different accents and pronunciations without any profile training. Augnito is built with the most advanced deep learning technology and has the entire language for medicine that covers 50+ sub-specialties and all the popular generic and drug names.
-
39
Voice Finger
Voice Finger
$9.99 one-time paymentIt allows zero computer contact and does not require keyboards or mices. You can rest your hands on the computer and speak to it. This is the best solution for people with disabilities or computer injuries. Some speech recognition software assumes that you can type and click to complete certain tasks. Voice Finger was created to perform all tasks by speaking. For gamers who are serious about gaming. Voice Finger is a third-hand that allows competitive gamers to hit keys and buttons, while the gamer moves and shoots. Voice Finger allows you to control the keyboard with short commands. Windows default speech recognition can recognize a lot of long commands, such as "Press 1", “Press A” and "Press down thirty times". Voice Finger reduces all commands to a minimum length like "Press 1", "Press A", and "Press down 30". You can still use the mouse buttons with commands such as "click left", or "click right", while simultaneously holding keys like Control, Shift, and Alt. -
40
Azure Speaker Recognition
Microsoft
A Speech service feature that verifies and identifies speakers. Facilitate frictionless, secure customer experiences Streamlining verification processes can improve customer experience. Voice verification allows for secure, frictionless customer interactions in a wide variety of solutions, including web applications and call centers. Passphrases and free-form voice input can be used to verify speakers. Streamlining verification processes can improve customer experience. Voice verification allows for secure, frictionless customer interactions in a wide variety of solutions, including web applications and call centers. Passphrases and free-form voice input can be used to verify speakers. Multiple speakers can unlock value: From a group of enrolled speaker, you can determine a speaker's identity. Speaker identification allows you to assign speech to individual speakers and support multiuser voice recognition to create personalized interactions. -
41
SpeechWrite
SpeechWrite
SpeechWrite offers a variety of cloud dictation and voice recognition solutions that can be used to meet the needs of modern professionals. Solutions that can be scaled and modified to meet the needs of all organizations. Our digital dictation and transcription solutions are the best in the industry, allowing for efficient communication between authors and transcribers. Flexible workflow settings allow you to receive your written dictations quickly, whether you are at work or on the go. Your voice is your most powerful tool. Use it! Our simple yet sophisticated technology allows you to improve your work environment and work smarter. We listen, learn, and collaborate to support your every step of the process. Along with professional guidance and support, we also offer professional guidance. -
42
Dragon Law Enforcement
Nuance Communications
It is no longer necessary to read handwritten notes or recall details from hours ago. Officers can simply speak to create detailed, accurate incident reports three times faster than typing and with up 99% recognition accuracy-Zall by Voice. Dragon uses a next-generation speech engine powered with Nuance Deep Learning technology to achieve high recognition accuracy while dictating. This is ideal for people working in different environments and for users with accents. You can quickly and accurately dictate to enter data into RMS, CAD systems, or other applications. Officers and support staff simply dictate where they would normally type and then fill in and navigate within the form fields using their voices. -
43
Dragon Professional Anywhere
Nuance Communications
Nuance Dragon Professional Anywhere allows busy professionals, even remote workers, to use the power of their voice to quickly and easily create more detailed and precise documentation. Knowledge workers and field professionals should dictate mission critical documentation, not technology limitations. Conversational AI allows professionals in the private and public sectors to document more naturally. Professionals can quickly and easily record details of client meetings using speech recognition, which is up to 3x faster than typing. It's also up to 99% accurate. While most people speak at more than 120 wpm, they type at less that 40 wpm. You can speak as much or as little as you want, and there are no limits on how many people can hear you. Business professionals can work from anywhere, and can focus on their clients and business instead of technology. -
44
Ctalk
Ctalk
You can enjoy the benefits of a contact center, IVRs, speech recognition, call recordings, unified communications and outbound dialing, without having to replace your existing telephony system. The Ctalk Contact Centre system 'wraps' around your existing PBX, adding features and capacity. You don't need to replace your existing PBX. You can effectively handle more calls and contact with the same resources or less. Reduce your IT costs and dependency. By empowering multiple administrators to manage calls on the fly. First contact resolution can be dramatically increased. Know who is calling, why they are calling, and route them to the correct agent. 24/7 automated services seamlessly blend with proactive outbound calls. -
45
wolkvox
Microsyslabs
Wolkvox, a cloud-based call centre management software, helps businesses streamline communications across multiple web chat applications and social media channels like Telegram, WhatsApp Line, Line, Twitter, Facebook and Instagram. Organizations can manage their interactions via video calls, email, SMS, phone, and mobile devices. wolkvox allows enterprises to monitor multiple customer groups, record and analyze client interactions, and generate reports to track agent and campaign performance. It features a drag-and drop interface, simultaneous calling and Artificial Intelligence (AI-enabled speech analysis), gamification, and many other features. Administrators can also use the predictive dialer for creating custom rules for virtual agents, call routing, messages, and templates for email campaigns and SMS campaigns. Wolkvox integrates with many third-party ERP, business intelligence and CRM systems. -
46
Dictation.io
Dictation.io
Google Chrome uses speech recognition to create emails and documents. Dictation accurately transcribes your speech into text in real-time. You can add paragraphs, punctuation marks and smileys to your text using voice commands. Dictation can recognize and transcribe popular languages such as English, Espanol and Francais. With simple voice commands, you can add new paragraphs and punctuation marks. To insert a smiley, say "New Line" or "Smiling Face". Google Speech Recognition is used to translate your spoken words into text. It saves the converted text locally in your browser and does not upload any data. Learn more. You can dictate text in any language using your voice, without the need for a keyboard or mouse. -
47
Clarifai
Clarifai
$0Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for building better, faster and stronger AI. We help enterprises and public sector organizations transform their data into actionable insights. Our technology is used across many industries including Defense, Retail, Manufacturing, Media and Entertainment, and more. We help our customers create innovative AI solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been a market leader in computer vision AI since winning the top five places in image classification at the 2013 ImageNet Challenge. Clarifai is headquartered in Delaware -
48
Fusion Speech
Dolbey
The most important technology advancement in the dictation/transcription industries is back-end speech recognition. Fusion Speech®, powered by Nuance's SpeechMagic™, harnesses this powerful technology to allow facility-wide deployment in almost every medical specialty. Fusion Voice® captures dictation, Fusion Speech processes it, and Fusion Text® increases productivity. The Fusion modules result in cost savings in reoccurring labor costs and outsourced fees. This is the speech recognition solution that you have been looking for. While other speech recognition solutions have offered cute gimmicks, they are not sustainable business applications. Fusion Speech gives you the tools to deploy speech recognition that yields tangible and measurable returns for your investments. -
49
TrulyNatural
Sensory
Sensory is a pioneer in embedded neural network-based speech detection and has been the industry leader in optimizing speech recognition software with small footprints. The first embedded large vocabulary continuous speech recognizer (LVCSR), was created from this extensive experience and continuous innovation. Sensory's voice recognition software is embedded, so it doesn't need a wifi connection. Many applications don’t need or want to rely upon cloud-based connections to perform high-performance speech recognition. Others are looking for a client-cloud distributed system that offers optimal performance. More processing is being done at the edge because of market concerns about privacy, performance, and bandwidth. -
50
Verbatim
Saince
A speech recognition and radiology reporting system that anyone can afford. Verbatim is the latest and most technologically advanced solution in speech recognition and radiology reporting. It won't break the bank. You can quickly and easily complete your reports with a 99 percent accuracy and intuitive workflows.