Alibaba Cloud Intelligent Speech Interaction
Intelligent Speech Interaction is based on the most current technologies, including speech recognition, speech synthesizer, and natural language understanding. Intelligent Speech Interaction can be integrated into products by enterprises to allow them to listen, understand and converse with users. This provides a rich human-computer interaction experience. Intelligent Speech Interaction is available in Mandarin Chinese and Cantonese Chinese. It is also available in English, Japanese Korean, French, Indonesian, Korean, French, and Japanese. Please stay tuned for more languages. Intelligent Speech Interaction can be used in a variety of situations, including intelligent Q&A and intelligent quality inspection. It also allows for real-time subtitles for speeches and transcription of audio recordings. Intelligent Speech Interaction has been used in many industries, including finance, insurance, eCommerce, and smart home.
Learn more
Speech2Structure
Doctors spend two-thirds of their time documenting a patient's treatment. They spend far less time with patients or conducting interviews. Averbis is developing Speech2Structure, a software solution that allows doctors to spend more time with patients. The software allows for the recording of patient documentation live by voice and then structured on-the fly. Speech2Structure is able to recognize and correct many linguistic variations, including negations, suspected diagnoses, past diagnoses, and others. When recognizing diagnoses. Also, the corresponding diagnoses are made from microbiology or pathological laboratory results. Recorded medications can also be used to help diagnose.
Learn more
Speechmatics
Speechmatics is the most accurate and inclusive speech-to-text API ever released.
Speechmatics is the world’s leading expert in Speech Technology, combining the latest breakthroughs in AI and ML to unlock the business value in human speech.
Businesses use Speechmatics worldwide to accurately understand and transcribe human-level speech into text regardless of demographic, age, gender, accent, dialect, or location in real-time and on recorded media. Combining these transcripts with the latest AI-driven speech capabilities, businesses build products that utilize summarization, topic detection, sentiment analysis, translation, and more.
How is Speechmatics different?
* The most accurate speech recognition on the market
* 55 languages with vast accent and dialect coverage
* Cloud-based or on-premises deployment options for data security
* Real-time transcription with low latency and high accuracy
* Real-time translation with 69 language pairs
* Speech Understanding features such as Summaries, Sentiment, Topic Detection, Chapters, Audio Events
* Fast and secure transcriptions for pre-recorded audio
* Automatic translation and language identification
* A culture of R&D in deep learning and speech recognition
Learn more
Clarifai
Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for building better, faster and stronger AI. We help enterprises and public sector organizations transform their data into actionable insights.
Our technology is used across many industries including Defense, Retail, Manufacturing, Media and Entertainment, and more. We help our customers create innovative AI solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been a market leader in computer vision AI since winning the top five places in image classification at the 2013 ImageNet Challenge. Clarifai is headquartered in Delaware
Learn more