Best Speech Recognition Software in China

Find and compare the best Speech Recognition software in China in 2024

Use the comparison tool below to compare the top Speech Recognition software available in China. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    Twilio Voice Reviews

    Twilio Voice

    Twilio

    $0.0085 per min
    409 Ratings
    Create a scalable voice experience with the API that connects millions globally. With Twilio Voice, you can build unique phone call experiences with one API, to create, receive, control and monitor calls with just a few lines of code. Customize your experience the way you want by using a wide range of customization resources, such as our Voice SDK, speech recognition, Interactive Voice Response (IVR), and recording transcriptions. Whether you're looking to set up global conferencing or alerts & notifications, Twilio has the support you need for building with Voice, such as our Twilio Runtime and Studio developer tools. Find docs, code samples, and helper libraries to start building today.
  • 2
    Google Cloud Speech-to-Text Reviews
    Top Pick
    An API powered by Google's AI technology lets you accurately convert speech into text. You can accurately caption your content, provide a better user experience in products that use voice commands, and gain insight from customer interactions to improve your service. Google's deep learning neural network algorithms are the most advanced in automatic speech recognition (ASR). Speech-to-Text allows for experimentation, creation, management, and customization of custom resources. You can deploy speech recognition wherever you need it, whether in the cloud using the API or on-premises using Speech-to-Text On-Prem. You can customize speech recognition to transcribe domain-specific terms or rare words. Spoken numbers are automatically converted into addresses, years, and currencies. Our user interface makes it easy to experiment with your speech audio.
  • 3
    Play.ht Reviews

    Play.ht

    Play.ht

    $199 per month
    87 Ratings
    "Play.ht: The AI-Powered Text-to-Voice Generation Tool for Hollywood Studios and Enterprises" Play.ht is revolutionizing the voiceover industry with its high-fidelity AI voices that sound just like human voice talent. From Hollywood studios to large enterprises, Play.ht is the go-to tool for creating realistic and engaging voiceovers quickly and effortlessly. With Play.ht, you can generate entire performances with multiple speakers, edit their pacing, and create unique versions of each paragraph - all within seconds. Say goodbye to the hassle of scheduling and hiring voice talent, and hello to a streamlined, efficient process that delivers top-quality results. Whether you're an auto manufacturer or a Hollywood studio, Play.ht's API access and online rich-text editor make it easy to scale up and simplify your voice work. Join the ranks of satisfied customers and schedule a live demo today.
  • 4
    Maestra Reviews
    High-quality, highly accurate speech-to-text software with an integrated advanced text editor. Translate into English and 50+ languages. Automated transcriptions, captions, and voiceovers make it easy to grow your online audience. Our video caption software can subtitle and caption your videos to make your message clearer. You can reach millions more people around the globe by automatically translating your videos into other languages. Maestra allows you to transcribe audio to text. Get started today! One study found that websites that include transcripts with their videos had a 16% increase in revenue. Because search engines can crawl words more easily than videos, transcripts allow more people to find your site online. Try Maestra as your new transcription service. You can easily edit your automatically generated transcripts. Bolded text will automatically be added to the current time.
  • 5
    Happy Scribe Reviews

    Happy Scribe

    Happy Scribe

    $9 per month
    1 Rating
    High-tech A.I. working side by side with the best language professionals. Our interactive editors are designed for subtitlers and transcribers and make it easier to interact with your subtitles and transcripts. Interactive editors offer endless possibilities. You can collaborate with all your stakeholders by sharing transcripts and subtitles in edit or view-only mode. Export in any format you can imagine. Our platform will prepare files for you that are ready to be uploaded to any platform. Upload files of any length and size. All formats are supported by our software. Translate your transcriptions and subtitles automatically into the most popular languages. Import public links and synchronize Happy Scribe with your current workflow. You can create spaces to share files with your team. Integrate seamlessly with your favorite apps: YouTube, Zapier, and many more. All files, including your subtitles, are private and protected.
  • 6
    Transkriptor Reviews

    Transkriptor

    Transkriptor

    $9.99 per month
    1 Rating
    Transcribe audio automatically and convert audio to text. Transkriptor allows you to upload your file and convert it to text. Transkriptor's powerful artificial intelligence generates online transcriptions in a matter of minutes. Many professionals and students use Transkriptor. Transkriptor can be used for video transcription, lecture transcription, and interview transcription. Transkriptor creates editable TXT, Word, or SRT files. Transkriptor allows you to download your transcriptions in seconds. You can also use Transkriptor’s online editor to make quick and easy edits. Get more out of school, work, or life by signing up today. Transkriptor, despite being one of the most powerful AI solutions, is very easy to use. Transkriptor is an online speech-to-text converter. Upload your file and you can start.
  • 7
    Zubtitle Reviews

    Zubtitle

    Zubtitle

    $8 per month
    1 Rating
    In minutes, create amazing videos for social media. Our online video editor makes it easy to create stunning videos. Zubtitle's simple but powerful tools will allow you to edit faster and turn your videos into engaging content for social media. Our built-in text editor will help you grab your audience's attention by creating a headline that teases the content. Our auto-subtitle engine allows you to easily add and modify the text and timing of your subtitles. Zubtitle helps you reach a wider audience. With just a few clicks, you can optimize your video for any social media platform using our all-inclusive video recycling tool. Our quick tools allow you to crop and adjust the aspect ratio of your video to fit any social media platform. Our powerful trimming tool will highlight the most eye-catching parts of your video. Your unique branding will make you stand out from other creators. To build a loyal fanbase, express your creativity and make your content instantly recognisable.
  • 8
    VoiceboxMD Reviews
    Advanced medical dictation software created for doctors and practitioners. All EHR platforms and mobile devices are supported.
  • 9
    Dragon Professional Individual Reviews

    Dragon Professional Individual

    Nuance Communications

    $500 one-time payment
    1 Rating
    You are a business professional and have to deal with a lot of documentation every day. Dragon Professional Individual is a tool that can help you complete documents faster and more accurately in the office. This will allow you to focus on revenue-generating tasks. Dragon uses a next-generation speech engine that leverages Deep Learning technology to adapt to your voice and environmental variations, even while you are dictating. You can create documents and reports quickly and accurately and complete computer tasks in record-breaking time, all by speaking. Dragon will only correct mistakes if you use the most common words and phrases. You can keep up with documentation while on the road or in the field. Dragon can be used with popular form factors, such as touchscreen computers and portable laptops.
  • 10
    GoVivace Reviews
    Our automatic speech recognition engine can recognize many accents in English and can be localized to any language. The ASR engine is compatible with standard telephony, as well as web and mobile applications. The Automatic Speech Recognition Engine by GoVivace can be used to recognize voice commands from electronic devices, such as smartphones, tablets, and computers, using a microphone. The engine compares spoken input with a variety of pre-specified options and converts speech to text. The entire list of pre-specified options is the application's grammar. It powers the interface between the dialog speaker and the back-end processing. GoVivace's patent Automatic Speech Recognition solution requires only a very simple grammar to be processed. It can also handle very large grammars to support complex tasks.
  • 11
    LumenVox Reviews
    Top Pick
    AI-driven speech recognition technology and voice authentication technology can transform customer engagement. Our 20-year history has been dedicated to ensuring that our partners are successful through collaboration. Our curiosity keeps us innovating for 20 more years. Our flexible speech-enabling technology allows you to create a solution that meets all your customers' needs, reliably and affordably. We do one thing well. Speech-enabling your applications is our specialty. Deliver great voice automation and interactions. LumenVox ASR/TTS can be used for simple commands or more complex questions. This will help you increase efficiency on both ends of the phone line. You won't ever repeat yourself. You will have the most flexibility in terms of capabilities, deployment, and monetization. LumenVox can help you create it if you can think of it. Our intuitive technology and toolsets make it easier to reduce time from development to deployment.
  • 12
    LilySpeech Reviews
    LilySpeech allows you to type anywhere in Windows using your voice, instead of using your fingers. It can be used with any app to send emails, perform Google searches, Facebook chats, Skype calls, and more. It can be used wherever you would normally type.
  • 13
    Otter.ai Reviews

    Otter.ai

    Otter.ai

    $8.33 per month
    2 Ratings
    Otter is where conversations are. With Otter, your AI-powered assistant, you can create rich notes for interviews, meetings, lectures, and other important voice conversations. The Otter advantage is a benefit for organizations. Otter is trusted by teams of all sizes to transcribe important conversations. Otter 2.0, our shiny new release, offers more functionality to enhance collaboration and productivity. The Teams plan is designed for small and medium-sized businesses as well as teams in larger companies. You can record and review your conversations in real time. You can search, play, edit, organize, and share your conversations on any device. Otter allows you to record conversations on your smartphone or web browser. You can import or sync recordings from other services. Zoom can be integrated. Real-time streaming transcripts are available. Within minutes, rich, searchable notes can be created with text, audio, images, and speaker ID. To inform others and stay on the same page, you can share or export voice notes.
  • 14
    Dragon Home Reviews

    Dragon Home

    Nuance Communications

    $200 one-time payment
    1 Rating
    Dragon uses a next-generation speech engine that leverages Deep Learning technology to adapt to your voice and environmental variations, even while you are dictating. Dragon intelligently converts spoken words into text three times faster than typing, with up to 99 percent recognition accuracy. It's easy to get started with Dragon, thanks to its intuitive user interface and minimal training. You can now select a block of text and play it back for proofreading and editing, listening to what was dictated. Dragon is compatible with the most popular touchscreen tablets and PCs of today, so you can interact with your favorite apps at home or at school.
  • 15
    Augnito Reviews
    Augnito combines Speech Recognition AI power with mobility. With best-in-class accuracy, Augnito allows you to edit, format, or complete reports at the speed and ease of human speech. You can now access your personal templates and short forms from any computer, whether you're at work, at home, or on the road. This program is best suited for those who need to create detailed reports, such as radiology, histopathology, and surgical notes. You can also dictate your reports from anywhere around the world. Augnito can recognize different accents and pronunciations without any profile training. Augnito is built with the most advanced deep learning technology and has the entire language for medicine that covers 50+ sub-specialties and all the popular generic and drug names.
  • 16
    Clarifai Reviews
    Clarifai is a leading AI platform for modeling image, video, text and audio data at scale. Our platform combines computer vision, natural language processing and audio recognition as building blocks for building better, faster and stronger AI. We help enterprises and public sector organizations transform their data into actionable insights. Our technology is used across many industries including Defense, Retail, Manufacturing, Media and Entertainment, and more. We help our customers create innovative AI solutions for visual search, content moderation, aerial surveillance, visual inspection, intelligent document analysis, and more. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been a market leader in computer vision AI since winning the top five places in image classification at the 2013 ImageNet Challenge. Clarifai is headquartered in Delaware.
  • 17
    Speechmatics Reviews

    Speechmatics

    Speechmatics

    $0 per month
    Speechmatics is the most accurate and inclusive speech-to-text API ever released. Speechmatics is the world’s leading expert in Speech Technology, combining the latest breakthroughs in AI and ML to unlock the business value in human speech. Businesses use Speechmatics worldwide to accurately understand and transcribe human-level speech into text regardless of demographic, age, gender, accent, dialect, or location in real-time and on recorded media. Combining these transcripts with the latest AI-driven speech capabilities, businesses build products that utilize summarization, topic detection, sentiment analysis, translation, and more.
    How is Speechmatics different?
    * The most accurate speech recognition on the market
    * 55 languages with vast accent and dialect coverage
    * Cloud-based or on-premises deployment options for data security
    * Real-time transcription with low latency and high accuracy
    * Real-time translation with 69 language pairs
    * Speech Understanding features such as Summaries, Sentiment, Topic Detection, Chapters, Audio Events
    * Fast and secure transcriptions for pre-recorded audio
    * Automatic translation and language identification
    * A culture of R&D in deep learning and speech recognition
  • 18
    Sembly Reviews

    Sembly

    Sembly

    $10 per month
    Sembly is a web and mobile app that accompanies you on your Teams, Zoom, and Google Meet meetings, making meeting content available for review, search, and sharing. Share a part or the whole meeting with your team so everyone can get up to speed, even if they didn’t attend. Save time with summaries that Sembly generates automatically. Sembly is available in English across Web, iOS & Android mobile apps. The smartest AI meeting assistant that helps you easily review & share meeting takeaways, meeting records, and transcriptions. It turns your meetings into searchable text, highlights key discussion moments, and creates notes and summaries. Use Sembly Team to unlock powerful AI analytics to help you and your team achieve more, while attending less! Sembly automatically syncs to your calendar to join and record all your scheduled meetings on all major conferencing platforms. This reduces the need to take notes on-call. You can review what was said at a particular meeting, search through all your meetings, and share key items with your team members or friends. Designed for businesses of all sizes, Sembly is an AI-based meeting management solution!
  • 19
    Scribe Reviews

    Scribe

    Scribe Technology Solutions

    $59.95/month/user
    ScribeNow is now available! ScribeMobile's flagship product, ScribeMobile Speech Recognition, is now available in your palm. This is the future of medical documentation. ScribeNow enhances ScribeMobile's already strong set of documentation services. ScribeNow allows providers to quickly and easily record encounters using speech recognition. Providers have the flexibility they need to improve productivity, profitability, and patient care. This easy-to-use solution has a wide variety of integration capabilities. Scribe TeleCare is an innovative solution that allows healthcare providers to continue servicing their clients and have completed documentation to support their care of their patients. It also facilitates reimbursement using one easy-to-use tool. You don't have to use an app not designed for healthcare to connect to your patients remotely.
  • 20
    Simon Says Reviews

    Simon Says

    Simon Says

    $0.17/one-time
    It used to be difficult to transcribe meetings. Simon Says uses advanced artificial intelligence technology to quickly and accurately transcribe recordings. Transcription costs only $1 per 30 minutes, so it costs $2 to transcribe a 1-hour meeting. You can then refer back to the notes and next steps and share them with others. This iOS app lets you record audio from your meetings or interviews, transcribe it, and view and bookmark the transcript. The transcript can be exported to Word, text, and a variety of other formats. There are better things you can do. Get auto-transcribing to help you find the most meaningful moments in your meetings. Apple featured Simon Says in their keynote announcing Final Cut Pro X. Download the Simon Says macOS app from the Mac App Store to import files from your Mac computer.
  • 21
    Voximal Reviews

    Voximal

    Ulex Innovative Systems

    $25/month/channel
    Voximal adds a VoiceXML interpreter to Asterisk for your business. Voximal is a modern and innovative piece of software. It runs on the Asterisk open-source framework and lets you extend and manage Asterisk solutions using the standard VoiceXML language. With Asterisk, you can make, receive, and monitor calls from your platform, and your telephony system can be highly scalable. VoiceXML syntax allows you to control your calls, and Voximal makes it easy to make, manage, and route calls. You can use the standard VoiceXML language to create complex voice telephony services and IVR portals. Voximal is compatible with most Asterisk releases and Linux distributions.
  • 22
    SpeechText.AI Reviews

    SpeechText.AI

    SpeechText.AI

    $19 one-time payment
    Transcribe audio and video to text with domain-specific speech recognition. SpeechText.AI is artificial intelligence software that converts speech to text and allows audio transcription. How it works: Upload audio and video files. The AI transcription software can transcribe speech to text in all file formats. Select domain. Select an industry domain and an audio type from predefined categories. This will improve the recognition accuracy for domain-specific words. Transcribe. Our speech transcription engine uses state-of-the-art deep neural network models to convert audio to text with near-human accuracy. Edit and export. Use interactive editing tools to search, modify, and verify audio transcriptions. Export your content in different formats. Why SpeechText.AI: a variety of features that will allow you to transcribe audio and video in just seconds. Speech recognition. Powerful speech-to-text technology. SpeechText.AI is fully GDPR compliant. All our physical servers are hosted in Europe (France) and we encrypt all your data sent between you and the service. SpeechText.AI is fully automated, hence your data is confidential and the process has no place for human-factor and other risks that manual transcription has.
  • 23
    SoapBox Reviews

    SoapBox

    Soapbox Labs

    upon request
    SoapBox was created for children. Our mission is to transform learning and play for children all over the world using voice technology. Our low-code, scalable platform has been licensed by education and consumer businesses worldwide to provide world-class voice experiences for literacy, English language tools, smart toys and games, apps, robots, and other market products. Our proprietary technology is independent and reliable. It can be used by children aged 2 to 12. It can also recognize different dialects and accents around the world and has been independently verified not to have any racial bias. The SoapBox platform is built with a privacy-by-design approach. Our work and philosophy are based on protecting children's fundamental right to privacy.
  • 24
    Picovoice Reviews
    Picovoice is the developer-first voice AI platform with a mission to accelerate the adoption of voice AI. Acknowledging the limitations of the cloud and lack of transparency, Picovoice differentiates itself by on-device processing, publishing open-source benchmarks and making its technology available to anyone. Picovoice’s offerings, speech-to-text, voice search, wake word, intent and voice activity detection run anywhere from tiny MCUs to web browsers, providing an immersive experience.
  • 25
    Work by Speech Reviews

    Work by Speech

    Mikołaj Magowski

    Free
    Work by Speech is the only application that allows you to work on a computer by speaking, without using a keyboard and mouse.
    Application Key Features:
    - Effective work on a computer using speech alone
    - Quiet speaking support
    - Application switching and opening via speech
    - Built-in speech commands to perform the most common actions
    - Advanced custom speech commands management
    - Macro recording
    - Separate dictation mode
    - Support for all mouse actions, quick and repeatable by speech
    - A customizable mousegrid that can also be moved using speech
    - Automatic mousegrid optimization for each used program
    - Very low system resources usage
    - Works with any microphone under Windows 10 and 11
    - Available for the English language only
    - Updates are free