Speech Recognition Software
Speech Recognition software allows computers to interpret human speech and transcribe it to text, or to translate text to speech. Speech Recognition solutions also allow users to use voice commands to control computers. These applications are used in interactive voice response (IVR) systems to help quickly route incoming calls to the correct destination. Speech Recognition software is related to IVR software.
Filter Results (75)
- Cloud, SaaS, Web (55)
- Installed - Mac (7)
- Installed - Windows (26)
- Mobile - Android Native (14)
- Mobile - iOS Native (18)
- Audio Capture (47)
- Automatic Transcription (40)
- Call Center (9)
- Call Logging (7)
- Call Recording (10)
- Concatenated Speech (14)
- Customizable Macros (21)
- Multi-Language (39)
- Speech-to-Text Analysis (33)
- Voice Recognition (33)
The industry leading speech recognition software used by doctors, lawyers, and other professionals to convert speech into text. Starting at $119.99 for the Premium Edition, Dragon has been used by thousands of professionals for dictation and transcription for over 30 years. Runs on both Windows and Mac platforms. Turn speech into text by dictating into Windows-based applications at speeds up to 160 words per minute.
Technical computing system that provides tools for image processing, geometry, visualization, machine learning, data mining, and more. Technical computing system that provides tools for image processing, geometry, visualization, machine learning, data mining, and more.
Sonix is not a typical transcription service. Sonix is an online platform. Upload a file to Sonix, and you'll have an online transcript in less than 5 minutes. Browser-based transcript stitches audio/video to text. Easily search & analyze all your transcripts for qualitative analysis and decoding. Multiuser permissions make it easy to share transcripts across team members. Create video subtitles and captions in minutes. Dozens of export options, integrations, and API. Independently reviewed as the most accurate automated transcription service. $5/hour of audio/video! Transcripts in under 5 minutes.
KOOKOO is an omnichannel contact center suite used by 1500+ businesses worldwide for their inbound and outbound interactions. Access enterprise level cloud features at 40% lower TCO, in both VOIP and PSTN countries. Reduce handle times, and exceed SLAs with multiple tools: IVR, speech recognition, intelligent call routing, bots, live monitoring, dialers and more. Go live in a few hours, even integrating with your existing telecom provider if needed. KOOKOO CloudAgent is a perfect fit for your inbound and outbound contact center. Access enterprise level features at 40% lower TCO.
Multi-language speech recognition software with the ability to dictate in any third party software or to fill forms on websites. Apart from dictation, Braina also provides voice command features that allows you to search the web, open file, programs & websites, find information, set reminders, take notes and much more. You can use your voice to dictate text to your Windows computer, automate processes and improve your personal and business productivity. Multi-language speech recognition software with the ability to dictate in any third party software or to fill forms on websites.
A speech recognition and conversion solution with multi-language speech recognizer, documents & emails transcriber, and more. A speech recognition and conversion solution with multi-language speech recognizer, documents & emails transcriber, and more.
Through technology, insight and experience, BigHand delivers success for the future by helping its clients achieve professional productivity and operational excellence. The leading software technology company has developed a range of solutions from task delegation, document creation, matter pricing, digital dictation workflow, intuitive reporting and analytics, that help busy people achieve more in less time and organizations become more efficient and effective. BigHand offers speech, workflow, document creation, process improvement, matter pricing and BI solutions for law firms of all sizes.
NexGen Mobile Solutions (formerly Entrada) cloud-based engagement platform for healthcare providers streamlines workflows & reduces physician burnout. Providers can view their clinical schedule and EHR patient data from their mobile device and dictate patient encounters anytime, anywhere that populate inside the EHR. They can also communicate with their care team through secure text messaging. Available on Android and iOS platforms for physician groups of all specialties and sizes. NexGen Mobile Solutions (formerly Entrada) solves physician burnout by improving EHR workflows through its speech-driven documentation.
CallFinder is an affordable and scalable SaaS speech analytics solution designed to help both small and large businesses automate the quality monitoring process for improved agent performance and compliance. With CallFinder's call transcriptions and sentiment analysis, businesses gain 100% visibility into customer-agent interactions and an un-biased scoring methodology to implement across agents and teams allowing them to deliver a better Customer Experience. Schedule a live demo today! CallFinder is a SaaS speech analytics solution built to help SMBs improve agent performance and customer experience.
Speech recognition software for hospitals and medical practices. Allows to dictate notes straight into a Windows-based EMR. Speech recognition software for hospitals and medical practices. Allows to dictate notes straight into a Windows-based EMR.
Go Transcribe provides the latest software invention to convert speech in to text which will save you time, money and effort. Simply upload your files onto our platform using any device and your file will be converted in a matter of minutes. The transcription can be viewed on our unique online editor. You can playback the original file and jump to specific parts of the audio and make amendments to the transcription where required. Your transcription can be downloaded to several popular formats. Cloud based transcription service powered by artificial intelligence. Automatically converts audio/video files into text
SmartAction is the only provider of a fully automated voice and text-enabled omnichannel solution, running in the cloud and powered by artificial intelligence. Our solution, IVA, is a centralized AI engine that automates customer service across voice, SMS, text, chat, mobile, and social media. We consultatively work with companies to deliver effortless customer service across any of the channels their customers choose. A fully automated voice and text-enabled omnichannel solution, running in the cloud and powered by artificial intelligence.
Reason8 is an AI-powered service for automatic note taking and preparation of summaries for in-person business and scrum meetings. We provide the best note taking quality on the market because we use multiple smartphones and AI patent pending approach to boost quality of speaker separation and drafting meeting summaries. We are actively working on advanced summarization, collaboration features for teamwork, and integrations with project management services and communication tools. AI-powered service for automatic note taking and preparation of summaries for in-person business and scrum meetings
Great speech recognition & instant voice translation web app that emphasizes on simplicity and natural speech by auto punctuating. Features: AUTO-PUNCTUATION, marks and saves TIMESTAMPS, editable, AUTOMATICALLY SAVES, transcribes audio files, phone conversations and exports to captions. No user registration necessary. Use it for dictation, transcription, interviews, hard of hearing, real time interpreter and more. Speechlogger is powered by Google's ASR APIs to achieve best results. Great free speech recognition & instant voice translation web app that emphasizes on simplicity and natural speech by auto punctuating.
Trint uses artificial intelligence to power its web-based automated transcription platform. Audio and video files are uploaded to Trints online software and then transcribed using automated speech recognition. The Trint Editor is the marriage of a text editor to an audio/video player: the transcribed text is stitched to the audio or video file, making it simple to search, verify and edit the machine-generated transcripts. Trint goes beyond transcription to provide the most innovative platform for searching, editing & getting the most out of your content.
Mobile and Cloud-based solution for businesses that helps upload audio files through web, mobile, or cloud & document them to text. Mobile and Cloud-based solution for businesses that helps upload audio files through web, mobile, or cloud & document them to text.
Castel Detect LIVE is the LIVE alternative for contact center speech analytics. It provides LIVE compliance and post-call analysis, supporting your quality assurance initiatives. This centers focus on agent behaviors positively and negatively impacting customer experience outcomes. Our analytics process occurs during a LIVE call, so you can take real-time action to ensure compliance and best practice adherence. We provide voice-based analytics, event targeting, agent alert, and workflow tools. Castel Detect LIVE analyzes LIVE calls with high accuracy, alerts, reminders, scripting, and call scoring. Ensure real-time compliance.
Web-based application that allows providers universal access to their work, as well as e-signature and report management capabilities. Web-based application that allows providers universal access to their work, as well as e-signature and report management capabilities.
WSR is an enterprise speech recognition solution that offers front-end (client-side) and back-end (server-side) voice-to-text recognition. With WSR, speech recognized text can be accessed immediately by the author or automatically sent to support staff for review and editing (if needed) - enabling your key earners to focus their time on more revenue generating activities and less on administrative tasks. WSRs voice-to-text technology is easy to use, accurate and light on IT resources. An enterprise speech recognition solution that offers front-end (client-side) and back-end (server-side) voice-to-text recognition.
Speech surveillance and metrics analysis software. This includes text transcription with alert generation and disposition mechanism, and metrics analytics. Speech surveillance and metrics analysis software. This includes text transcription with alert generation and disposition mechanism, an
Language channel type and accent agnostic speech-to-text solution. Speaker identification and voice activity detection technologies. Language channel type and accent agnostic speech-to-text solution. Speaker identification and voice activity detection technologies.
Express Dictate software is a voice recording program that works like a dictaphone. It lets you use your PC or Mac to send dictation to your typist by email, Internet or over the computer network. Professional dictation voice recorder. Works like a traditional dictaphone. Send dictation instantly via the Internet. HIPAA compliant secure encryption. Record to wav, mp3 or dct formats. Easy-to-use interface so you can be dictating in just minutes. Record and send dictation directly from your computer with Express Dictate Digital Dictation Software.
Crescendo Speech is the first engine to support speaker independent speech recognition for large vocabularies. Available for both front and back-end use, the engine requires zero training with out-of-the box accuracy rates reaching over 95%. Comprehensive speech recognition solution for professional, dictation-intensive environments.
Bytescribe's online transcription dictation software platform, allows transcriptionists to effortlessly manage their workflow so receiving dictation files to transcribe and delivering completed work is fast and easy. Dictators can dictate from any phone to the provided toll-free number, use a handheld recorder or our iShuttle Dictate iPhone/iPod/iPad app. WebShuttle is designed to manage all aspects of transcription workflow for organizations of any size. Multi-line telephone dictation and transcription system. Stores on a computer hard drive as compact voice files.
VoltDelta OnDemand Solutions provides a hosted infrastructure for enabling virtual contact centers and home agent call distribution and management, inbound and outbound voice recognition applications, and voice of the customer call and agent screen recording. VoltDelta supports more than 2.4 billion calls and 2 billion SMS text messages per year. Hosted automation center to handle all IVR/speech applications with intelligent ACD and CTI abilities.
Cloud-based speech recognition software that enables users to play games, control applications and create custom speech commands. Cloud-based speech recognition software that enables users to play games, control applications and create custom speech commands.
Software utilizing voice biometrics to create solutions for security, either web based or installed, with custom reporting and more. Software utilizing voice biometrics to create solutions for security, either web based or installed, with custom reporting and more.
Rubidium, covers the entire scope of a voice dialogue system: input, output and interaction. We are continuously innovating industry leading speech processing solutions for embedded applications, such as TTS, ASR, Speech Compression and Biometric Speaker ID. We help OEMs/ODMs provide customers with a hands-free, more productive user experience. Our low cost, small footprint, multi-lingual VUI solutions enable consumer product developers to get their products to market as fast as possible. Speech processing solutions for embedded applications, such as TTS, ASR, Speech Compression and Biometric Speaker Identification.
Speech recognition for your audio and video files. Speech to text, speaker diarization, voice activity detection. API for easy integration of SpokenData speech recognition into various applications. Advanced transcription editor, adaptive speech recognizer adaptation on user data. Speech recognition for your audio and video files. Speech to text, speaker diarization, voice activity detection.
A web-enabled, application service provider (ASP) technology platform for traditional and speech recognized medical transcription. SpeechRite for radiology is a front end speech recognition program with excellent quality, and comprehensive workflow that supports all dictation preferences. It is offered at NO COST, NO HARDWARE, NO RISK, and PAY-PER-USE. It integrates with all PACS/RIS using xml file exchange. It has modules for CTRM, BIRADS, Addendums, Priors, Templates, and macros. ASP web-based dictation and transcription workflow solution for hospitals, MTSOs, clinics, physicians, of any size.
Ameyo Engage is a Cloud-based Call Center Software that allows a business to take control of their operations by deploying faster changes to Customer Interaction Initiatives and engaging employees, which results in Better Customer Experience, increased Sales & Collections, and ultimately acquire loyal Customers & create happy Employees. Grow your business by gaining customer loyalty with a world class customer contact center software
A secure, cloud-based speech recognition platform for clinicians to securely document patient encounters of all types. Meet more patients and focus on providing care by significantly reducing the time spent in documentation. iPhone and Android apps. No profile creation or training needed. There are no upfront costs; only pay a monthly fee. Access to eCareNotes Customer Service Team 24x7 included. eCareNotes Cloud-based Speech Recognition for Clinicians: Simple - Affordable - EMR Ready
Speech recognition and radiology reporting solution that everyone can afford Verbatim is the industrys newest and technically most advanced speech recognition and radiology reporting solution that does not burn a hole in your pocket. With the accuracy of 99% and built-in intuitive workflows, you can complete your reports fast and easy. Verbatim from Saince is a versatile and powerful front end speech recognition software.
Turn speech into text with voice recognition software that is ver 98% accurate & based on conversational modeling for health care & IT. Turn speech into text with voice recognition software that is ver 98% accurate & based on conversational modeling for health care & IT.
Yactraqs audio mining solution provides call centers with advanced speech analytics capabilities that allow our customers to make call center recordings searchable and reportable. Our customers can utilize our tool to index 100% of their recorded phone calls to uncover high impact and actionable data on Voice-of-the-Customer insights, agent performance evaluation, customer service analysis, compliance applications, and more. Yactraq is cutting edge in audio mining and speech analytics with machine learning driven insights extracted from any audible media.
Sesame is a voice biometric identification system. Sesame uses natural speech for real-time caller identification, creating a voice print based on previous calls without the need of any enrollment process. What can Sesame do for you? Combats Call Center fraud, classification, anti-spam, answering machine detection, sentiment analysis and management Voice biometric identification system with automatic identification of clients voice, gender, age and language.
Wynyard VFA is an analyzing tool that helps in identifying the person behind an unclaimed voice or decoding the speech in a readable format from an unclear voice. It is a web application that recognizes the identity of the speaker. The application is beneficial for the law enforcement and Government bodies to prevent crimes. The best way to analyze recorded voices and reveal identity.
Phonexia transforms voice to knowledge with its innovative speech analytics and voice biometrics technologies. Its Phonexia Speech Platform is the first on the market using exclusively deep neural networks to allow speaker identification with extremely accurate and fast results. A university spin-off, Phonexia has been delivering its technologies to call-centers, financial institutions, and security agencies in more than 60 countries since 2006. Phonexia transforms voice to knowledge with its innovative speech analytics and voice biometrics technologies.
GoVivaces Automatic Speech Recognition engine can accurately recognize spoken words and convert speech into text. It supports several English accents and can be localized to any language. Also, it supports standard telephony as well as web and mobile applications. The GoVivace's ASR engine is suitable for a wide variety of applications such as IVR systems, call transcription, live dictation and closed captioning. An Automatic Speech Recognition engine which understands natural language accurately and converts speech into text.
AppTek artificial intelligence and machine learning-based automatic speech recognition and machine translation platform is deployed for the media and entertainment industry as well as call centers. Leveraging over 30 years worth of experience its scientists and research engineers support the research and development of practical systems AppTek enables the highest quality automatic speech recognition and machine translation solutions available anywhere for enterprises everywhere. AppTek offers proprietary artificial intelligence and machine learning-based automatic speech recognition and machine translation.
With its Voice API, TENIOS operates an interface for voice services, which enables the integration of customer-specific voice applications via web technologies into the cloud communications platform. The Voice API bundles a number of functions (in particular dynamic call control) that allow software applications to initiate and receive calls without developers having to deal with telecommunications technologies and protocols. The TENIOS Voice API enables the integration of speech services into your cloud telephony via common web technologies (https, REST).
AISB Engine powered by ArmorVox is a language independent voice biometric engine designed for integration into third party applications, solutions and services which using patented speaker adaptive machine learning algorithms. Applications include contact centers and IVR, websites, chat, messaging, digital apps, social media and wearable technologies. Crossmatch 25M Voiceprints per hour verifying within Milliseconds. Average Company saves 15M with Voice Biometrics over 3 years. Current leading authentication and biometric identification solutions cannot prevent hacking and identity theft!
Designed to understand human spoken language expressed in a natural way by converting speech-to-text in real-time, using DNN models. Designed to understand human spoken language expressed in a natural way by converting speech-to-text in real-time, using DNN models.
On-premise communications tool which assists contractors with voice transcription, scheduling, documentation, and task planning. On-premise communications tool which assists contractors with voice transcription, scheduling, documentation, and task planning.