20 results
Why Capterra is free
Twilio provides simple, pay-as-you-go APIs for businesses to build scalable, reliable voice and SMS apps for the web or mobile devices.
Twilio is the worlds leading cloud communication platform that enables you to engage customers across channels - SMS, voice, video, email, WhatsApp and more. Pay-as-you-go APIs allow businesses to scale communications reliably. Learn more about Twilio

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Descript is an all-in-one audio and video software that makes editing as simple as editing a word doc. Edit video by editing text.
Descript is an all-in-one audio and video editor that makes editing as easy as a word doc. Upload media or record directly in Descript to instantly transcribe your file into text, then tweak the text to directly edit your media clips. Edit out filler words and silent gaps with a single click. Record your screen and webcam for presentations and video messages and edit out mistakes before publishing. Export your project to other pro apps. Learn more about Descript

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Writing tools that include a translator, proofreading, sentence rephrasing, dictionary, and text-to-speech feature.
Ginger Software is an award-winning productivity-focused company that helps you write faster and better, thanks to grammar checker, punctuation, and spell checker tools which automatically detect and correct misused words and grammar mistakes Learn more about Ginger

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Human-like AI voiceover & text to speech platform with 34 languages & 160+ voices.
Human-like AI voiceover & Text to Speech platform with 34 languages & 160+ voices. Clone the voice of your choice for your IVR / OHM systems, or customize your brand's voice on marketing & sales channels! Leverage our TTS APIs to integrate seamlessly with your existing tools to automate the creation and distribution of audio files Learn more about LOVO Studio

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Instantly Transform Any Text Into a 100% Human Sounding VoiceOver With Only 3 Clicks!
Voicely is a cloud-based app which produces human sounding voice-over from your text. Voicely allows you to change the Voice Type, Pitch, & Speed as well as add professional background music to give more depth and excitement to your voice-over. This, of course, is completely optional. Learn more about Voicely

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Software API to convert text into natural sounding audio files for websites and applications.
Software API to convert text into natural sounding audio files for websites and applications.

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Text-to-speech software offering more than 65 languages and 400 voices, including both standard and AI (neural) voices.
Blakify is a Text-To-Speech app that turns any text into audio. Social Media, Voice Over's, Podcasts, or YouTube. These are just a few ways you can utilize our software. Instead of paying voice actors to narrate text, video presentation, or even your next Audiobook, Blakify can do all this in a matter of seconds. With 65 languages and over 400 voices, you can even turn your blog post from, say English to French, paste in the article, and let Blakify do all the work Learn more about Blakify

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
A text-based voice over maker with hyper-realistic AI voices. Suited for Enterprises, SMBs as well as freelancers.
You can start by typing in your script or just upload your home-style voice recording and convert it into a studio-quality AI voice over within minutes. MURF also makes it really simple to match the timing of your voice with videos or presentations within the tool itself. With MURF you can - Generate realistic voices via text for presentations and videos - Convert home-recorded audio or video to AI professional voices - Edit your audio through text Learn more about Murf Studio

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Text to Speech Converter Create realistic voices for any text in seconds
PollySpeech allows you to turn any text into lifelike speech, allowing you to create various media content such as audiobooks, podcasts, voice content, and also applications that talk and build entirely new categories of speech-enabled products. PollySpeech.com’s Text-to-Speech (TTS) service uses advanced deep learning technologies of leading cloud service providers such as Amazon Web Services, Microsoft Azure, Google Cloud Platform and IBM Cloud to synthesize natural-sounding human speech. Learn more about PollySpeech

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Cloud Text-to-Speech is a Google-powered Text-to-Speech API that can convert text into natural-sounding speech.
Cloud Text-to-Speech is a Google-powered Text-to-Speech API that can convert text into natural-sounding speech.

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Synthesia CREATE is a web platform that enables an entirely new form of AI-driven video production for professionals.
Synthesia CREATE is a web platform that enables an entirely new form of AI-driven video production for professionals. Rather than filming content with a camera, we use software to simulate real video eliminating the need for film crews, studios and cameras. This allows for fast and affordable creation of presenter-led video. Rather than filming content with a camera, CREATE uses AI to simulate real video eliminating the need for film crews, studios, actors and cameras. Learn more about Synthesia

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
It is a text-to-speech solution that uses machine learning technology to create applications with human-like text-to-speech voices.
It is a text-to-speech solution that uses machine learning technology to create applications with human-like text-to-speech voices.

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Listen2It automatically converts text content into audio, choosing from 600+ text to speech voices.
Listen2It automatically generates an audio version of text content. Choosing from 600+ lifelike text to speech voices in 75 different languages, users can give their brand a unique voice. In addition, listen2It gives full control to the user to customize advanced controls like pitch, speed, tone, creating millions of voice combinations. It also offers a pre-built audio player with customizable designs, colours and buttons to match the brand. Learn more about Listen2It

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Next generation AI Voiceover & Text to speech platform with human-like voices.
When you’re making video content that will be viewed all over the world, you need different voiceover options when it comes to languages and accents. Talkifier makes it easy to find the perfect voice for your video content with 400+ voice options in 65 languages with full commercial rights at your fingertips. Whether you’re launching video content in another country or trying to localize an online course, you can choose from a wide range of voices to match your needs. Learn more about Talkifier

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Voiceley is a Cloud-based solution that enables users to translate any text into lifelike speech using advanced Neural TTS providers.
Voiceley allows you to turn any text into lifelike speech, allowing you to create various media content such as audio books, podcasts, voice contents and also applications that talk, and build entirely new categories of speech-enabled products. Voiceley’s Text-to-Speech (TTS) service uses advanced deep learning technologies of leading cloud service providers such as Amazon Web Services, Microsoft Azure, Google Cloud Platform and IBM Cloud to synthesize natural sounding human speech. Learn more about Voiceley

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Azure Text to Speech is a text-to-speech service that can easily convert text from 70+ languages into lifelike text.
Azure Text to Speech is a text-to-speech service that can easily convert text from 70+ languages into lifelike text.

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Watson Text-to-Speech is an API cloud service that can convert text into natural-sounding audio within the Watson application.
Watson Text-to-Speech is an API cloud service that can convert text into natural-sounding audio within the Watson application.

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
WellSaid is a text-to-voice solution that can create natural voiceovers as well as voice avatars for any branded digital content.
WellSaid is a text-to-voice solution that can create natural voiceovers as well as voice avatars for any branded digital content. Simply enter text in the Studio, and in just a click, you have realistic ai text to voice for any project. Learn more about Wellsaid

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
Build a World of Audio for your Audience - Create smart audio experiences from your content in just a few clicks
Trinity Audio with its unified audio platform is an AI company helping content creators of all sizes to build their audio future and provide audio experiences for their audiences. The company’s technology instantly converts content to audio with the most natural sounding voices, continuously learns listeners' behavior, and creates futuristic smart audio experiences, covering every stage of the audio journey from creation to distribution. Working with Trinity Audio, content creators Learn more about Trinity Audio

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices
TTSAI® Pro Cloud-based solution that allows users to convert text to speech in 80 languages with 1K+ voices by Artificial Intelligence
TTSAI® Pro service covers 90+ languages and 1000+ voices and is increasing continuously. TTSAI® Pro supports both standard voice and AI voice (Known as Neural Voice). With standard voice, you got lower cost. With AI voice, you got fluent voices. TTSAI® is a project made with the heart. All the income will be used to finance research in order to improve the quality of life and accessibility of the disabled and those who are digitally excluded. TTSAI® is STAR LEVEL One, CAIQ, GDPR CoC Learn more about TTSAI Pro

Features

  • AI Voices
  • Multi-Language
  • Multi-Voice
  • Phonetic Variation Detection
  • Audio Editor
  • Custom Voices