TL;DR:
Text to speech (TTS) is technology that allows users to convert text content into audio content, often with the use of AI. It is used for a variety of purposes, such as accessibility and content creation. Some top TTS providers are Google Cloud Text-to-Speech, Murf, and NaturalReader. TTS voice quality has improved and sounds impressively humanlike, but still doesn’t sound exactly like a real human voice.
--
Marnie is an imaginary content creator.
She’s great at what she does—she can churn out quality content on a deadline and makes every effort to connect with her intended audience.
But there’s one thing she struggles with…
Voiceovers.
And the reality for her is that it’s quite time-consuming to create quality voiceovers.
On the other hand, Jake works for an organization that needs to make frequent announcements to staff and members.
Often they want their announcements to be voice announcements—but this just isn’t feasible.
What do Marnie and Jake have in common?
They can both benefit from the same thing: a text to speech solution.
Let’s go over the in’s and out’s of what text to speech is and how you can use it for your own purposes.
Text to speech (TTS) is a method of data input-output in which you can enter text content and receive an automatic audio output—without the need to manually create a recording.
In the current technological climate, text to speech is normally accomplished through the use of AI technology.
AI is able to synthesize a human voice, detect the text output you give to it, and then provide you with an audio playback of that synthesized human voice.
Text to voice is simply another way to say “text to speech.” There is no difference between these two phrases.
Text to speech occurs through the use of text to speech online generator tools, typically powered by AI.
These tools often allow you to select the “voice” from a range of voice options, depending on the style and language that you’re targeting.
Some top TTS online generators include:
These TTS generators will take the text input that you give them and will read aloud the content according to the voice options you select.
So with the examples from the introduction above, Marnie and Jake would be able input the script that they want, and the TTS generator would read aloud that text with a human-like voice.
Note I said human-like…
Does the speech output actually sound like a human voice?
In reality, most AI outputs do fall short of fully sounding like a real human.
But they do a pretty good job all things considered, and the technology can only get better.
So how do you actually use text to speech?
Text to speech is used for a variety of purposes, including but not limited to…
So what are some good free online TTS options?
Well, there are a few. But the free options do come with a caveat: you’re not able to access full features.
One key feature you may not be able to access is the ability to actually download your audio output—so keep that in mind.
But if all you want to do is hear (not download) an audio playback, there are some options available to you…
These all have limited free options that could be useful to you.
Most of the services that have a free option also have more robust paid options, so the list above would be the same.
Note that some of the pricing mentioned below would be at an annual rate, although monthly options are available as well…
There are a couple of options for text to speech that you can pursue via Google…
This was mentioned briefly above, but Google Cloud has a text to speech option.
(After all, they have everything else so why not text to speech?)
The caveat with Google Cloud’s option is that it doesn’t have its own platform, so you’d need developers to get it set up.
Another option besides Google Clouds is to use Google Docs, Google Sheets, or Google Slides.
For this to work, you first need to install a screen reader app.
Using Google Docs as the example—once you have the screen reader app installed, you can visit the “Accessibility” section within Google Docs and select “Turn on Screen Reader Support.”
A new “Accessibility” tab will appear in your main toolbar.
Select that “Accessibility” and the “Verbalize to screen reader” option to see your text to speech options within your Google document.
What about the actual voices in voice to text software?
Are the voices actually high quality? Are they natural-sounding?
Text to speech voices are better than they’ve ever been, thanks to AI.
While there’s still a long way to go before they can sound totally natural—like a real human being reading a script—some come surprisingly close.
One example of a voice to text tool that specifically markets itself as “natural” (it’s even in the name!) is NaturalReader.
And the reviews seem to be consistent with that claim to a natural sound.
(You can test out some of their voices to see for yourself.)
The bottom line is that it is better for a human to do a voiceover or to read the text in question. However, in the absence of a human who is able to physically do it—or in the absence of resources to fully invest in a quality human-recorded voiceover—text to speech voices aren’t too bad of a substitute.
One example of how powerful text to voice options can be is the recent Joe Biden voice imitation scandal, in which a robocall was sent out imitating Joe Biden’s voice via an AI text to speech tool.
This incident shows both the powerful potential and the dark side of text to speech.
Note that we do not condone in any way, shape, or form using text to speech to deceptively impersonate anyone—and most certainly not the President of the United States.
A similar example to Joe Biden is using text to voice solutions that imitate Donald Trump’s voice.
Again, this technology isn’t perfect but it is pretty amazing.
And again, we do not condone in any way using this technology to deceptively impersonate anyone at all.
It’s just an example of the amazing capabilities of text to speech in our current AI-driven climate.
Here are some answers to your frequently asked questions regarding text to speech…
Enabling text to speech will vary based on the device or service you’re wanting to enable it on.
For most devices and services (including both Android and Apple), you will go the the Accessibility section, which on phones is located under your Settings.
iPhones actually have two text-to speech options: one for reading back text that you select on the screen and one for your phone to read out loud text that you input into it.
To disable text speech from a device or service, you would simply go to the Accessibility section (where the text to speech option was enabled), and change the setting back.
On a phone, this should be a toggle that you can tap to disable the feature.
As mentioned above, to turn off text to speech on an Android device, go to Settings→Accessibility→Text-to-speech output.
From there you should be able to toggle off the text to speech option.
TikTok specifically has a text to speech function you can use when creating your videos.
While text to speech can be a good option for many situations, in some cases SMS can be a better option.
For example, people in the current climate are more drawn to text messages than they are to receiving a phone call from an automated voice.
Alternatively, you can combine both concepts for maximum effect—i.e., you could send a text message that links to the text to speech content you’re wanting to share.
Try a free SMS platform trial today to see if it would be a good fit for you.
Start sending mass text messages to your entire list today!
GET FREE TRIAL