This humanlike synthesized speech could be the future of audiobooks

Share

Synthesized voices like those used by Siri and Alexa are fine for telling us the day’s weather forecast or how many minutes remain on a cooking timer, but would you really want their flat, monotonous tones reading you audiobooks? Probably not, which is why most of us turn to human-voiced services like Audible to get our audiobook fix. Human voice actors might not get the nod for too much longer, however, due to to the pioneering work of a London-based startup called DeepZen.

Using artificial intelligence algorithms, augmented by the technological firepower of IBM’s Power A.I. and Watson technologies, DeepZen has developed text-to-speech tools that not only sound human at first listen, but can also pick up on the emotional cues needed for reading text in a compelling manner. In doing so, the company claims that it could reduce the time and cost to produce audiobooks by up to 90%.

“Our system is truly revolutionary,” Taylan Kamis, CEO and co-founder of DeepZen, told Digital Trends. “It works using deep learning and neural networks to understand how a human talks and reads. We then train the system so it can recognize where to apply the right emotions and intonation when reading a piece of text. The result is humanlike speech very closely resembling the real thing.”

Inevitably, work like this can be cast as yet another example of cutting-edge A.I. tools threatening a human profession. In this case, that profession involves actors who, despite what a few high-profile figures are able to achieve, don’t have the most steady, stable careers as it is. It would be naive to think that software such as this won’t have an impact on the future of voice actors, but, as Kamis points out, there are plenty of scenarios in which tools such as DeepZen’s could be a net positive for humanity.

For example, it could make possible the creation of audiobooks based on works by new and emerging writers, or from publishers who don’t have the luxury of big budgets. It could also be used to help develop superior text-to-speech tools for people who have dyslexia or otherwise have trouble reading.

“As for the future, we are also looking at producing voice-overs for the video production industry, as well as gaming, where there is a need for real-time text-to-speech to enhance the player experience,” Kami said. “We are also looking at other languages.”

You can check out a sample of the system here.

Editors’ Recommendations

Amazon reportedly has thousands of workers listening to Alexa chats
The best shows on Netflix right now (June 2019)
Game of Thrones season 8 is coming! Here’s everything we know so far
What is Google Duplex? The smartest chatbot ever, explained
The best unlocked phones you can buy

News

Company:

How a rumored CPU might embarrass the PS5

Does your Mac need antivirus software in 2024? We asked the experts

One of Tesla’s biggest competitors is making a phone, and it looks great

Live Pixel 9 Pro photos surface, highlighting rumored design changes

Long-overdue Wear OS 4 update is coming to one of our favorite smartwatches, sort of

HP LaserJet Pro MFP 3101fdw review: a fast business printer for home offices

Spigen Ultra Hybrid Samsung Galaxy S24 case review: Should you buy it?

Razer Kishi Ultra review: Should you buy it?

The Asus ROG Zephyrus G16 completely challenged my expectations

CUKTECH 20 Power Bank review: Should you buy it?

I’ve worn two of the best smart rings. Here’s which one you should buy

I did a camera test with two $1,800 phones. Then something annoying happened

Google Pixel 7a vs. Pixel 7: don’t buy the wrong Pixel

This is the most unusual Galaxy S23 Ultra camera test I’ve ever done

I tested the Galaxy S23 Ultra and iPhone 14 Pro cameras. Only one is a winner

How to search ChatGPT conversations

How to set up Windows 11 without a Microsoft account

How to transfer a Wear OS smartwatch from one phone to another

How to type an em dash in Windows

Ask Jerry: How to fight email spam

8 iPhone browser apps you should use instead of Safari

Are Facebook and Instagram still down? Here’s what we know

Are Facebook and Instagram still down? Here’s what we know

The 1Password Android app just got a huge upgrade

I never knew I needed this mini Mac app, but now I can’t live without it

This humanlike synthesized speech could be the future of audiobooks

Editors’ Recommendations

Table of contents

How a rumored CPU might embarrass the PS5

Does your Mac need antivirus software in 2024? We asked the experts

One of Tesla’s biggest competitors is making a phone, and it looks great

Live Pixel 9 Pro photos surface, highlighting rumored design changes

Long-overdue Wear OS 4 update is coming to one of our favorite smartwatches, sort of

More News

How a rumored CPU might embarrass the PS5

Does your Mac need antivirus software in 2024? We asked the experts

One of Tesla’s biggest competitors is making a phone, and it looks great

Live Pixel 9 Pro photos surface, highlighting rumored design changes