Skip to content

Spotify Launches AI Voice Translation Pilot

SASKTODAY's newest columnist, Shelly Palmer has been named LinkedIn’s “Top Voice in Technology,” and writes a popular daily business blog.
shellypalmertuesday

Spotify has launched a pilot called Voice Translation for podcasts, an AI-powered system that clones original voices as it translates podcasts into different languages using technology from OpenAI. The initiative includes podcasters such as Dax Shepard, Monica Padman, Lex Fridman, Bill Simmons, and Steven Bartlett, and it provides AI-powered voice translations in Spanish, French, and German for select episodes.

Ziad Sultan, VP of Personalization at Spotify, emphasizes the feature's ability to enhance global podcast discovery: “By matching the creator’s own voice, Voice Translation allows listeners worldwide to discover new podcasters more authentically.”

The translated episodes, available to both Premium and Free users worldwide, maintain the speaker’s distinctive speech characteristics, offering a more natural and personal listening experience. Episodes such as Lex Fridman's “Interview with Yuval Noah Harari” and Steven Bartlett’s “Interview with Dr. Mindy Pelz” are now globally accessible, bridging linguistic gaps and broadening the scope of global podcast listenership.

This may be an obvious use case for voice cloning and translation technology, but success is far from guaranteed. Translation is cultural and complicated. Metaphor does not always translate, humor is highly regional, timing is correlated with sentence structure… said differently, something may be hysterically funny in New York but be a total head-scratcher everywhere else.

Universal translation is a worthy goal; kudos to Spotify and OpenAI for this awesome pilot. My friends at Google Translate have told me that English is understood by approximately 20 percent of the global population. (Anecdotally, approximately 50 percent of the web is written in English.) Imagine how much value could be unlocked by making all of Spotify's English language audio available to the rest of the world.

Now, imagine how much value you would unlock in your business if you could universally translate your content. AI-powered video dubbing and voice cloning services are widely available, and prices are dropping quickly. At some point in the very near future, you should expect universal translation to become as commonplace as Instagram filters.

As always your thoughts and comments are both welcome and encouraged. -s [email protected]

P.S. CES® 2024 is just around the corner. We're excited to be back in Las Vegas from January 9-12 offering customized 90-minute executive briefings and floor tours (details here). I'd welcome the opportunity to discuss how we can help you and your team get the most out of CES. Just reply to this email.

 

ABOUT SHELLY PALMER

Shelly Palmer is the Professor of Advanced Media in Residence at Syracuse University’s S.I. Newhouse School of Public Communications and CEO of The Palmer Group, a consulting practice that helps Fortune 500 companies with technology, media and marketing. Named LinkedIn’s “Top Voice in Technology,” he covers tech and business for Good Day New York, is a regular commentator on CNN and writes a popular daily business blog. He's a bestselling author, and the creator of the popular, free online course, Generative AI for Execs. Follow @shellypalmer or visit shellypalmer.com

push icon
Be the first to read breaking stories. Enable push notifications on your device. Disable anytime.
No thanks