Tech & Science

New software edits voices like text

Published

July 9, 2017

The software is called VoCo and it allows the use to edit a transcript of an audio recording of a human voice by adding or replacing words. These replacement words are processed through the software and become automatically synthesized in the speaker’s voice. Such technology can allow anew interview, for example, where certain words or phrases on the part of an interviewee which are not clear to be altered. In the era of fake news however, such software could also provide the means to change the meaning or context of what someone is saying.

The new software has come from Princeton University, Engineering School in the U.S. and it is based on a sophisticated algorithm that utilizes machine learning to recreate the sound of a particular voice and also ‘learn’ from any mistakes made through human correction. The idea is to make the editing of podcasts and narration on videos, such as those placed on YouTube channels, far easier. It also negates the need for the narrator to return to the recording studio should some of the earlier recorded utterances be unclear.

In addition, the software can provide the starting point for creating personalized robotic voices that sound more natural and ‘human-like’. Despite technological advances, many computer voices continue to sound like Cybermen from Doctor Who.

Discussing the software on his university website, the lead developer Professor Adam Finkelstein said: “VoCo provides a peek at a very practical technology for editing audio tracks, but it is also a harbinger for future technologies that will allow the human voice to be synthesized and automated in remarkable ways.”

VoCo works by augmenting the waveform with a text transcript of the track and this allows the user to replace or insert new words that do not already exist in the track simply by typing in the transcript. As the user types the new word, VoCo updates the audio track and automatically synthesizes the new word by linking together snippets of audio from elsewhere in the narration. This happens due to an optimization algorithm that searches the voice recording and chooses the best possible combinations of partial word sounds, called “phonemes,” to build new words in the user’s voice.

A video from the university explains more about the genesis of the software:

The development of the software has been described in the journal Transactions on Graphics, with the paper titled “VoCo: Text-based Insertion and Replacement in Audio Narration.”

In this article:Audio, Communications, Software, Text

Written By Dr. Tim Sandle

Dr. Tim Sandle is Digital Journal's Editor-at-Large for science news. Tim specializes in science, technology, environmental, business, and health journalism. He is additionally a practising microbiologist; and an author. He is also interested in history, politics and current affairs.

World

US president signs bill to provide new aid for Ukraine

US President Joe Biden delivers remarks after signing legislation authorizing aid for Ukraine, Israel and Taiwan at the White House on April 24, 2024...

AFP18 hours ago

AfD leaders Alice Weidel and Tino Chrupalla face damaging allegations about an EU parliamentarian's aide accused of spying for China

World

Chinese spying claims deepen German far right’s woes

AfD leaders Alice Weidel and Tino Chrupalla face damaging allegations about an EU parliamentarian's aide accused of spying for China - Copyright AFP Odd...

AFP21 hours ago

Meta's growth is due in particular to its sophisticated advertising tools and the success of "Reels"

Business

Meta sees profits soar in first quarter

Meta's growth is due in particular to its sophisticated advertising tools and the success of "Reels" - Copyright AFP SEBASTIEN BOZONJulie JAMMOTFacebook-owner Meta on...

AFP14 hours ago

Iran's supreme leader Ayatollah Ali Khamenei leads prayers by the coffins of seven Revolutionary Guards killed in an April 1 air strike on the Iranian consulate in Damascus

World

Iran cuts Syria presence after strikes blamed on Israel: monitor

Iran's supreme leader Ayatollah Ali Khamenei leads prayers by the coffins of seven Revolutionary Guards killed in an April 1 air strike on the...

AFP24 hours ago

Digital Journal

Tech & Science

New software edits voices like text

Trending

Entertainment

Review: Kelli Berglund and Amadeus Serafini star in a new film directed by Tosca Musk

World

Op-Ed: Last gasp of stupidity — An American civil war as a serious topic

World

Splashy Saudi mega-project NEOM chases Chinese funds

Business

Revealed: The most common cases of credit card crime

Tech & Science

InstaDeep CEO takes AI from Tunis to London

You may also like:

World

US president signs bill to provide new aid for Ukraine

World

Chinese spying claims deepen German far right’s woes

Business

Meta sees profits soar in first quarter

World

Iran cuts Syria presence after strikes blamed on Israel: monitor