News
News
8/24/2006
06:15 PM
Connect Directly
RSS
E-Mail
50%
50%
Repost This

Researchers Prove They Can Isolate Voices In A Crowd

Researchers at the University of Missouri-Columbia say they have found a mathematical solution to the "cocktail party" problem, separating one sound from a recording of a noisy environment.

Researchers at the University of Missouri-Columbia are hoping computer programmers can help them with a solution to a decades-old "cocktail party" problem.

The researchers have found a mathematical solution that allows them to separate one sound from a recording of a noisy environment -- like a single voice from the din of conversation at a cocktail party. Mathematics professors Dan Casazza and Dan Edidin and Radu Balan, of Siemens Corporate Research, solved the problem and demonstrated that it is possible to isolate distinct voices and reconstruct spoken words.

"Our solution is called 'signal reconstruction without noisy phase,'" Edidin said. "In speech recognition technology, a 'signal' could be a recording of 25 people in a room talking at the same time. Our solution shows that we can pull out each voice individually, not just with the words, but with the voice characteristics of each individual. We showed that this 'cocktail party problem' is mathematically solvable."

The National Science Foundation and the National Security Agency funded part of the research, which could have crime fighting, homeland security and other intelligence applications.

In the past, researchers were able to separate voices but not reproduce the characteristics of the voice itself, according to a statement from the university. Casazza said existing programs that can separate and reconstruct voices are not completely reliable because they have difficulty separating voices with similar pitch characteristics. The researchers claim that a program using their solution would be more exact.

"Theoretically, our solution says you should be able to pick up voices on a squeaky old microphone and then separate them all out so that you can hear what each person is saying in his or her own voice," Casazza said. "This is a very old problem, and we have the first mathematical solution to it."

However, they were not able to create an algorithm that will produce consistent results.

"The computer we use is doing the work without an algorithmic program. It uses a system called a neural net, which is designed for the computer to teach itself. Basically, it works on trial and error," Casazza said. "This isn't consistent and cannot be duplicated easily. We need to find a way to design an implementable algorithm that could do this consistently and quickly."

Comment  | 
Print  | 
More Insights
The Agile Archive
The Agile Archive
When it comes to managing data, donít look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Elite 100 - 2014
Our InformationWeek Elite 100 issue -- our 26th ranking of technology innovators -- shines a spotlight on businesses that are succeeding because of their digital strategies. We take a close at look at the top five companies in this year's ranking and the eight winners of our Business Innovation awards, and offer 20 great ideas that you can use in your company. We also provide a ranked list of our Elite 100 innovators.
Video
Slideshows
Twitter Feed
Audio Interviews
Archived Audio Interviews
GE is a leader in combining connected devices and advanced analytics in pursuit of practical goals like less downtime, lower operating costs, and higher throughput. At GIO Power & Water, CIO Jim Fowler is part of the team exploring how to apply these techniques to some of the world's essential infrastructure, from power plants to water treatment systems. Join us, and bring your questions, as we talk about what's ahead.