Ads by TechWords

See your link here
Receive the latest technology news and information.
Computerworld Daily News (First Look and Wrap-Up)
Computerworld Blogs Newsletter
The Weekly Top 10
Cloud Computing
View all newsletters




Privacy Policy
 

Search Engines Break the Sound Barrier

August 5, 2002 12:00 PM ET

Computerworld - Do your telemarketers consistently make legally required disclaimers when selling securities? If your firm records its telemarketing calls, IT could set up audio mining software to let management search audio file archives to quickly find the answer.
Emerging audio mining tools, also called audio indexing or audio search software, offer speech processing and search technologies in a single package. The speech engine creates an index that includes a time and date stamp for each spoken word or phoneme in an audio or video file. The search engine then uses that index to allow rapid identification and playback of specific passages. The software may also apply metatags that identify the speakers or the subject of a given passage.
The speech-processing accuracy of speech-to-text engines, traditionally used to index high-quality broadcast audio, has advanced to the point where vendors are introducing new packages for indexing more informal conversations, ranging from corporate meetings to training videos and even help desk telephone conversations.
"[The technology] seems to have passed the threshold of usability," says William Meisel, president of TMA Associates, a speech-recognition consulting and market research firm in Tarzana, Calif.
Unlike speech-to-text packages, which can be trained for individual users, audio indexing products are speaker-independent. They also rely on large, language-specific vocabulary dictionaries, as well as domain models that may optimize for the type of conversation (e.g., telephone) or industry (e.g., health care). While the newest products can process audio at or faster than real time with an accuracy sufficient for searching, the output text isn't a readable transcript, cautions Jackie Fenn, an analyst at Stamford, Conn.-based Gartner Inc. And as new companies, products and terms come into use, users must update their systems regularly or face what Francis Kubala, division scientist at Cambridge, Mass.-based BBN Technologies, calls "the out-of-vocabulary problem."
Audio mining's most compelling fit may be for applications where a searchable index can replace the need for transcription. In contrast, data mining of audio content for marketing purposes is "a little bit of an evangelistic sell" at this point, Meisel says.
"The call center is a little tougher, because you may or may not discover something [with audio mining]," explains Fenn.
The technology's greatest value may be derived from embedding it in other applications. San Mateo, Calif.-based Virage Inc., for example, offers both Atlanta-based Fast-Talk Communications Inc.'s Fast-Talk and BBN's Audio Indexer as plug-ins to its VideoLogger video indexing system. More advanced applications could eventually integrate call center logs with sales activity and other customer relationship management data, analysts say.
But audiomining hasn't worked in every case. Ted Ryan, manager of collections development at Atlanta-based The Coca-Cola Co., says he wanted to use it to index television commercials last year, but "the voice-overs clashed with the music." With an accuracy rate of just 15%, he turned to manual transcriptions.
Coca-Cola also tried using audio indexing of meetings. "Our chief executive [at the time] was Cuban. When we ran it with executive speeches, it came up with gobbledygook," Ryan says.
Nonetheless, he says he's interested in testing the latest tools to index radio advertisements. And accuracy continues to improve, says Kubala, adding that he expects the word error rates for nonbroadcast audio to drop dramatically during the next three years.



Jump to comments

Software Development

Additional Resources

WHITE PAPER
Approximately 60 percent of data migration projects overrun time or budget, while some fail completely. Download this white paper, "Enhancing Your Chance for Successful Data Migration," to learn the critical steps you need to take to execute a data migration project with minimum cost and risk to your business.
WHITE PAPER
Read the Gartner research note to learn why the TCO of a server-based computing deployment used to deliver all applications to users is around 50% lower than that of an unmanaged desktop deployment.
WHITE PAPER
Economic downturns have a tendency to accelerate emerging technologies, boost the adoption of effective solutions, and punish solutions that are not cost competitive or that are out of synch with industry trends. This IDC White Paper presents the results of an IDC survey of 330 companies in Western Europe, Asia/Pacific and the Americas that measures the receptiveness to Linux and takes into consideration changing views driven by the disruptive economic environment that businesses face today.

White Papers & Webcasts

IBM Migration Factory: A smooth transition to new technology
Find out how to migrate your applications smoothly over to IBM.  

Effectively Implementing Datacenter Automation
Effectively select and deploy the best datacenter automation solution today!

Natural User Interface for Enterprise Applications
Download this Complimentary White Paper! Provided by Workday.  

Aligning IT to Business: The Rising Importance of Application Delivery Networks
Application Delivery Networking (ADN) will play a vital role in helping enterprises incorporate strategic technologies to achieve business initiatives.

Moving Beyond Monolithic - What's Next for Enterprise Application Architectures?
Download this Complimentary White Paper! Provided by Workday.  

Total Cost of Ownership of Server Computing Vs. PCs
Read the Gartner research note to learn why the TCO of a server-based computing deployment used to deliver all applications to users is...  

Mitigate Risk, Lower Costs and Improve Network Efficiency
Create a stable IP network that not only meets today's challenges, but is flexible enough to also meet future demands.