Search Engines Break the Sound Barrier
Computerworld - Do your telemarketers consistently make legally required disclaimers when selling securities? If your firm records its telemarketing calls, IT could set up audio mining software to let management search audio file archives to quickly find the answer.
Emerging audio mining tools, also called audio indexing or audio search software, offer speech processing and search technologies in a single package. The speech engine creates an index that includes a time and date stamp for each spoken word or phoneme in an audio or video file. The search engine then uses that index to allow rapid identification and playback of specific passages. The software may also apply metatags that identify the speakers or the subject of a given passage.
The speech-processing accuracy of speech-to-text engines, traditionally used to index high-quality broadcast audio, has advanced to the point where vendors are introducing new packages for indexing more informal conversations, ranging from corporate meetings to training videos and even help desk telephone conversations.
"[The technology] seems to have passed the threshold of usability," says William Meisel, president of TMA Associates, a speech-recognition consulting and market research firm in Tarzana, Calif.
Unlike speech-to-text packages, which can be trained for individual users, audio indexing products are speaker-independent. They also rely on large, language-specific vocabulary dictionaries, as well as domain models that may optimize for the type of conversation (e.g., telephone) or industry (e.g., health care). While the newest products can process audio at or faster than real time with an accuracy sufficient for searching, the output text isn't a readable transcript, cautions Jackie Fenn, an analyst at Stamford, Conn.-based Gartner Inc. And as new companies, products and terms come into use, users must update their systems regularly or face what Francis Kubala, division scientist at Cambridge, Mass.-based BBN Technologies, calls "the out-of-vocabulary problem."
Audio mining's most compelling fit may be for applications where a searchable index can replace the need for transcription. In contrast, data mining of audio content for marketing purposes is "a little bit of an evangelistic sell" at this point, Meisel says.
"The call center is a little tougher, because you may or may not discover something [with audio mining]," explains Fenn.
The technology's greatest value may be derived from embedding it in other applications. San Mateo, Calif.-based Virage Inc., for example, offers both Atlanta-based Fast-Talk Communications Inc.'s Fast-Talk and BBN's Audio Indexer as plug-ins to its VideoLogger video indexing system. More advanced applications could eventually integrate call center logs with sales activity and other customer relationship management data, analysts say.
But audiomining hasn't worked in every case. Ted Ryan, manager of collections development at Atlanta-based The Coca-Cola Co., says he wanted to use it to index television commercials last year, but "the voice-overs clashed with the music." With an accuracy rate of just 15%, he turned to manual transcriptions.
Coca-Cola also tried using audio indexing of meetings. "Our chief executive [at the time] was Cuban. When we ran it with executive speeches, it came up with gobbledygook," Ryan says.
Nonetheless, he says he's interested in testing the latest tools to index radio advertisements. And accuracy continues to improve, says Kubala, adding that he expects the word error rates for nonbroadcast audio to drop dramatically during the next three years.
Read more about App Development in Computerworld's App Development Topic Center.



- Excel 2010 Cheat Sheet
- Register for this Computerworld Insider Cheat Sheet and gain access to hundreds of premium content articles, guides, product reviews and more.
- The Keys to Distributed & Agile Application Development
- How leading firms are winning with strategies for efficient application development, without relying on co-location.
- Overcome Top 7 Admin Challenges of Active Directory
- As Active Directory's role in the enterprise has drastically increased, so has the need to secure the data. Gain insight on creating repeatable,...
- Insiders Can Ruin Your Company. Take Action.
- Did you know that 80 percent of threats to an organization come from the inside? The threat from insiders is often overlooked in...
- Top Solutions and Tools to Prevent Devastating Malware
- Custom malware frequently goes undetected. According to Forrester Research, the best way to reduce risk of breach is to deploy file integrity monitoring...
- Streamline Compliance and Increase ROI
- Streamline, simplify, and automate compliance related activities; especially those that impact multiple business units. This white paper from NetIQ, outlines solutions that will... All App Development White Papers
- Reduced TCO for Communications Applications with New Oracle SPARC Servers
- In this webcast learn how Oracle's new SPARC T4 servers and SPARC Supercluster deliver the security, performance, and scalability required for 4G network...
- Optimizing Networks for the Cloud
- Join guest speaker, Rohit Mehra, IDC Director of Enterprise Communications Infrastructure, to explore current trends, discuss best practices for optimizing Data Center and...
- Apps QuickStart Series Part 2: Designing and Deploying SQL Server on VMware vSphere
- Download this webcast to learn about the design considerations for virtualizing SQL workloads, performance and scalability information and high-availability options, as well as...
- Apps QuickStart Series Part 1: Designing and Deploying Exchange 2010 on VMware vSphere
- Download this webcast to learn the virtual hardware design considerations for Exchange 2010, deployment using the building block approach, options for high-availability and...
- Customer Spotlight: How IPC The Hospitalist Company Implemented Oracle on VMware
- Have you been looking to hear about customer's experiences with the new VMware vCenter Site Recovery Manager product? View this webcast to learn... All App Development Webcasts