Advances in speech recognition software are extending the utility of traditional applications -- and paving the way for broader use.
Computerworld - The velvety voice of that nice young woman on the other end of the phone is really just digits on a disk somewhere at Verizon Communications Inc., but "she" remembers that I spoke to her a few moments earlier, before I was interrupted by another call. "I apologize if I ask some questions you already answered," the voice says. She sounds genuinely contrite.
But the virtual telephone-repair lady is just getting warmed up. "I'll test your line from here," she intones. "OK, I got the line test started. It could take up to a minute. I'll also check to see if anything's changed on the line since you last called." While the test runs, she asks me for more information about my telephone problem, and she seems to understand my every response.
Presently she says, "The line test is finished now. Unfortunately, it couldn't determine if the problem is in Verizon's network or with your equipment, so we need to dispatch a technician. ... Here we are -- I've picked up all of our technicians' current schedules. The earliest we can schedule it is on Thursday, June 3, between 8 a.m. and 6 p.m. Can someone give access to the premises at that time?" The call is soon completed, and on June 3, so is the repair.
Computerized speech has come a long way in 20 years. As Verizon's system illustrates, the technology has become smarter, easier to use and more integrated with other applications. Such technical advances, plus product introductions that facilitate the deployment of the technology by mainstream developers, are enabling new uses for automated speech systems.
A Long and Winding Road
Research in automated speech recognition goes back to the 1930s, but serious commercialization didn't begin until 50 years later. In 1988, Dragon Systems Inc. demonstrated a PC-based speech recognition system with an 8,000-word vocabulary. Users had to speak slowly and clearly. One. Word. At. A. Time.
Meanwhile, corporations began rolling out interactive voice response (IVR) systems. The earlier ones -- indeed, most in use today -- are menu-driven: "For your fund balance, say or press 'one.'" A few advanced systems are more conversational: "What city are you departing from?" Despite the steady advancements to bigger vocabularies, lower error rates and more natural interfaces, however, speech products have remained specialized