Computerworld - HTML's ability to describe layouts and pages was a major factor in the rise of the World Wide Web. But HTML has a fundamental flaw: It assumes a graphical output display on a computer. Five or 10 years ago, that was the natural and obvious thing to do.
But nowadays people want to be able to access the Web when they're away from their desktops, using phones, pagers, handheld devices and even household appliances. Most of these devices have graphical displays, but the displays are very small at best, and the devices themselves have limited bandwidth, aren't well suited to normal Web browsing and generally lack keyboards for input or control. In business, many areas of customer support have moved to Web-based systems, and there's a real need to make those systems accessible from any telephone, without a computer client or visual display.
In other words, we want to be able to talk to our Web pages and have them talk back to us. This is called voice browsing, and it lets users retrieve information from the Web by means of speech synthesis, prerecorded audio and speech recognition. Voice capability can be added to conventional desktop browsers, and as mobile devices become smaller, voice interaction can provide a more practical alternative to tiny keypads and undersized displays.
The World Wide Web Consortium (W3C) is working to expand access to the Web so that people can interact via keypads, spoken commands, prerecorded speech, synthetic speech and music. In 1998, the W3C sponsored a voice browsing workshop. The next year, it formed a working group whose members included AT&T Corp., British Telecommunications PLC, Lucent Technologies Inc., Philips Electronics NV, IBM, Motorola Inc. and Nokia Corp. The group is working on a set of interrelated XML-based languages and standards for developing speech applications. Called the W3C Speech Interface Framework, this set includes the following:
- VoiceXML 2.0, for defining dialogues and specifying the exchange of data between the user and a speech application.
- VoiceXML 2.1, a small set of additional features, backward-compatible with VoiceXML 2.0, that vendors have already widely implemented.
- Speech Recognition Grammar Specification, for specifying the structure of user input to a speech application.
- Speech Synthesis Markup Language, for specifying just how synthesized speech is rendered to the user -- e.g., the type of voice used and specific pronunciations.
- Semantic Interpretation for Speech Recognition, which defines links between grammar rules and application semantics, so that spoken variations of the same element, such as "Coke" and "Coca-Cola," are treated as equivalent.
- Call Control XML (CCXML), for specifying call control functions.
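The idea behind the Semantic Interpretation entry in the list above, that several spoken variants resolve to one application value, can be illustrated with a minimal Python sketch. This is not the specification's own syntax, which attaches values to grammar rules; the lookup table `CANONICAL_DRINKS` and the function `interpret` are hypothetical names chosen for illustration only.

```python
from typing import Optional

# Hypothetical lookup table: each recognized phrase maps to one canonical
# application value. A real speech platform would attach such values to
# grammar rules rather than use a plain dictionary.
CANONICAL_DRINKS = {
    "coke": "coca-cola",
    "coca-cola": "coca-cola",
    "coca cola": "coca-cola",
}


def interpret(utterance: str) -> Optional[str]:
    """Return the canonical value for a recognized phrase, or None if unknown."""
    return CANONICAL_DRINKS.get(utterance.strip().lower())


if __name__ == "__main__":
    # Both spoken variants resolve to the same application value.
    print(interpret("Coke"))
    print(interpret("Coca-Cola"))
```

The application then only has to handle the canonical value, no matter which variant the caller actually said.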
VoiceXML is the most visible part of this framework, while the other elements are essentially infrastructure. VoiceXML builds on the other specifications to create dialogues that feature synthesized speech, digitized audio, recognition of spoken and dual-tone multifrequency (DTMF, i.e., touch-tone) key input, recording of spoken input and telephony. It also hides many of the complexities of telephony platforms from the application developer.
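To make the notion of a VoiceXML dialogue more concrete, here is a small sketch in Python that embeds an illustrative VoiceXML 2.0 document and reads it with the standard library's XML parser. The drink-ordering dialogue, the constant `VXML_DOC` and the helper `dtmf_options` are all assumptions made for this example; they don't come from any particular vendor platform, and a real application would be served to a voice browser rather than parsed locally.

```python
import xml.etree.ElementTree as ET

# Namespace used by VoiceXML 2.0 documents.
VXML_NS = "http://www.w3.org/2001/vxml"

# Illustrative dialogue (an assumption for this sketch): one form with one
# field that accepts either a spoken choice or a touch-tone key.
VXML_DOC = """\
<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">
  <form id="order">
    <field name="drink">
      <prompt>Would you like coffee or tea? Press 1 for coffee or 2 for tea.</prompt>
      <option dtmf="1" value="coffee">coffee</option>
      <option dtmf="2" value="tea">tea</option>
      <filled>
        <prompt>You chose <value expr="drink"/>.</prompt>
      </filled>
    </field>
  </form>
</vxml>
"""


def dtmf_options(vxml_text: str) -> dict:
    """Map each touch-tone key in the document to the value it selects."""
    root = ET.fromstring(vxml_text)
    ns = {"v": VXML_NS}
    return {
        opt.get("dtmf"): opt.get("value")
        for opt in root.findall(".//v:option", ns)
    }


if __name__ == "__main__":
    print(dtmf_options(VXML_DOC))
```

The prompt is rendered as synthesized speech, and each option can be selected by saying it or by pressing its key, which is the mix of spoken and DTMF input described above.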

