HTML5 may help Web pages talk, listen
IDG News Service - Sometime in the near future, users might not only read Web pages but hold conversations with them as well, at least if a new activity group in the W3C (World Wide Web Consortium) bears fruit.
The W3C is investigating the possibility of incorporating voice recognition and speech synthesis interfaces within Web pages. A new incubator group will file a report a year from now summarizing the feasibility of adding voice and speech features into HTML, the W3C's standard for rendering Web pages.
AT&T, Google, Microsoft and the Mozilla Foundation, among others, all have engineers participating in this effort.
The human voice and the Web are not strangers: Google includes a voice-based Web search app in its Android smartphone operating system and Microsoft promises robust voice-driven features in its upcoming Windows Phone 7.
The HTML Speech Incubator Group is studying the feasibility of developing a standard Web interface for both speech recognition and synthesis, said group chair Dan Burnett, who is also director of speech technologies and standards at voice response system provider Voxeo.
Such an interface could be used across multiple browsers. Using built-in or plug-in voice recognition and speech synthesis engines, browsers could read pages aloud or let users fill out Web forms by voice.
While this work may overlap with another voice-based W3C effort, VoiceXML, the two efforts are somewhat different, Burnett said. VoiceXML wouldn't work very well for the Web, given that it was primarily designed for voice-driven applications, such as telephone-based voice response systems, where it is widely used. The voice capabilities added to HTML would, like HTML itself, be stateless, meaning they would not require a dedicated session with the user.
Burnett noted that while the report would discuss the feasibility of establishing a set of interfaces, the work of developing the interfaces themselves, should they be warranted, would be taken on by another W3C group, such as the HTML Working Group.
The W3C has been busy with speech technologies on a number of other fronts as well. The organization recently released version 3.0 of VoiceXML. In this new version, the working group added semantic descriptions of the features and organized the functionality into modules.
The W3C also plans to release version 1.1 of SSML (the Speech Synthesis Markup Language) shortly. SSML is often used in conjunction with VoiceXML, and the new version will add support for Asian languages and give developers more flexibility with voice selection and with handling content in unexpected languages.
Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com.


