Hands Off: Using Dragon NaturallySpeaking
Professional writer finds it twice as fast as keyboarding -- with some minor annoyances
Computerworld - You decide what you are going to say. You say the words. They appear on the screen. You're done.
That's what writing with speech recognition -- specifically, Dragon NaturallySpeaking Version 9.0 -- amounts to. The lifetime it took to achieve a smooth keyboarding rate of 60 words per minute no longer matters. The skill you effortlessly mastered as a child -- talking -- is all that's required to input text to your computer.
Of course, there's a little more to it than that, but the requirements are trivial compared to, say, a semester of typing class. First, you'll want a fast processor, but everything sold new today will probably suffice. You will want to install all the RAM you can in it -- a gigabyte is a good start. You'll want a fairly quiet place to work, but any place you feel comfortable talking on the phone will probably suffice. And you have to believe that consistent pronunciation is a worthy goal, and not an artificial, elitist imposition.
Is speech recognition finally good enough?
The software installs from two CDs in a conventional manner. The package also comes with a headphone/microphone, and you'll want to make sure it's connected correctly before beginning -- the correct I/O ports are not always obvious.
After it's installed, the software will examine the vocabulary of your "My Documents" folder, and ask you to read at least one short canned passage while it analyzes the way you pronounce the words. This reading is called the enrollment process and only takes a few minutes -- unlike earlier versions of the software in the last decade, where the process took nearly an hour.
Once it's running, Dragon NaturallySpeaking will install a command bar along the top of screen, with a microphone icon and various menu items. To begin dictation, you position the cursor on the screen, just as if you were about to type something. But instead of typing, you click the microphone icon and began speaking.
At this point, accuracy will probably be about 95%. That means there will be about 10 mistakes per double-spaced page. That's well short of perfection, but the mistakes will be correctly spelled words that just happen to be the wrong words. To correct an incorrect word, you select it and say the correct word. In most cases, that suffices. Correcting the text, in other words, takes hardly more time than it takes to proofread it. Inputting that text, meanwhile, happens at conversational speed, which for most people is between 120 and 150 words per minute. (The software claims to be able to handle 160 words a minute.)
By comparison, the average typing speed for an office worker is about 40 to 50 words a minute. My own is about 60 with an accuracy of 93%, so on the whole, I've found that using speech recognition is about twice as fast as typing. Those who type at hunt-and-peck speeds will experience results that are even more dramatic.
(Note: You may have to click twice)
Dragon NaturallySpeaking in action
After the user starts speaking, the yellow "results box" appears near the text insertion point, showing what Dragon thinks it has heard so far. The results will change as the phrase lengthens and Dragon is able to perform further analysis. For instance, it decided that "period" was punctuation in all three sentences.
In the third sentence, it initially placed "to" after "brought" but moments later changed it to "two," successfully analyzing the homonyms. Dragon was less successful with the homonyms in the second sentence, failing to differentiate "which" and "witch."
Dragon types the resulting text after analyzing a phrase, so that what appears on the screen may fall a sentence behind what the speaker is saying (as was the case in this example.) This can be disorienting and users are advised to not watch the screen during dictation.
When finished, the user selected the second "which" and spoke the word "witch" again. Dragon knew enough to respell it as the other homonym.
- 15 Non-Certified IT Skills Growing in Demand
- How 19 Tech Titans Target Healthcare
- Twitter Suffering From Growing Pains (and Facebook Comparisons)
- Agile Comes to Data Integration
- Slideshow: 7 security mistakes people make with their mobile device
- iOS vs. Android: Which is more secure?
- 11 sure signs you've been hacked
- What Datapipe customers need to know about the new PCI DSS 3.0 compliance standard This handy quick reference outlines what PCI DSS 3.0 is, who needs to be compliant and how Alert Logic solutions address the new...
- Defense Throughout the Vulnerability Life Cycle This whitepaper provides insight into how to leverage threat and log management technologies to protect your IT assets throughout their vulnerability life cycle.
- The Critical Role of Support in Your Enterprise Mobility Management Strategy Most business leaders underestimate the importance of tech support when they choose an EMM solution. Here's what to put on your checklist.
- Separating Work and Personal at the Platform Level: How BlackBerry Balance Works BlackBerry® Balance™ separates work from personal on the same mobile device, right at a platform level. Find out how it can work for...
- Live Webcast Best Practices for the Hyperconverged Enterprise Network To the Age of Constant Connectivity and Information overload
- Getting Ready for BlackBerry Enterprise Service 10.2 Find out how BlackBerry® Enterprise Service 10 helps organizations address the full spectrum of EMM challenges, while balancing the needs of both the...
- Containerization Options: How to Choose the Best DLP Solution for Your Organization This webcast outlines a framework for making the right choice when it comes to containerization approaches, along with the pros and cons of... All Networking White Papers | Webcasts