Category Archives: Patch on Speech

Tip: Scrolling by speech

I’ve gotten several questions lately about scrolling by speech, which is key to comfortable hands-free operation. Utter Command gives you several ways to scroll by speech. The best way depends on the situation.

To quickly look something over, use the speech command that allows you to see successive screens with a pause between changes. For example, “3 Screen Down Wait” moves down a screen, then after a default wait of two seconds moves down another screen, then two seconds later moves down a third screen. If you want a longer wait, add a specific number of seconds, e.g. “3 Screen Down Wait 5” (UC Lesson 7.23). 

To directly control the scroll bar by speech, place the mouse pointer on the scroll bar using a command like “99 by 10” and use the vertical drag command to move the scroll bar to a given point. For example “Drag By 50” moves the scroll bar to the middle. Then, if you then want to go three quarters of the way down say “Drag By 75”. You can also control the scroll bar incrementally, for instance, “Drag 3 Down” (UC Lesson 4.2, 4.5).

In some programs, including some versions of Word, the cursor moves to the page you scrolled to when you use an arrow command like “5 Down”. And in some programs, like Firefox, you can say a link number to move the cursor. In these cases you can leave the arrow parked on the scroll bar, edit the text, than say another drag command to move the scrollbar without having to move the mouse to the scrollbar again. In some programs, including WordPad, you have to move the cursor to the new page by clicking. In this case, keep the right ruler open on your screen so you can easily click back to the scroll bar when you’re ready to scroll again.

– If you use this method a lot, try naming a mouse click to move the arrow to the scroll bar at the home position (UC Lesson 10.24).

– You can also use this method to control horizontal scrollbars — use the “Drag 1-100 By” command.

– If you’re a ZoomText user, you can use this method even when the scrollbar is not showing on the screen.

Tell me what you think about scrolling by speech – reply here or let me know at info@ this website address.

Speeding Web navigation: single-step deep menu access

Utter Command speech-enables the Firefox Mouseless Browsing extension, which puts a unique number on every clickable item on a Web page. UC lets you click every item on a page, including links, by saying the number plus the word “Go”, for instance “7 Go”.

This works pretty well, but it gets even better when you discover that an item doesn’t have to be visible for you to click it.

This lets you click items that are off-screen. Better yet, it lets you click items on drop-down menus without having to first drop-down the menu. This lets you use a single step to get to any menu item in a Web application once you know the number.

For instance, to insert a horizontal line in a Google Document you can click the “Insert” menu, then click the menu item “Horizontal Line”. There’s no direct keyboard shortcut for horizontal line, so it’s usually a two-step task.

Using numbers you can say “7 Go” to drop-down the Insert menu, then “84 Go” to click  Horizontal Line. But if, like me, you add horizontal lines often enough to remember the number, you can cut straight to the chase and say “84 Go” anytime you want a horizontal line.

Tip: Not my mistake

One thing that the Dragon NaturallySpeaking speech engine could do better is hyphenation. I don’t mind so much when I say something that should be hyphenated and it’s not. I can always say the NaturallySpeaking command “hyphenate that” or the UC command “1-10 Hyphenate” after the fact if the NaturallySpeaking engine leaves out the hyphenation. I can also specify hyphenation when I want it, e.g. “on hyphen the hyphen fly” will type “on-the-fly”.

If I have something that’s not hyphenated and should be, it’s either a mistake or something I accidentally left out.

But if NaturallySpeaking puts in hyphenation where I don’t want it, there are two problems. First, there’s not an easy way to remove hyphenation after the fact — I have to select the phrase, then say it again in two phrases so it won’t be hyphenated, which is 3 steps. Second, there’s no way to specify no hyphenation.

If NaturallySpeaking over-hyphenates and I don’t notice, it looks like I’m consciously adding hyphens where they shouldn’t be. There’s nothing more annoying than having another entity introduce mistakes into your work.

Because the minuses of over-hyphenation are larger than the minuses of not hyphenating enough, when I see a phrase hyphenated when it’s not supposed to be I remove the hyphenated version from Natspeak Vocabulary so it won’t happen again.

For instance, I removed “follow-up”, which I often put as a stand-alone tag in my todo list. It’s a clunky workaround, but it’ll have to do until speech engines get better at analyzing hyphenation.

To remove a vocabulary word say “NatSpeak Vocabulary”, say the or phrase you want to delete, “Under d c” to delete and close the window, and “Enter” to confirm the change.

I think Nuance could mitigate this problem with a pair of in-line commands: “no-hyphen that” would remove hyphenation in the last phrase and “no-hyphen” would specify that something not be hyphenated, parallel to the “no-caps” command. I’m adding this to the Nuance wish list.

Tip: What to do when dictation isn't recognized as text

Occasionally the Dragon NaturallySpeaking speech engine will get mixed up about whether or not the program or field in focus is something you should be able to type text into. When this happens you’ll see lots of question marks in the recognition box.

The problem is usually easy to fix — move the focus out of whatever program this is happening in, then back in. Here’s a quick way to do that — the UC command “Notepad Open · Notepad Close”.

Tip: What to do when dictation isn’t recognized as text

Occasionally the Dragon NaturallySpeaking speech engine will get mixed up about whether or not the program or field in focus is something you should be able to type text into. When this happens you’ll see lots of question marks in the recognition box.

The problem is usually easy to fix — move the focus out of whatever program this is happening in, then back in. Here’s a quick way to do that — the UC command “Notepad Open · Notepad Close”.

Tip: Faster correction

Here’s a very quick tip that’ll speed the correction process.

If you start spelling in the Dragon NaturallySpeaking Spell Correction dialog box, but realize too late that you spoke too soon because the correct answer was one of the choices after all, all is not lost. Say the UC command “Line Delete” to get the original choices back.

Keep it simple: Keys don't need pseudonyms

I have a pet peeve in the Keep it Simple, Stupid department.

Keyboard shortcut labels containing Enter should say “Enter”, which is what’s written on the key, rather than the antiquated “Return”. Having the label match the key is better both for people who pay keys and people who speak keys.

Thunderbird/File has an example of what not to do — two labels containing “Return”. It’s a small thing, but using any amount of brainpower for this type of translation is unnecessary.

What are your speech pet peeves? Tell me about them – reply here or let me know at info@ this website address.

Keep it simple: Keys don’t need pseudonyms

I have a pet peeve in the Keep it Simple, Stupid department.

Keyboard shortcut labels containing Enter should say “Enter”, which is what’s written on the key, rather than the antiquated “Return”. Having the label match the key is better both for people who pay keys and people who speak keys.

Thunderbird/File has an example of what not to do — two labels containing “Return”. It’s a small thing, but using any amount of brainpower for this type of translation is unnecessary.

What are your speech pet peeves? Tell me about them – reply here or let me know at info@ this website address.

Tip: Link phrases and website paths


You’re using the Utter Command Site list utility to go directly to websites, e.g. “Twitter Site”. And you’re comfortable saying the numbers that appear next to links to get around websites, e.g. “11 Go”. What’s next?

Is there a place where you find yourself constantly using two link numbers in a row? Try speeding things up by saying them as one phrase. For instance, in Google documents, I say “7 Go” to drop-down the Insert menu and “84 Go” to choose Horizontal Line. After a few times you don’t have to look: say “7 Go 84 Go” to do it all at once.

Here’s another tip that starts with a riddle: how are the following two tasks connected?

1. Going directly to a website but not in your default browser

2. Emailing a website link to a friend

Answer: you can use the same command to do either one.

Say you’re in a browser that’s not your default browser, and you want to go to the Twitter website. Put the cursor in the address bar (“Go Address”). Then paste the path of any website that’s on your Site list by adding “Path” to whatever you’d normally say to go to the site, e.g. “Twitter Site Path”.

Same goes for when you want to drop a link into an e-mail message. If the site in question is on your Site list, there’s no need to go to the site and cut and paste — use the Path command to paste the Web address into your message.


Have you found a good application for double Link numbers? Tell me about them – reply here or let me know at info@ this website address.

Tip: Help on the NaturallySpeaking utilities


A few weeks ago the folks at Nuance posted a Dragon NaturallySpeaking 10 user workbook. It’s an excellent resource. It includes detailed instructions on NaturallySpeaking speech engine utilities that will make your speech experience better.

Here are my favorite parts:

The instructions for creating a user profile (page 1) explain important concepts like choosing the right dictation source. If you accidentally choose the wrong dictation source, your accuracy will not be as good.

The Vocabulary dialog box (page 8) allows you to add custom vocabulary including phrases, and import custom vocabulary from existing documents. I make sure my users have “Utter Command” and “Redstart Systems” as phrases. I have a usability complaint about the Vocabulary dialog box, however. It’s difficult to find in the menus because the menu name doesn’t match the dialog box name: NaturallySpeaking/Words/View-Edit. I think the label should be Vocabulary Editor instead.

The Formatting dialog box (page 23) allows you to control automatic formatting of special text like numbers. This section explains what you can control and how to control it.

The My Commands dialog box (page 44) allows you to create Text and Graphics commands. These boilerplate commands are relatively easy to create and can save you a lot of time. You can assign a command like “My Address” to a larger block of text, complete with line breaks and formatting.

The improving accuracy section (page 50) includes instructions for the Train Words, Acoustic Settings and Acoustic and Language Model Optimizer dialog boxes.

Here are some resources that have to do with using NatSpeak utilities with Utter Command:

In the Utter Command manual we touch on how Utter Command dovetails with NatSpeak Correction, Vocabulary, Recognition and Train Words utilities in UC Lesson 1 (1.3, 1.4, 1.5, 1.11). Also see the “Dragon NaturallySpeaking” section on UC Exchange.