Category Archives: Speech Recognition

Tip: Naming a mouse click


We're getting some good feedback from people who are speeding themselves up in all kinds of situations using UC's naming a mouse click ability. We didn't anticipate some of the ways people are using this ability. I'll detail these in a future post.

Here's how to name a mouse click:

1. Position the mouse using the mouse rulers commands on something you regularly click and can't get to any other way, for instance the Indent button in Google documents, e.g. "50 By 10"

2. When you've got the mouse exactly where you want it, say "Add Touch" to call up the UC list dialog box with the coordinates entered

3. Add a name for the coordinates, e.g. "Indent"

4. say "Enter" to put the new command on the Touch list (at this point you can repeat steps 1-4 to add more commands).

5. say "Window Close · Yes" to close the UC List dialog box, and restart NatSpeak

 

Now you can say "Indent Touch" to click the Google Documents Indent button.

 

Things really heat up when you use the naming a mouse click ability to click twice using a single speech command.

Let me know how you're using the mouse click ability by commenting here, on the Web site comment form (say "UC Make"), or sending e-mail to info@ this website address.

Tip: make sure to export your lists to back them up (say "UC List Export").

Note: some early prerelease copies of Utter Command don't contain the naming the mouse click utility. It'll be available in the general release, and all prerelease customers will get a copy of the general release when it comes out. If you're a prerelease customer and would like an upgrade before the general release, please contact us.

Tip: Speedy Attaching with the Message Paste command


Say you’re in Windows Explorer or an Open dialog box where the files you want to add to a future e-mail message reside. You could open up your e-mail program and use the attach utility to attach files, or, since you’re already right there with the files, you could speed things up this way:

To attach files to a new e-mail message starting from Windows Explorer or an open dialog box:

  1. select the files you want to attach, by saying, for instance “3 Downs” (see UC Lesson 5.5 for instructions on selecting noncontiguous files by speech)
  2. say “This Copy”
  3. say “Thunderbird Message Paste” (or Outlook, Express or Eudora, depending on the e-mail you use)


And there you have it, a new e-mail message with files attached.


Keyboard shortcuts: naming, sharing and seeing

3

The way we control computer programs is fairly inefficient.

Keyboard shortcuts are underused in favor of using a mouse to click through menus. This is short-term practical — it takes less thought to browse through menus than to remember a keyboard shortcut. But it’s not very productive.

Look at the whole picture and you find good reasons to make the less productive choice. There are barriers to using keyboard shortcuts. Help and learning tools for keyboard shortcuts are scant at best. And inconsistencies across programs make the learning task larger.

So how do we improve things?

We can (continue to) encourage software makers to improve keyboard shortcut documentation and consistency. This is important, but it’s not going to change the world.

I think things would improve greatly given universal abilities to

1. name our own keyboard shortcuts — this currently exists in some but not all programs
2. share sets of shortcuts
3. see all shortcuts for a given application, and even compare shortcuts across applications

This would provide a good mental map of functionality — both of individual programs and across the landscape of suites.

It would make efficient functionality accessible across the board. It would enable individuals, organizations, departments or corporations to make applications more efficient and even standardize shortcuts across applications. The ability to share shortcuts would put a lot of brains on the problem and make the process efficient and evolutionary.

Given a map of all shortcuts, you could make things even better by allowing the user to mark the map — maybe using color labels.

Tools like this are the equivalent of a downhill groove for water– it would make it easy to be more efficient.

Keyboard shortcuts are least standard and most lacking in Internet applications. I’m thinking an ability like this could be built into or be an add-on to a browser.

And in addition to increasing productivity across the board, keyboard shortcuts are central to accessibility. The blind community relies on keyboard shortcuts. And speech commands often tap keyboard shortcuts — they’re often the hooks people use to write custom macros, and Utter Command allows you to speak and combine keys including keyboard shortcuts.

So who’s going to step up to the plate?

Friday Tip: Cutting and pasting from the Web


I’ve been asking people two questions lately:

1. What tasks do you do the most on your computer?

2. What tasks are frustrating to do by speech?

Cutting and pasting from the Web comes up frequently in both categories, and the frustration lies in selecting text from the Web.

When it comes to selecting text from the Web there’s good news and not so good news. The good news is that it’s easy to select in Firefox. The not so good news is selecting is not as easy as it should be in Internet Explorer.

Selecting text in Firefox

In Firefox you can use the same selection commands you’d use in any program. There’s a trick, however.

The Caret Browsing hotkey — “Function 7” — toggles text navigation commands like “3 Down” and “Go Home” from acting on the text to acting on the scroll bar. If you’re clicked into a text field, you’ll see the cursor appear and disappear when you say “Function 7”. When you can see the cursor, caret browsing is on and you should be able to move the cursor in text.

To see this say

“New York Times Site” to bring up your default browser (should be Firefox) to the New York Times site

“10 By 30” to click somewhere in a text field — adjust the numbers of necessary

“2 Down” to move the scroll bar up and down (by default)

“Function 7” to toggle Caret Browsing hotkey

“2 Down” to move the insertion point in text

Once you have the insertion point where you want it, you can use regular selection commands like “1-100 Lefts/Rights/Befores/Afters/Ups/Downs/Lines/Line Ups…” commands to select precisely.

So if you wanted to move the insertion point to the beginning of a line a couple of lines down, then select the next three lines plus the first two words on a fourth line, you’d say, “2 Down Home”, “3 Lines · 2 Afters”.

Selecting text in Internet Explorer

In Internet Explorer, you can select text using the Shift mouse method (you can use this in Firefox too, but the above method is generally easier).

Say, for instance, “10 By 40” to click at the beginning of the selection, then, for instance, “40 By 60 Shift Touch” to select from the beginning coordinates to the new coordinates. If you want to adjust the selection, say the second command again with different numbers.

Copying and pasting

Once you have something selected in either program you can copy and paste by saying a copy command like “Copy to Word” or “Copy to Window 1”.

Your tasks

I’d like to know what tasks you do the most on your computer and what tasks are frustrating using speech — let me know and chances are I’ll find a way to speed you up.

Gravity on the Web

3

Computer commands of all kinds — speech, keyboard and mouse — are much easier to use when they’re consistent across programs.

At the base level, it’s important that common elements like drop-down menus act the same. You control drop-down menus without thinking — click on an element or use the Left, Right, Up, Down and Enter keys.

Consistent commands are the real-world equivalent of having the same gravity in every room, or keys turning the same way to unlock.

Web applications are looking more and more like standard computer programs, but sometimes the elements that look familiar don’t act the way we’re used to. Drop-down menus usually respond in a familiar way to the mouse, but often don’t respond to the Up, Down and Enter keys.

But perhaps things are getting better.

The first drop-down menus to show up on Google Docs didn’t respond to Left, Right, Up, Down and Enter. Then most of the folder-view drop-down menus were arrow key/Enter enabled, but not document menus. A few months ago document menus changed from looking tab-like to looking more menu like, but still didn’t respond to arrow keys and Enter. Then, sometime in the last few weeks, the Doc menus were arrow key/Enter enabled (the change didn’t show up on the update notice).

The keyboard shortcuts enable better speech navigation as well. I can say, for instance, “3 Down Enter” to choose an item in an open menu, “3 Down 2 Right Enter” to choose a color on the open color menu, or “7 Right Wait 3” to take a three-second peak at each of the seven successive menus starting with the file menu open.

This is a great trend.

Now all we need is keyboard shortcuts to open the menus in the first place. We also need the same kind of control in all Web applications, including Google spreadsheets.

Friday Tip: Remembering boilerplate and vocabulary commands

3

NatSpeak boilerplate Text and Graphics commands allow you to insert any text or graphics into a document using a single speech command. These commands can be very powerful — they’re good for adding text and graphics that you use often, such as your address or a set of directions.

The NatSpeak Vocabulary editor allows you to add words or phrases to your vocabulary that have different spoken and written forms. This allows you to make words like your email address easily pronounceable.

The key to using boilerplate and vocabulary commands is being able to remember them.

There are two ways to make these types of commands easy to remember:

1. Word them consistently

2. Make them easy to look up

I find the easiest way to remember boilerplate Text and Graphics commands is to simply say the first part of the text you’re inserting followed by “Full”. So “Redstart Full” prints the full name and address of Redstart Systems. If you have two different versions of the address, add a number. “Redstart Full 1” prints the same address in a different format.

You can use the Utter Command Clipboard facility to make anything easy to look up. Once you name your Text and Graphics command say “Line Copy To” followed by the name of the UC Clipboard file and you’ve got it recorded. For example, to keep your boilerplate commands in “UC List 1” say “Line Copy To List 1”.

Now any time you want to consult your list of commands say “List 1 File”. You can also print it out.

I also use the start-to-say method for vocabulary words that have different written and spoken forms. I’ve put my Redstart email address in as a vocabulary word with the spoken form “Kim at Red” and my Gmail address in as a vocabulary word with the spoken form “Kim at G Mail” (in address commands I use “Kim” whether or not the actual address is just Kim or something longer).

One caution in using vocabulary in this way — make sure commands are at least two words and make sure the two words are not a common phrase that you’d want to say as is. If you need to, use the “Full” method above to avoid this problem. Also make sure to save your user after adding vocabulary words.

If you wish, keep vocabulary words that have different written and spoken forms on the same list as your boilerplate commands.

The difference between boilerplate commands and written/spoken vocabulary words is a block of boilerplate is returned exactly as written, while vocabulary commands are treated like words, with appropriate spacing before and after them.

UC Commands Tip: say “NatSpeak” followed by the first one or two words in a NatSpeak dialog box title to call up that dialog box.

Commands for the dialog boxes mentioned above:

“NatSpeak My Commands” calls up the NatSpeak My Commands dialog box where you can write a boilerplate Text and Graphics macro

“NatSpeak Vocabulary” calls up the NatSpeak Vocabulary Editor dialog box

Friday Tip: Getting back to where you were on the Web

Question: when you’re looking for a website you’ve been to before, but haven’t put it on your UC List or in your bookmarks, what’s the fastest way to get back to it?

I’ve got a two-step solution for you, assuming you’re starting in an open browser. It works the same in Firefox and Internet Explorer.

1. Combine the command that puts the cursor in the address bar with the first two letters of whatever you’re looking for. If I’m looking for the Evite site, for instance for instance, I would say “Under Delta e v”

2. If the first choice is what you want say “Down Enter” to finish the job, if it’s the second choice “2 Down Enter” etc.

Friday Tip: Quick definitions

The second fastest way to look up a word in a dictionary using Utter Command is to select the word, then say “This Dictionary Search”. This command looks up the selection in dictionary.com and returns the results in a browser page.

The fastest way is to combine selecting the word and searching. For instance, “Word Dictionary Search”.

You can use these commands whether or not a browser is open.

Happy searching.

Solving the page down problem

Whenever I talk to people who use speech commands to control a computer I encourage them to complain. Something that frequently comes up is it’s a drag having to say “page down” so much.

We’ve come up with several ways to diminish the drag:

1. Several screens at once

First, “Page” is a back-of the-mouth word, which is more difficult to say than words that only use sounds that originate in the front of the mouth. This isn’t a problem for commands you don’t use frequently, but looms large when you have to repeat something over and over again.

And when you say “Page Down”, you’re really moving by screen, not by page. This is fortunate, because “Screen” is easier to say than “page”.

Using Utter Command you can say “Page Down” and “Page Up” to hit the page up and page down keys, but you can also say “Screen Down” and “Screen Up”. And you can move multiple screens: “2 Screen Down”, “5 Screen Up”

2. Right to the point

You can also go to a given screen. “Screen 3”, for instance, jumps you right to the third screen of information in a document.

And in programs whose Find facilities recognize page numbers, including pdf’s, you can go right to a given page by saying, for instance, “Find Page 22”. You can try this out on a UC lesson document: “UC Lesson 1”.

3. Wait

It’s still tedious to say “Screen Down” every couple of seconds when you want to glance quickly at subsequent pages. Try this: “3 Screen Down Wait 5”. This moves down a screen, waits 5 seconds, moves down another screen, waits 5 seconds, then moves down another screen.

4. The right tool for the job

It’s also important to look at exactly why you’re going through a document screen by screen. Often you’re looking through pages for a certain section. In this case the screen-by-screen facility isn’t the right tool for the job, but you may be using it because usually it’s the best tool available.

If you’re looking through a document that has numbers, letters or symbols to differentiate sections you can use the UC Keywords facility go directly to any of these. To see what I mean say “Find 1 Period”, “Find 3 Period” in this document. Now picture a longer document with more and longer sections, and a section outline along these lines:

1. Speech Command Problems
1.1 Page Down
1.2 Page Down Solution

2. Speech Command

You could say, for example “Find 1 Period”, “Find 1 Point 1”, “Find 1 Point 2” and “Find 2 period” to jump among these sections.

Using the UC Keyword list you can use any section organization scheme you want — numbers, letters, numbers and letters (1a., 1b….) or heading words themselves (“Find Introduction”, “Find Summary”). Sometimes I put tildas (~) at key points in a document so I can jump to those points (“Find Tilde”). I also use the word “PLACEHOLDER” this way (“Find Placeholder”).

You can also use “Wait” with keywords. I use this one to scan a document for placeholders: “Find Placeholder Wait 2 Repeat 5”.