|
|
Designing Web Pages to be Speech EnabledSpeech Enabled Web BrowsesEvery web browser has companion applications that "speech enable" it, allowing users to speak the name of a link and have the browser jump to that link as though it were clicked. The speech functionality will be automatic, requiring no special effort from the web-page designer for recognition to work satisfactorily. However, although speech recognition will work automatically, a web designer can use a little bit of care and get speech recognition to work even better. To understand what a web designer should do to make his/her pages work well with speech, it's necessary to describe what speech recognition needs to work, and how the web browses automatically speech-enable a page. For speech recognition to work, it must know what phrases a user can speak, such as "Go to Microsoft's Web Page" or "Tell me about macaws". A speech recognizer can listen for several hundred phrases at a time, and new phrases can be loaded and unloaded at any time. For a the computer to recognize a user's speech, the user must speak the full phrase exactly as it is written, without having words inserted or deleted. If the user says a phrase not on the list, the recognizer returns an "unrecognized" and the user has to rephrase his/her command. If the user said "Go to Microsoft's Page" then the phrase, "Go to Microsoft's Web Page" wouldn't be properly recognized. Speech enabled web browsers tell the speech recognizer to listen for all the links on a page. They use the same string as is displayed to the user. If the user speaks the full name then the recognizer will hear it, and the browser will jump to that link.
Fine-tuning a web page for speechHere's a simple rule to fine-tune a web-page to work with speech-enabled browser: All of the link names must be "speakable". A link named "http://www.Microsoft.com" is not very speakable because it has to be spoken as "h tee tee pee double-you double-you double-you dot Microsoft dot com". Try saying that. Also, the user doesn't know if they should speak the "h tee tee pee" part. If the link were simply "Go to Microsoft", the user could just speak that. Now that you know the general principle, here are some tips and tricks: Don't have more than 100 speakable itemsSpeech recognition accuracy and speed will deteriorate if there are more than a hundred (approximately) speakable items on a web page. Don't give speakable items the same nameIf two or more links have the same name then the computer won't be able to tell which link the user meant. Don't give speakable items similar sounding namesSpeech recognition has a hard time hearing the differences between similar sounding phrases. Example, speech recognition will confuse "Send mail to Kim" and "Send mail to Jim" easily. The more different the phrases sound, the better the accuracy will be. Make speakable items two to four words longIf a speakable item is just one word long then it's likely to sound similar to another item. If it's two or more words long then the phrases will sound different, and recognition accuracy will be higher. However, if a phrase is too long, people won't want to speak it. Don't break a link into multiple linesSome web pages will break a link into several lines, like this: Or, the page will do "cool" capitalization like this: Neither of these produce very speakable results. In the first case the user has to say "My favorite link", "to my favorite", or "spot". In the second case, the user would have to speak "M", "y", "F", "avorite", "L", or "ink" to get a recognition. Don't put ambiguous symbols and numbers in linksA link named "6/23" presents a problem because users are unsure if they need to speak, "six slash twenty three", "six twenty three", "six twenty thirds", or "June twenty third". Graphics should be speakableIn the case of graphics, you can tell what the browser expects to hear by:
Users will speak the text in the graphic. If the text in the tool-tip doesn't match the computer isn't going to understand the user. Make the title speakableAll web pages have a title that is used for the "favorites" list by browser applications. The Microsoft Voice application allows user to say "Go to <link>", where <link> is any of the names on the favorites list. If the page has a lengthy title or one with ambiguous symbols, then the user will have a difficult time speaking it.
ActiveX Control DesignActiveX controls should comply with the ActiveX Accessibility standard. Any ActiveX control that supports this will automatically have speakable items.
Using the Speech APIs through Visual Basic or JavaIf you want even more speech recognition (or text to speech) than the browser will automatically provide, then you can use ActiveX in Visual-Basic Script or Java Script. |
Send mail to askazad@hotmail.com with questions or comments about this web site.
|