SpeechTEK.com home
Biographical Information
James A. Larson
Vice President
Larson Technical Services

James A. Larson, Ph.D., is co-program chair for the SpeechTEK 2009 Conference, co-chair of the World Wide Web Consortium's Voice Browser Working Group, and author of the home-study guide, The VoiceXML Guide (www.vxmlguide.com). Larson also teaches courses in speech application development at Portland State University and Oregon Institue of Technology.

Conference Sessions By James A. Larson
SpeechTEK 2010
Sunday, August 1, 2010
1:30 p.m. - 4:30 p.m. STKU-1: Introduction to Speech Technologies

Monday, August 2, 2010
2:15 p.m. - 3:00 p.m. B104: New Languages Enable New Experiences

Tuesday, August 3, 2010
1:45 p.m. - 2:30 p.m. C203: Implementing and Deploying Statistical Grammars

SpeechTEK Europe 2010


SpeechTEK 2009
Sunday, August 23 2009
1:30 p.m. - 5:00 p.m. STKU-1: Introduction to Speech Technologies

Tuesday, August 25 2009
3:30 p.m. - 4:15 p.m. D204: Speech in Government

Wednesday, August 26 2009
2:30 p.m. - 3:45 p.m. D304: Panel: Best Practices for Developers

SpeechTEK 2008
Sunday, August 17
1:30 p.m. - 4:30 p.m. STKU-2: Introduction to Speech Technologies

Tuesday, August 19
10:45 a.m. - 11:30 a.m. D201: Securing CCXML and VoiceXML Applications

SpeechTek 2007
Tuesday, August 21
1:30 p.m. - 2:30 p.m. D202: Synthetic Agents that Speak & Listen

Wednesday, August 22
2:00 p.m. - 3:00 p.m. D303: Is Paul English Right?
3:15 p.m. - 4:15 p.m. E304: The Lighter Side of Speech Deployments

Articles By James A. Larson
Social media can incorporate voice to better engage users.
Posted 01 Jul 2010 - July/August 2010 Issue - by
Setting the standard for determining what a caller wants
Posted 05 Mar 2010 - March/April 2010 Issue - by
Who will get the ball rolling on creating the rules?
Posted 01 Nov 2009 - November/December 2009 Issue - by
User property extraction systems pull more information.
Posted 27 Aug 2009 - September 2009 Issue - by
Both allow for greater use of voice on mobile devices
Posted 01 Jun 2009 - June 2009 Issue - by
Technology exists to help the elderly live on their own.
Posted 02 Apr 2009 - March/April 2009 Issue - by
We need to look beyond basic processing to fuller understanding.
Posted 01 Nov 2008 - November/December 2008 Issue - by
Multimodal applications present a new type of user interface.
Posted 22 Aug 2008 - September 2008 Issue - by
The next phases of IVR development centers on multimodality and faster transactions.
Posted 01 Jun 2008 - June 2008 Issue - by
New stars present opportunities for using speech technologies.
Posted 01 Apr 2008 - April 2008 Issue - by
Creating and implementing automatic classifiers is not as easy as it sounds.
Posted 25 Jan 2008 - January/February 2008 Issue - by
Technology is available to handle many of the more mundane translation tasks
Posted 01 Oct 2007 - October 2007 Issue - by
Speech vendors should support SISR for easier porting of grammars and applications between platforms
Posted 09 Jul 2007 - July/August 2007 Issue - by
Electronic companions are the wave of the future in consumer electronics
Posted 01 May 2007 - May 2007 Issue - by
The customer experience isn't about completion rates and ROI; it's about achieving an intended task easily, efficiently, and even enjoyably
Posted 01 Mar 2007 - March 2007 Issue - by
When many people hear the words, "natural language," they immediately think of Star Trek's android, Data, who speaks and understands everyday English. Some software vendors claim anything beyond discrete speech recognition (in which users must pause between speaking each word) as "natural."
Posted 09 Nov 2006 - November/December 2006 Issue - by
Developing and sharing content is a growing activity on the Internet. In addition to passively observing Internet content, users are actively adding to it by uploading their pictures to flickr.com, and sharing their thoughts in blogs and wikis. Readers rate books on amazon.com, and teens post real and fantasy personas on the extremely popular myspace.com, hoping to attract the attention of other teens.
Posted 12 Sep 2006 - September/October 2006 Issue - by
Posted 03 Jul 2006 - July/August 2006 Issue - by
Case study by PIKA Technologies>

NuVoxx focuses on developing and marketing advanced IVR (Interactive Voice Response) services, which it  provides on a hosted model on a pay-per-use basis. The company also builds productivity solutions for call centers, and recently developed and deployed an advanced call recording and monitoring solution for NuComm International, its sister company and the largest privately-held Canadian provider of customer relationship and contact center services.>

Posted 13 Jun 2006 - - by
Three trends that enable IVR systems for mom and pop shops without costing an arm and a leg include prepackaged applications, free starter VoiceXML platforms, and reusable dialog components.
Posted 08 May 2006 - May/June 2006 Issue - by
Graphical user interfaces have been the standard user interface for computer users for over 20 years. It's time to up-level the user's computing experience by voice-enabling applications.
Posted 01 Mar 2006 - March/April 2006 Issue - by
Because many telephones do not have hardware that supports speech processing, a voice server is placed in the network to act as a client on behalf of telephones.
Posted 01 Jan 2006 - January/February 2006 Issue - by
Researchers and practitioners are extending VoiceXML using various techniques to provide new functionality. These include the RDC library tags, xHMI meta language, and a prototype implementation of VoiceXML which supports dictation speech recognition. RDC Tag Library – Developers frequently use Struts1 or other application frameworks to generate HTML. The goal of the Reusable Dialog Component (RDC) project is to provide a similar framework for VoiceXML. Like Struts, RDC has a tag library that hides the…
Posted 07 Nov 2005 - November/December 2005 Issue - by
A Toolkit of Metrics for Evaluating VUIsInvestors use standard metrics such as stock price and projected revenue per share to choose investment opportunities. Likewise, consumers use standard metrics such as floor space, number of bedrooms, or number of bathrooms when purchasing houses. This paper presents a toolkit containing some specific metrics for evaluating voice user interfaces (VUIs). The speech industry should use criteria from this toolkit to: Judge the most efficient of several VUIs for…
Posted 07 Nov 2005 - November/December 2005 Issue - by
Many Web developers use XML to represent data and a transformation language, such as XSLT, ASP, or ColdFusion, to transform the XML data to an HTML user interface. Developers change the values of the XML data without having to manually recode the HTML user interface.
Posted 20 Jun 2005 - July/August 2005 Issue - by
Michael Chavez, vice president of client services at ClickFox, explains that "by leveraging a strategic combination of customer behavior intelligence, customer service interviews and surveys, organizations can reduce customer frustration with IVR systems, which will result in drastic savings, while also improving customer satisfaction."
Posted 01 May 2005 - - by
Matt Nickerson describes how mobile phones enable callers to speak and listen to virtual agents. Using the same device to speak with family, friends and business associates, callers speak with software agents that enable synthetic interviews with individuals in photographs of historical events in a museum. This represents a new way of interacting with objects that are usually only viewed.
Posted 26 Apr 2005 - May/June 2005 Issue - by
At SpeechTEK 2004, a group of leading VUI designers attended the Voice User Interface (VUI) workshop directed by Dr. James A. Larson. Taking the lead for an article on the best practices in VUI, Dr. Larson collected and coordinated this team of VUI specialists to compile the Ten Guidelines for Designing a Successful Voice User Interface. Speech Technology Magazine would like to thank the authors for their contributions to this article and Dr. Larson for…
Posted 06 Jan 2005 - January/February 2005 Issue - by
Customers often ask questions about products, services, delivery dates and account information, as well as, offer suggestions and complaints. If customers do not receive satisfactory answers to their questions, they become disillusioned with the company and take their business elsewhere. In short, good customer service support is the key to repeat business. …
Posted 23 Nov 2004 - November/December 2004 Issue - by
Have you ever wished you could change your VoiceXML platform to use a speech synthesizer or speech recognizer from a different vendor? Have you ever wanted to move your speech synthesizer or speech recognizer to a different server? The Internet Engineering Task Force is proposing a new standard that will provide this flexibility.
Posted 23 Nov 2004 - November/December 2004 Issue - by
Posted 11 Sep 2004 - September/October 2004 Issue - by
The W3C Voice Browser Working Group reviewed more than 700 requests for change to VoiceXML 1.0. After careful deliberation, many of these were adopted, resulting in VoiceXML 2.0 which became a recommendation in March 2004. The VBWG is now working on two efforts to make VoiceXML even better.
A user profile contains information that describes how to personalize a speech user interface to meet the needs for a specific user.
Posted 24 May 2004 - May/June 2004 Issue - by
New technology will change the way people interact with computers. PCs enabled users to use a keyboard and screen rather than review printed reports. The Xerox Star and Apple Macintosh introduced Graphical User Interfaces (GUIs) which made the mouse and other pointing devices popular. Now, we are on the verge of a revolution in technology that makes computing portable. Separating user interface devices from the computing device will dramatically change how people interact with computers.
Posted 08 Mar 2004 - March/April 2004 Issue - by
A speech application does not work if users do not speak when prompted with a question.
Posted 09 Jan 2004 - January/February 2004 Issue - by
Recently, the W3C Multimodal Working Group published a first working draft of EMMA — the Extended MultiModal Annotation markup language — EMMA (www.w3.org/TR/emma/). EMMA’s intended use is to represent the semantics for information entered via various input modalities and the resulting integrated information.
Posted 25 Nov 2003 - November/December 2003 Issue - by
Have you ever been in a place where speaking to a VoiceXML application on a cell phone is impractical? As you know, not all locations or situations are suitable for using speech-enabled handheld devices.
Posted 25 Aug 2003 - September/October 2003 Issue - by
"Welcome to the Ajax speech application. It is possible to perform any of several actions at any time during this application, including the common actions of asking for help, transferring the call to an operator, stopping the application and returning to the main menu." This sample prompt is too long and confusing for most users. However, by using a controlled language, you can improve the prompt for your customers.
Posted 30 Jun 2003 - July/August 2003 Issue - by
Speech standards include terminology, languages and protocols specified by committees of speech experts for widespread use in the speech industry. Speech standards have both advantages and disadvantages. Advantages include the following: developers can create applications using the standard languages that are portable across a variety of platforms; products from different vendors are able to interact with each other; and a community of experts evolves around the standard and is available to develop products and services based on the standard.
Posted 30 Jun 2003 - July/August 2003 Issue - by
The Voice Browser Working Group has finished the technical work on the three major languages in the W3C Speech Interface Framework—VoiceXML 2.0, the Speech Recognition Grammar Specification, and the Speech Synthesis Markup Language. These languages will soon become “check-off items” in a list of features provided by the leading speech platforms.
Posted 05 May 2003 - May/June 2003 Issue - by
In the past three years, the World Wide Web Consortium Voice Browser Working Group has produced several reports that define languages in the W3C Speech Interface Framework. Developers use the W3C Speech Interface Framework languages to create speech applications.
Posted 06 Mar 2003 - March/April 2003 Issue - by
Information processing and the Internet are merging with the telecommunications industry to develop mobile devices using interactive services with global access. Speech recognition offers the most natural way for consumers to use new communication devices and services.
Posted 14 Jan 2003 - January/February 2003 Issue - by
PC users access the World Wide Web using a graphical user interface (GUI) that is commonly specified with HTML. Telephone and cell phone users access the Web using a verbal user interface (VUI) often specified with VoiceXML.
Posted 21 Nov 2002 - November/December 2002 Issue - by
As the cartoon illustrates, users become frustrated when speech applications don’t work. Testing minimizes this frustration by detecting and resolving many speech application problems before they cause user frustration.
Posted 10 Sep 2002 - September/October 2002 Issue - by
VoiceXML has revolutionized the development of telephony applications, in that telephone users can call Web sites and converse with VoiceXML applications. The system uses a TTS synthesizer or prerecorded voice and users can respond by speaking answers to the questions. Currently VoiceXML is weak in telephony controls. About all you can do in VoiceXML is and , which are powerful enough for many applications, but not powerful enough for many others, such as event notification and conferencing. However, with the coming of new call control capabilities, many of these restrictions will be overcome.
Posted 11 Jul 2002 - July/August 2002 Issue - by
VoiceXML and SALT are both markup languages that describe a speech interface. However, they work in very different ways, largely due to two reasons: (i) they have different goals, (ii) they have different Web heritages.
Posted 21 May 2002 - May/June 2002 Issue - by ,
Student teams are very creative when asked to design and implement speech applications of their choice. Here are some of the prototype speech applications recently implemented by students at Georgia Institute of Technology, Washington State University, Portland State University and Oregon Health and Sciences University.
Posted 21 May 2002 - May/June 2002 Issue - by
Users can choose from among several devices to access the World Wide Web. These devices include PCs with a visual Web browser, such as Netscape's Navigator or Microsoft's Internet Explorer, to interpret HTML files downloaded from a Web server and executed on the PC; telephones and cell phones with a verbal browser to interpret VoiceXML files downloaded from a Web server and executed on a voice server; and WAP phones with a visual browser to interpret WML files downloaded from a server and executed on the WAP phone.
Posted 29 Mar 2002 - March/April 2002 Issue - by
The goal of speaking to your computer, and having it do what you say, has been a goal of IBM’s speech recognition research team for over 25 years.
 

MarketPlace - Sponsored Links
ITIResearch.com
A collection of market research and reports for executive management and business & IT professionals
Start Talking…Get Connected
nameConnector transforms the way staff and customers connect with each other by providing a powerful speech-enabled interface to any existing or planned telecom system. Achieve robust operational and financial benefits with a wide variety of innovative, turnkey applications. Find out why leading hospitals, universities, and Fortune 500 companies use nameConnector. Click here.
Gold Sponsors
Silver Sponsor
Media Sponsors