SpeechTEK.com home
SpeechTEK 2009
August 24 - 26, 2009 • Marriott Marquis • New York, NY
Final Program & Presentation LinksFinal Program [PDF]Conference At a Glance [PDF]
Exhibitor ListEvent SponsorsSpeakers
2009 Speech Industry Awards2009 HomePrevious Conferences
News RoomCompany WorkshopsCRM Evolution
Attendee SurveyExhibitor SurveyPhotos/WCBS Radio Spot
SpeechTEK 2009 - Wednesday, August 26 2009
SUNRISE DISCUSSIONS
SD301 – How to Approach a Multilingual System Design
8:00 a.m - 8:45 a.m
MODERATOR:
Dr. Nava Shaked, CEO - Business Technologies

In this session we will suggest a checklist for issues needed to be taken into consideration when designing a multilingual speech system. We will go over them, make a prioritized list, and see how it differs from one organization to another. In addition, we will present the important do's and don'ts in designing a multilingual system and finally offer some insights from experience to give the audience a head start. The session will be interactive as much as possible to discuss and share actual problems and solutions for these challenges.

SD302 – Long-Term Speech Application Monitoring, Maintenance, and Expansion
8:00 a.m - 8:45 a.m
MODERATOR:
Dr. Silke Witt-Ehsani, Vice President, VUI Design Center, Design Center - TuVox
Download Presentation

Now that speech applications are often running for a number of years, it has become critical to have a viable approach to measure performance
over time as well as to maintain and update an application. This session is targeted at both people responsible for their company's speech applications as well as speech application professionals who are in contact with the same speech application year after year. This session will outline the elements of a speech application’s lifecycle beyond initial go-live and present best practices. Attendees will be asked to share their best and worst experiences with maintaining applications.

SD303 – Taking a SIP of Java-Building Voice Mashups Using SIP Servlets
8:00 a.m - 8:45 a.m
MODERATOR:
R.J. Auburn, Chief Technology Officer - Voxeo

The Java world has seen more than its fair share of telephony APIs come and go over the years. Things like JAIN SIP, JTAPI, Parlay, and JAIN SLEE have been showing up on product road maps for several years now, but SIP Servlets (just released as a 1.1) are finally seeing some industrywide adoption in application servers, both closed and open source. SIP servlets provide a simple API that mimics the widely deployed HTTP servlet model, allowing for Java developers to easily adapt their existing code and framework for communication applications. This session will provide a short introduction to SIP servlets and show people how they can quickly create a converged application using the technology. We will also show how you can quickly plug into simple web APIs using SOAP and REST to create quick web mashups.

TRACK A: BUSINESS STRATEGIES
KEYNOTE PANEL - SaaS in Speech
9:00 a.m - 10:00 a.m
MODERATOR:
Nancy Jamison - Jamison Consulting

The low-cost, high-value of the Software-as-a-Service (SaaS) model, which has already revolutionized the CRM market, is now making inroads into the speech industry. Hear what some of the leading SaaS speech technology vendors have to say about SaaS in the speech technology industry.

Paul Watson - Convergys Corporation
Jamie Bertasi, General Manager of Business Solutions - Microsoft Tellme
R.J. Auburn, Chief Technology Officer - Voxeo
Mr. Dan Reed, Corporate Vice President - Voxify
A301 – Panel: Hosting Relationships
10:45 a.m - 11:30 a.m
MODERATOR:
Elaine Cascio, Vice President - Vanguard Communications Corp

Hosted speech solutions offer significant benefits for many organizations making their first forays into speech technology. But automated speech applications and your business grow and change over time, and your relationship with your hosting provider should change as well. Learn how to get the most out of your ongoing relationship with your hosting provider in the face of the inevitable changes in your business.

Laura Marino, Sr. Director of Product Management- Nuance On-Demand, Enterprise - Nuance Communications
Mike Mateer, VP Systems Development - West Interactive
A302 – Measuring Customer Experience
11:45 a.m - 12:30 p.m
MODERATOR:
Eduardo Olvera, Senior User Interface Designer, Professional Services - Nuance Communications
CEI: The Caller Experience Index
James Mesbur, Voice Interaction Designer/Speech Scientist Engineer - SpeechCycle

Understanding and evaluating the Caller Experience (CE) provided by voice-driven applications is key to measuring whether a system is achieving its business goals. Knowing how callers perceive a system can help guide business and design decisions by pointing out when callers have trouble versus get helped. SpeechCycle will discuss why this is a critical success factor and talk about methods it has pioneered for measuring caller experience in near real time, providing valuable feedback about true application performance. 

A Solid Approach to Measuring and Improving CSAT
Kristie Goss, Senior Analyst, VUI Designer - Convergys Corporation
Download Presentation

This presentation covers an experiment that was conducted to identify what impacts customer satisfaction. Additionally, this talk provides a list of CSAT factors that can be used to determine which enhancements may help or hurt customer satisfaction. A methodology for prioritizing each factor will also be presented. VUI designers and business stakeholders will learn an approach for accurately measuring CSAT, how to choose the best CSAT question(s), and a method for improving CSAT.

A303 – Evolution of Multimodal Customer Care
1:45 p.m - 2:30 p.m
MODERATOR:
Ms Darla Tucker, Customer Solutions Executive - Convergys
Evolving Applications From Single to Multimodality: Assessing Impact of Change
Simon Reavely, Principal Architect - Orange Labs Boston LLC
Download Presentation

Mobile devices provide a personalized portal to a vast amount of searchable information. Although the GUI on mobile devices remains an efficient output modality, it is believed that providing access to data sources using voice recognition and text to speech will significantly enhance accessibility and uptake. We examine the existing starting points that include IVRs and thick/thin graphical applications. We discuss where and how to change applications to support multimodality and consider trade-offs.

The Multimodal Wireless Future of Customer Care
Roberto Pieraccini, Chief Technology Officer - SpeechCycle

The increasing adoption of wireless communications and smartphones creates the conditions for the evolution of traditional self-service customer care toward a multimodal interaction paradigm. The combination of separate GUI and VUI modes of interaction into a multimodal interface (MUI) allows people to take advantage of their complementary nature for an improved user experience. MUI, together with enterprise voice search—the multimodal evolution of natural language call routing—will drive adoption of self-service customer care toward higher levels of automation.

A304 – Emerging Trends in Speech
2:45 p.m - 3:45 p.m
MODERATOR:
Phillip Hunter, UX for Partners - Microsoft Tellme
Elaine Cascio, Vice President - Vanguard Communications Corp
Ryan Joe, Associate Analyst, Customer Interaction Technologies - Datamonitor
Aphrodite Brinsmead, Associate Analyst - Datamonitor
Download Presentation

As companies strive for greater customer intimacy through personalized service, customer-based sales, and proactive outreach, what is the role of speech? Hear real-life examples of successful, customer intimate applicationsand critical steps you can take to use speech to engage, not alienate, customers. We’ll cover the best ways to combine speech with live interactions to enhance the customer experience and how your company can leverage the power of speech for higher levels of customer intimacy and loyalty.

TRACK B: PRINCIPLES OF VUI DESIGN
KEYNOTE PANEL - SaaS in Speech
9:00 a.m - 10:00 a.m
MODERATOR:
Nancy Jamison - Jamison Consulting

The low-cost, high-value of the Software-as-a-Service (SaaS) model, which has already revolutionized the CRM market, is now making inroads into the speech industry. Hear what some of the leading SaaS speech technology vendors have to say about SaaS in the speech technology industry.

Paul Watson - Convergys Corporation
Jamie Bertasi, General Manager of Business Solutions - Microsoft Tellme
R.J. Auburn, Chief Technology Officer - Voxeo
Mr. Dan Reed, Corporate Vice President - Voxify
B301 – Bilingual Spanish/English Design
10:45 a.m - 11:30 a.m
MODERATOR:
Mary Constance Parks, Principal Interaction Designer - Nuance Communications
¡Dígame! Tell Me What I Need to Say
Ms. Sondra Ahlén, Principal VUI Consultant/Owner - SAVIC
Download Presentation

Speech applications need to be designed differently for Spanish than for English.  Cultural and language differences, including "Spanglish," affect design requirements and how callers interact with the application. Our usability study of a U.S. English/Spanish application uncovers new issues.  Menu items, backoff strategies, the preference for touchtone versus speech, and even how to choose participants for a bilingual usability test must be considered.  This presentation will provide a stepping stone for creating better Spanish language speech applications.

How to Present Names of Geographical Locations in Spanish Systems
Eduardo Olvera, Senior User Interface Designer, Professional Services - Nuance Communications

When Spanish-speaking users call an automated system in the U.S., the resulting conversation might not be entirely in Spanish, particularly when dealing with names of geographical locations. The way an IVR presents such information can help or hinder usability, and it can make a statement about how a company relates to its Hispanic customers. This presentation will examine key factors for making the best possible choice, such as caller background (demographics, caller preferences, acculturation), inherent characteristics of the location names, common usage, and the purpose of the system.

B302 – Designing for Cultural Differences
11:45 a.m - 12:30 p.m
MODERATOR:
Steve Chirokas, Executive Director, Marketing - VoltDelta OnDemand
6 Ways to Bulletproof Your Multicultural Global Speech Deployment
Aaron Fisher, Director of Speech Services - West Interactive

English speakers globally interact with dialogues differently, depending on their country of origin, culture, and cultural norms. When deploying speech applications in a global context, VUI and grammar designers need to have sensitivity to multicultural differences in the manner in which English is spoken. While most VUI designers realize that designing in another language (such as Spanish) is fundamentally different than designing for English, less is known about the differences in how the English language is spoken globally. Dialogue designs and grammars must reflect these differences for real success.

¡Viva la Diferencia! Diversity and Unconventional Approaches Toward Reaching and Keeping Spanish-Speaking Callers
Ramón Solórzano Jr. Ph.D., Principal - Parasol Communication Services

This presentation will provide new independent research on the factors that come into play when Hispanic consumers choose which language they want to use in an IVR. We present preliminary findings from consumer interviews, using anthropological methodology to observe activities in everyday settings to sift out less visible cultural perspectives. This study challenges the assumption that people who are primarily Spanish speakers prefer to use Spanish IVRs and takes into account variables such as time in the U.S., dynamics of bilingual acquisition, dialectical differences, education level, and attitudes toward English and Spanish.

B303 – Challenging Contexts of Use
1:45 p.m - 2:30 p.m
MODERATOR:
Michael Smith, Self Service Solution Opportunity Architect, Professional Services - Avaya, Inc.
They’re Not Always Listening: Designing for Distracted Callers
Mary Constance Parks, Principal Interaction Designer - Nuance Communications

Speech applications are often designed with the underlying assumption that callers are listening to all the prompts throughout the call. Therefore, when callers do get distracted, they are often left without a means of completing their tasks or achieving their goals. This talk will provide examples of distraction and describe its impact on the caller experience. Specific techniques will be shown that can help callers succeed at tasks, even when they have been distracted.

Prescription for Healthcare IVRs
Nancy Gardner, Senior Analyst, RTM Consulting and Professional Services - Convergys Corporation
Download Presentation

This prescription for improved customer satisfaction and increased call containment focuses on these best practices: caller-authentication solutions that meet HIPAA guidelines and address concerns about personal privacy and security; assistive dialogue design strategies geared toward an aging population; and intelligent use of backend customer information. This presentation examines solutions in real-world healthcare applications such as prescription refill, new member enrollment, and claim status.

B304 – Panel: The VUI Showdown: Data for Menu Design
2:45 p.m - 3:45 p.m
MODERATOR:
Peter Leppik, President and CEO - Vocal Laboratories, Inc.

Every VUID’s favorite session returns for 2009! This year we’ve established a theme for the VUI Showdown: best practices in menu design. The Showdown will begin with a show-and-tell of empirical data from several ongoing studies of menu design. Be prepared to listen critically and chime in with your own experiences. Bring your opinions to help set the course for follow-up research and take part in the always-lively discussion.

Jenni McKienzie, Voice Interaction Designer - Travelocity
Dr. Silke Witt-Ehsani, Vice President, VUI Design Center, Design Center - TuVox
TRACK C: BEST PRACTICES FOR DEVELOPERS
KEYNOTE PANEL - SaaS in Speech
9:00 a.m - 10:00 a.m
MODERATOR:
Nancy Jamison - Jamison Consulting

The low-cost, high-value of the Software-as-a-Service (SaaS) model, which has already revolutionized the CRM market, is now making inroads into the speech industry. Hear what some of the leading SaaS speech technology vendors have to say about SaaS in the speech technology industry.

Paul Watson - Convergys Corporation
Jamie Bertasi, General Manager of Business Solutions - Microsoft Tellme
R.J. Auburn, Chief Technology Officer - Voxeo
Mr. Dan Reed, Corporate Vice President - Voxify
C301 – VUI Considerations for Speaker Verification
10:45 a.m - 11:30 a.m
MODERATOR:
Dr. Judith Markowitz, President - J. Markowitz Consultants
Considering Text Dependency for Speaker Verification
Ron DeLucia, Senior Consultant, Self Service Solutions - Avaya
Download Presentation

Choosing between text-dependent or text-independent verification greatly impacts the caller's enrollment process, performed to effectively capture the caller's voice print, and the authentication process, used to optimally identify the caller afterward. We also consider functionality such as "liveness" testing that can be used to thwart potential recorded attacks, and the simple "single step" authentication and identification process. Each of these aspects will guide your selection between text-dependent and text-independent verification and ensure that your design attains a proper balance between security and convenience.

Designing for Customer Acceptance of Voice Biometrics
Mr David John Attwater, Senior Scientist, EIG UK - Enterprise Integration Group
Download Presentation

This presentation will discuss the key factors underlying customer acceptance of voice biometrics. Based on direct evidence from customer testing in Europe and the U.S., it will address the topics of customer trust and the role of brand in acceptance of biometric technology, mapping customer understanding of biometrics to dialogue design principles, customer perception of trade-offs between security and enrollment, and authentication time and effort.

C302 – Improving Grammars
11:45 a.m - 12:30 p.m
MODERATOR:
Dr. Daniel C Burnett, Director of Speech Technologies - Voxeo
The Art and Science of Speech Grammar Coverage Analysis
Dominique Boucher, Lead Software Developer - Nu Echo Inc.

As the speech recognition grammars required by today's applications become increasingly complex, identifying and fixing grammar coverage problems can become quite challenging. Using real-life examples, this session will provide an overview of some of the best practices and techniques for effective speech grammar debugging and coverage analysis. In particular, we will showcase tools no speech scientist should live without: an interactive sentence explorer and a sophisticated, highly customizable sentence generator.

How to Effectively Add Weights to SRGS Grammars
Stephen Keller, Project Manager - LumenVox
Download Presentation

Weighting is an important and often misunderstood tool in the grammar toolbox. By adding weights to SRGS grammars, developers can help automatic speech recognizers distinguish between similar-sounding words, significantly improving the accuracy of grammar-based recognizers. However, the misuse of weights will almost certainly decrease accuracy. This presentation discusses how weights are used in SRGS grammars, outlines strategies for collecting the data necessary to apply correct weighting, details a strategy of testing, and using real-world grammars and application results, demonstrates how weighting can improve the success of a speech application.

C303 – New Development Languages
1:45 p.m - 2:30 p.m
MODERATOR:
Dr. Ian Sutherland, Consulting Member of Technical Staff, Voice Lab - Oracle USA Inc.
Invoking External Resources From SCXML
James Barnett, Director - Alcatel Lucent
Download Presentation

SCXML is a new W3C language for describing control flow. This talk will describe SCXML's mechanisms for communicating with external entities. It will cover communicating with other SCXML sessions, loading data from external sources, invoking external services, and sending and receiving external events.

Improving Dialogs With EMMA
Dr. Deborah Dahl, Principal - Conversational Technologies
Download Presentation

The Extensible MultiModal Annotation specification (EMMA) is a standard of the World Wide Web Consortium for representing user input. While most commonly used for representing the interpretation of spoken input in dialogues, EMMA provides a great deal of additional information that can also be taken advantage of in speech applications. This presentation will discuss various ways in which this additional information can be used.

C304 – Tuning
2:45 p.m - 3:45 p.m
MODERATOR:
Dominique Boucher, Lead Software Developer - Nu Echo Inc.
“Get Human” With Text to Speech: Advanced TTS Tuning Techniques to Make Your Application Sound Great!
Aaron Fisher, Director of Speech Services - West Interactive

Many callers dislike listening to text-to-speech because it sounds robotic and unnatural. While TTS engines today are much better than they used to be, the capabilities of the engine itself is only one driver of the overall quality and audibility of text-to-speech. Other drivers, like quality and integrity of words and abbreviations contained in customer information systems, are equally-if not more-important. But these challenges can be addressed and text-to-speech can be made to sound very natural-it just takes time, resources, and a strong commitment to providing an outstanding voice user interface (VUI). Enterprises that devote resources toward analyzing and tuning the text-to-speech component of their applications on a regular basis, will reap the benefits of a better VUI and improved overall caller experience.

Getting Top Performance From Your Speech Application With Advanced Tuning Techniques
Mike Moore, Analytics Manager - West Interactive

Despite all of the best efforts during the initial VUI design and implementation of a speech-enabled IVR application, changes in the business landscape require continuous monitoring of application performance. Without the utilization of continuous advanced tuning, application performance can erode over time. Avoid this pitfall by planning and allocating resources toward regularly scheduled tuning. This session will offer tuning techniques to ensure ongoing improvements to your application and a successful caller experience.

A Top-Down/Bottom-Up Approach to Optimizing IVR performance to Meet SLAs
Mr. Douglas Rosenberg, Technical Consultant, Speech Science - Convergys Corporation
Download Presentation

Optimizing IVR performance to meet clients' service level agreements requires analyzing a wide range of data. There are several methods that can be used to evaluate the data in order to provide recommendations with the biggest benefits. This presentation describes a top-down and bottom-up approach that is driven by SLAs and performance data. These techniques are useful for the VUI designer during design phase and the tuning analyst during post-deployment optimization phase.

TRACK D: SOLUTIONS
KEYNOTE PANEL - SaaS in Speech
9:00 a.m - 10:00 a.m
MODERATOR:
Nancy Jamison - Jamison Consulting

The low-cost, high-value of the Software-as-a-Service (SaaS) model, which has already revolutionized the CRM market, is now making inroads into the speech industry. Hear what some of the leading SaaS speech technology vendors have to say about SaaS in the speech technology industry.

Paul Watson - Convergys Corporation
Jamie Bertasi, General Manager of Business Solutions - Microsoft Tellme
R.J. Auburn, Chief Technology Officer - Voxeo
Mr. Dan Reed, Corporate Vice President - Voxify
D301 – Designing Documentation
10:45 a.m - 11:30 a.m
MODERATOR:
Bob Cooper, CEO - Swampfox Inc.
CCS: Documenting the Complete Design
Brian Budd, Lead Engineer, GRB Technology - Barclays Bank PLC

The voice user interface forms the hub of the design for any IVR system, with every other part of the design needing to support it. This presentation will share design documentation lessons learned from the implementation of a self-service application for a major U.K. financial institution, giving details of the strategies and techniques used for document production and management, what worked, what didn't work, and ideas for future improvements.

A Picture Is Worth a Thousand Words: A Different Approach to VUI Design Documentation
Karen Owens, Senior Consultant-Speech Engineer, Avaya Professional Services - Avaya
Download Presentation

This presentation will cover an emerging process for VUI design that uses a combination of hidden data and automatic data extraction to create a comprehensive design document. First, the presentation will address the pros and cons of commonly used VUI design documentation. Next, it will show how to create hidden data in Visio objects, how to display the hidden data through data graphics, and finally, how to extract and format the hidden data.

D302 – How Airlines Improve Service With IVR
11:45 a.m - 12:30 p.m
MODERATOR:
Dr. Valentine Matula, Director Multimedia Research - Avaya
[CCS] Improving Flight information Services With Automated Speech Recognition
Mr. Michael Trank, Product Manager - APEX Voice Communications

Live customer service representatives were getting overloaded with routine flight information questions at one of the leading ultra low-cost carriers based in the U.S. The time-consuming task for representatives to provide flight information was automated with the deployment of the OmniVox3D SIP Application Server and Service Delivery Platform from APEX Voice Communications. Callers looking for flight departure and arrival information simply speak the flight number, departure and arrival city, or airport code to receive the requested information. By accessing the same database as customers on the web, OmniVox3D ensures that the same information is made available to customers calling the automated flight information service.

[CCS] How Continental Airlines Uses Outbound IVRs to Transform the Customer Experience
Ms. Sonja Lough, Manager - Self Service Communications - Continental Airlines

Continental has been a leader in the development and deployment of self-service technologies for many years. Utilizing a multichannel strategy has enabled the adoption of self-service across multiple touch points and customer groups. In 2008, Continental launched its first outbound IVR for flight check-in. While this application offers an experience similar to that of the online or kiosk check-in experience, it has been designed to leverage the unique capabilities of the voice channel and uses a module-based design, which allows new features to be integrated seamlessly. The outbound flight check-in IVR has since been extended to become an inbound IVR, based on its success and modularity.

D303 – Adding Video to Voice
1:45 p.m - 2:30 p.m
MODERATOR:
Paolo Baggia, Director of International Standards - Loquendo
Deploying Interactive Voice & Video Response Applications—Challenges and Solutions
Dr. Valentine Matula, Director Multimedia Research - Avaya

Creating an interactive voice and video response travel application for an international deployment involves many parties: the IVVR platform provider's R&D team, in-country sales team, and a business partner for local support; the hosting in-country mobile phone carrier; a travel industry company; and the consumer. This talk will cover the challenges in delivering an IVVR solution that can host not only a travel application, but a stream of applications for a range of hosted customers.

VoiceXML and Video: Which Road Leads to Rome
Bob Cooper, CEO - Swampfox Inc.

This session will focus on various approaches to adding video support to VoiceXML platforms and the pros and cons of each. The talk will discuss aspects of VXML 3 and how it, as well as some supporting standards such as SMIL, is well suited to support video. The session will begin by looking at the user experience needed for various interactions and then consider this when looking at the underlying technology.

D304 – Panel: Best Practices for Developers
2:30 p.m - 3:45 p.m
MODERATOR:
James A. Larson, Vice President - Larson Technical Services

Speech technology customers will summarize their experiences by enumerating best practices for designing, deploying, and using speech technology. Benefit from the experiences of those who have been in the trenches. Avoid the mistakes made by others.

Brian Budd, Lead Engineer, GRB Technology - Barclays Bank PLC
Ms. Sonja Lough, Manager - Self Service Communications - Continental Airlines




MarketPlace - Sponsored Links
ITIResearch.com
A collection of market research and reports for executive management and business & IT professionals
Start Talking…Get Connected
nameConnector transforms the way staff and customers connect with each other by providing a powerful speech-enabled interface to any existing or planned telecom system. Achieve robust operational and financial benefits with a wide variety of innovative, turnkey applications. Find out why leading hospitals, universities, and Fortune 500 companies use nameConnector. Click here.
Gold Sponsors
Bronze Sponsors
Official Twitter
Sponsor
Media Sponsors