SpeechTEK.com home
SpeechTek 2007 - Advance Program
August 18-20, 2008 • New York Marriott Marquis • New York, NY
Conference Overview Conference Chairs/Advisory Committee Final Program
Conference at a Glance [PDF] Exhibitor List/Floor Plan Speaker List
  Previous Conferences  
A CD-ROM is available for purchase through The Digital Record,
featuring audio and supplemental materials (such as PowerPoint slides) for most of the sessions at SpeechTEK 2007.
Visit www.digitalrecord.org to purchase the SpeechTEK 2007 Conference CD, or call toll-free 1-800-338-2111.
SpeechTek 2007 - Tuesday, August 21
TRACK A: THE YEAR OF LIVING VIRTUALLY: HOSTED & MANAGED SERVICES
Soho Room (7th floor)
Speech Technology at the Tipping Point
(Broadway Ballroom)

9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink


The market for speech technology topped the $1 billion mark in 2006, up 100% from 2 years ago. As adoption of speech systems increases in all parts of our lives, from our computers to our cars to customer relationships, it is clear that speech technology is at the tipping point. Hear from Malcolm Gladwell, who coined this phrase, about how he sees ideas, behaviors, and patterns moving through a population in much the same way as an epidemic. He posits that ideas can become contagious and reach a tipping point, just as a virus reaches critical mass. Learn what it takes to start a social epidemic and create change and how to push speech technology, and your businesses, toward the tipping point.

A201 – The Outsourcer’s Perspective
10:45 a.m - 12:00 p.m
MODERATOR: Mr Dan Miller, Senior Analyst - Opus Research, Inc.
Lynn Jobe, Sales Director, Strategic Products - Intervoice
Henry McCreary, Chief Architect and Director of Telecommunications - CSX Technology


Outsourcing is a common practice and is often the best practice for costeffective, high-quality customer service and self-service. In this session, leading service providers and their customers introduce and describe the criteria that influenced their decision to outsource and their experiences with the team of application developers, carriers, and hosted services providers.

Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN


 

A202 – Contrasting Service Provider Approaches
1:30 p.m - 2:30 p.m
MODERATOR: Clegg Ivey, VP, Mergers & Acquisitions, General Counsel - Voxeo
John Hibel, Vice President of Marketing - Voxeo
Ron Owens, Director, Multimedia Applications PSO - Nortel
Tom Smith, Senior Manager - Verizon Digital Media Services
Mike Moors, Director, Sales - Genesys


Nearly every service provider offers savings in capital expense and contact center operating costs. Increasingly, service providers differentiate themselves through partnering strategies, technology in use, application development and management resources, and support of multiple modalities of customer care and self-service.

A203 – Meeting Business Challenges
2:45 p.m - 3:45 p.m
MODERATOR: Mr Dan Miller, Senior Analyst - Opus Research, Inc.
Mark Abramson, CEO - Message Technologies Inc.
Tim Moynihan, Vice President of Marketing, Enterprise Solutions - EMPIRIX
Steven Pollock, Executive Vice President & Co-Founder - TuVox
Michael X. Zirngibl, Founder & President - Angel.com Incorporated


This has been the year of the end user. With so much emphasis on the customer experience, being able to identify snags in self-service applications at a highly granular level and to make corrections dynamically is crucial. In this session, application framework providers describe how they support application development that meets customer service objectives.

Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m

A204 – View of the Global Future
4:15 p.m - 5:15 p.m
MODERATOR: Mr Dan Miller, Senior Analyst - Opus Research, Inc.
Juan Ceballos, Chief Technology Officer - Ydilo
Dr. Andrew Hunt, Vice President, Engineering - Holly Connects
Susan Wilson, Director Business Development, DA Services - Nuance Communications


In this session, the biggest stakeholders in global, outsourced communications discuss how the distributed, virtualized contact center will integrate speech automation with multimodal communications (blending live agents with IM, chat, and video messaging) and what this means for the future of customer service.

Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m

TRACK B: DESIGNING THE SPEECH CUSTOMER EXPERIENCE
Empire Room (7th floor)
Speech Technology at the Tipping Point
(Broadway Ballroom)

9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink


The market for speech technology topped the $1 billion mark in 2006, up 100% from 2 years ago. As adoption of speech systems increases in all parts of our lives, from our computers to our cars to customer relationships, it is clear that speech technology is at the tipping point. Hear from Malcolm Gladwell, who coined this phrase, about how he sees ideas, behaviors, and patterns moving through a population in much the same way as an epidemic. He posits that ideas can become contagious and reach a tipping point, just as a virus reaches critical mass. Learn what it takes to start a social epidemic and create change and how to push speech technology, and your businesses, toward the tipping point.

B201 – Para Continuar en Espanol
10:45 a.m - 12:00 p.m
MODERATOR: Eduardo Olvera, Senior User Interface Designer, Professional Services - Nuance Communications


The world is getting smaller, and increasingly this is reflected in speech applications that include more than one language. Enabling access to a speech application in another language is not a simple matter of translating the prompts and switching the recognition algorithm to the other language. Experts in this session offer tips on adding a second language to an existing application and how best to approach the design of new multilingual applications.

The Use of Spanglish in Speech Systems Designed for Hispanics in the U.S.
Jose L Elizondo, Senior Manager, Multilingual VUI Design - Nuance Communications

What type of Spanish is the best to use for the Hispanic population in the U.S., who have backgrounds from more than 20 different countries. One solution that is not discussed often enough and that upsets language purists is mixing English and Spanish, or even creating hybrid words in “Spanglish.” This presentation addresses linguistic facts about Hispanics in the U.S. and the appropriateness of using mixed-language phrases and Spanglish words.

Adding a Second Language to Your Speech Application
Aimee Piercy, User Interface Designer - Nuance Communications

There are many challenges to adding a second language to your speech application. Hear useful advice in this presentation to help you ensure a smooth design and development process, including real-life, concrete examples drawn from practical experience working on various Spanish voice applications.

Insights to Multilingual Applications
Prabha Sundaram, Speech Specialist - Nortel

There is a growing need to design and develop speech-enabled, self-care applications that are bi- or multilingual (e.g., Spanglish, Hinglish etc.). In these design projects, it is important to understand cultural differences, analyze speaking styles, sentence constructs, etc. Learn about challenges and solutions for tailoring and designing these applications.

Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN


 

B202 – Getting the VUI Right When Recognition Goes Wrong
1:30 p.m - 2:30 p.m
MODERATOR: David C Martin, Managing Principal, Self Service Solutions EMEA, Professional Services - Avaya


Recognition errors are a fact of life for speech systems. The job of a good voice user interface is to make the errors as inconsequential as possible to the end user. Learn how to handle confirmations to build user confidence without becoming a burden, and hear about new approaches to error-handling for natural language applications.

To Confirm, or Not to Confirm ... That Is the Question
Kristie Goss, Senior Analyst, VUI Designer - Convergys Corporation

Download Presentation

Confirmations are a critical piece to the VUI puzzle. This presentation investigates various confirmation strategies and illustrates successful phrasing techniques, including a natural-sounding way to confirm dynamic data. Attendees will expand their confirmation toolkit and learn how to improve caller satisfaction and decrease call duration in their own IVR applications.

Fault Tolerance at the VUI Level
Daniel Padgett, Senior Speech Consultant - Voice Partners/VoxGen
Jessica Hicks, Speech Consultant - Versay Solutions

Download Presentation

Hear about proven strategies for building fault-tolerant VUIs for smarter speech applications. The speakers discuss the use of statistical language models, error strategies tailored to specific call paths, and the careful use of confirmations to minimize errors, improve performance, and increase overall customer satisfaction.

B203 – Making It Personal
2:45 p.m - 3:45 p.m
MODERATOR: Tom Houwing, Director - voiceandvision B.V.


Speech applications used to be conceived of as one-size-fits-all interactions in which every user had the same experience. Increasingly, organizations are capitalizing on data they already know about customers to tailor the experience specifically for them. Whatever you know about the customer can be used to predict the likely reason for the call and present relevant information and services proactively. Learn techniques for creating personalized interactions that make calls seem efficient, productive, and surprisingly satisfying.

Creating a Personalized User Experience
Barb Mackraz, Principal - Mackraz Design, LLC

A great application is not only smart and engaging, but also relevant. This session looks at how you can draw on intelligence about a user to adapt and tailor the experience for him. Topics include streamlining a user interface and offering features proactively based on usage patterns and predictions of intent—and logistical challenges you’ll encounter. This approach to design is especially effective for mobile applications in which a phone identifies a single user.

Developing Dynamic Personalized Contact Center Applications with Cisco
Cory Wright, Sr Marketing Manager, Customer Contact Business - Cisco

Download Presentation

This session shows how to create a personalized contact center application using Cisco Call Studio, speech recognition, VoiceXML, and Web or enterprise applications. Learn how the Cisco Customer Voice Portal and interactive voice response technology built into Cisco IOS work together to allow you to build and deliver self-service applications.

Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m

B204 – Watch Your Grammar
4:15 p.m - 5:15 p.m
MODERATOR: Dr. Juan E. Gilbert, IDEaS Professor & Chair, Division of Human-Centered Computing - Clemson University
Judi Halperin, Principal Consultant and Team Lead, Global Speech Engineering, Contact Center Practice - Avaya Inc.
Mr. David L Thomson, PMTS - AT&T Labs


Download Presentations

Speech applications are only as effective as the grammars they reference. As such, grammars should be constrained as tightly as possible, and grammar over-generation should be avoided at all costs. This session’s speakers discuss various methods available to help constrain grammars, as well as some grammar development pitfalls and the testing methodologies that will help avoid them.

Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m

TRACK C: ADVANCED SPEECH TECHNOLOGY SYMPOSIUM
Shubert Room (6th floor)
Speech Technology at the Tipping Point
(Broadway Ballroom)

9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink


The market for speech technology topped the $1 billion mark in 2006, up 100% from 2 years ago. As adoption of speech systems increases in all parts of our lives, from our computers to our cars to customer relationships, it is clear that speech technology is at the tipping point. Hear from Malcolm Gladwell, who coined this phrase, about how he sees ideas, behaviors, and patterns moving through a population in much the same way as an epidemic. He posits that ideas can become contagious and reach a tipping point, just as a virus reaches critical mass. Learn what it takes to start a social epidemic and create change and how to push speech technology, and your businesses, toward the tipping point.

C201 – New Approaches to Dialog Design
10:45 a.m - 12:00 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC


As designers are urged to create ever-more sophisticated self-service applications, the pressure for evolving new techniques grows in importance. New Eclipse-based graphical tools oriented around the identification, definition, and reuse of hierarchical dialog patterns and novel nonlinear call flows assisted by agents are described in this dialog design session.

A Graphical Tool for Pattern-Based Dialog Design
Dominique Boucher, Lead Software Developer - Nu Echo Inc.

This presentation shows an Eclipse-based, graphical environment for developing speech applications that specifically addresses the problem of capturing and expressing recurring dialog patterns. This tool transforms the process of designing and implementing dialogs by specifically orienting the design process around the identification, definition, and reuse of hierarchical dialog patterns.

Non-Linear Call Flow Design
Clifford Harlow, Vice President, Client Services - Spoken Communications

Download Presentation

Most speech IVR applications are unable to skip utterances that they don’t understand. In contrast, live agents can gather information out of sequence or discern intent. By uniquely combining speech technology with humans, callers can have a more natural, free-flowing self-service experience because they are not locked into a rigid call flow.

Adaptive Voice Dialogs Based on Automatic Speaker Classification
Joachim Stegmann, Head, Advanced Voice Solutions - T-Systems Enterprise Services GmbH

Download Presentation

This presentation describes the technology and applications of automatic speaker classification (e.g., age, gender, language, and emotions) in voice portals. It shows how dialog parameters should be adapted to achieve improved user acceptance in IVR systems. The first results from pilot implementations within Deutsche Telekom prove the feasibility and show advantages compared to conventional, non-adaptive systems.

Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN


 

C202 – Artificial Intelligence & VUI Design
1:30 p.m - 2:30 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC


The growing sophistication of VUI designs demands the incorporation of new technologies, including those borrowed from other disciplines. This session focuses on the novel application of artificial intelligence technology concurrently using a dialog engine and a problem-solving engine. It also illustrates the use of natural language to understand the semantics and context of any phrase being processed, making it much easier to develop the answers.

Artificial Intelligence in Voice Self-Service Applications
Mahesh Rajagopalan, President & Co - Resolvity
Dr. Jacek Jarmulak, Senior AI Scientist - Resolvity

Download Presentation

This presentation discusses how AI technologies may be used in voice self-service applications to separate the product support logic from the call flow logic, take advantage of the problem-solver’s knowledgebase to develop dialogs, improve speech recognition, create dynamic call flows, and provide an effective and efficient troubleshooting experience. Learn about strengths and weaknesses, rule-based systems, Bayesian inference, decision-trees, and knowledgebases.

Improve Your VUI Design with an AI-Based Conversational Dialog Solution
Peter Trompetter, Vice President, Global Development - GyrusLogic

Natural language understanding is an excellent augmentation to an existing or new VUI for better automated call completion and customer satisfaction. Hear about a solution that makes it easier to develop the natural language application after understanding the semantics and context of any phrase.

C203 – Advances in Video & Multimedia Application Design
2:45 p.m - 3:45 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC


The availability of a robust 3G infrastructure throughout Europe and much of Asia has released pent-up customer demand to add live video to extend the utility of voice communications. This session illustrates how video menus, pictures of products, live video clips, and video commercials can be managed, as well as how sample speech/video-enabled self-service applications for universities, travel, retail, and home health can be developed. Also, the use of the Adobe Flash Player, a popular standard for delivering rich Web content to develop multimedia content, will be explained and illustrated.

Speech-Enabled Video Applications: A New Level of Customer Service
Dr. Valentine Matula, Director Multimedia Research - Avaya

Download Presentation

Around the world (including in the U.S. in 2007), many consumers have access to live 2-way video. Learn how speech-enabled self-service applications can become even more effective by showing the caller a visual display or video at the same time that they use the speech application—menus, pictures of products, live video clips, and video commercials. See sample speech-enabled self-service and proactive contact/outbound applications for universities, travel, retail, and home healthcare, and hear about the process of application authoring.

Architecture for Web Multimodal Applications
Jan Sedivy, Manager, VTS - IBM

Download Presentation

Learn about extending the Adobe Flash Player with speech recognition. A lightweight, embedded VoiceXML browser (VoiceXML 2.0-compatible) is easy to control through XML protocol from Action Script to speech-enable existing or new Flash applications. The VoiceXML is controlled by a browser extension for the IE and Firefox browsers. The browser uses the IBM ViaVoice Embedded Engine for speech recognition. Hear the key aspects of the design and about the challenges faced during the implementation.

Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m

C204 – Speech-to-Text Transcription
4:15 p.m - 5:15 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC


Recognition technology has matured to the point that recorded telephone-quality audio from unknown speakers can be accurately transcribed. Applications such as speech-enabled e-mail have become highly needed in the mobile environment because typing is not always practical when using hand-held devices. Recent applications of speech-to-text for searching and transcribing voice data will be illustrated for other applications, including medical data transcription and the near-real-time conversion of voice mail to text.

Technology & Applications Associated with Broadcast Transcription
Sara Basson, Program Director, Speech Transcription Strategy - IBM

Download Presentation

As speech transcription technology improves and evolves, more opportunities emerge for captioning broadcast media. This presentation outlines some remaining challenges, such as latencies and understandability. It also addresses issues in combining speech transcription with other natural language technologies, such as search, translation, and named entity detection.

Are We Ready? A Look at the Latest Speech-to-Text Applications
Marie Meteer, Executive Director - Speech Technology Consortium

Download Presentation

Speech-to-text has steadily improved in accuracy during the past 2 decades, but the question remains: “Is it good enough?” The answer lies not in the technology, but in the applications. Using her experience with BBN’s STT engine, Marie Meteer describes how STT performance affects a variety of applications: where it works, where it fails, and where supporting technologies can make the difference.

Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m

TRACK D: INNOVATIVE SPEECH APPLICATIONS
Majestic Room (6th floor)
Speech Technology at the Tipping Point
(Broadway Ballroom)

9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink


The market for speech technology topped the $1 billion mark in 2006, up 100% from 2 years ago. As adoption of speech systems increases in all parts of our lives, from our computers to our cars to customer relationships, it is clear that speech technology is at the tipping point. Hear from Malcolm Gladwell, who coined this phrase, about how he sees ideas, behaviors, and patterns moving through a population in much the same way as an epidemic. He posits that ideas can become contagious and reach a tipping point, just as a virus reaches critical mass. Learn what it takes to start a social epidemic and create change and how to push speech technology, and your businesses, toward the tipping point.

D201 – Speech Applications for Emergencies
10:45 a.m - 12:00 p.m
MODERATOR: Emmett Coin, Speech Scientist - ejTalk


This session shows two case studies that illustrate how speech technology can be used in emergency situations. The Italian police use call steering to determine if the call is an emergency, and fire fighters use advanced, multimodal, mobile services enabling on-the-move rescuers to communicate and to share structured multimodal information resources including audio, video, text, graphics, and location information.

Speech Technology: Improving Large-Scale Disaster Management
Klaus Schäefer, CFO - Fire Department of Dortmund, Germany
Rainer Koch, Professor of Engineering - University Paderborn

Download Presentation

Hear how the Dortmund Fire Fighters (Germany) are using an advanced, multimodal, mobile service for emergency teams to handle large-scale rescue operations. The speakers will share their experience about how innovative technologies allow workers on the move to communicate naturally and bi-directionally and to share structured multimodal information resources, including audio, video, text, graphics, and location information.

Natural Language for Emergency Calls
Tom Hanson, Director of Product Management - Avaya

Download Presentation

Customer Case Study - Click to learn more!Natural language processing analyzes incoming calls to determine if any represent an emergency call. In this presentation, learn about the innovative techniques used by the Italian police.

Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN


 

D202 – Synthetic Agents that Speak & Listen
1:30 p.m - 2:30 p.m
MODERATOR: James A. Larson, Vice President - Larson Technical Services


Synthetic interviews enable callers to go beyond the picture or cultural object to better understand the event or concept it represents. For instance, callers may conduct synthetic interviews with individuals in photographs of historical events in a museum or objects in a cultural heritage site. Synthetic interviews represent a new way of interacting with objects that are usually only viewed. The first case study in this session describes a cell-phone application, and the second shows a multimodal application that contains a chat-bot. When applied to the world of advertising, synthetic agents will enable people to talk with advertisements in magazines, on billboards, and other advertising media.

Synthetic Agents as Virtual Tour Guides: Accessing Cultural Heritage Information through Cell Phones
Matt Nickerson, Special Projects Librarian & Honors Program Direct - Southern Utah University

Customer Case Study - Click to learn more!Many museums around the world rent audio players to their visitors to provide automated tours delivering prerecorded information about their exhibits. One alternative to the rental paradigm is using the patron’s own “digital sound player,” i.e., their personal cell phone. A system-directed dialog using VoiceXML can allow patrons to obtain a wide range of information through a menu-driven hierarchy or by questioning a synthetic agent virtual tour guide, docent, or teacher.

MAGA: A Mobile Archaeological Guide at Agrigento
Antonio Gentile, Associate Professor, Department of Computer Engine - University of Palermo

Download Presentation

Customer Case Study - Click to learn more!PDAs with ad hoc, built-in information retrieval and auto-localization functionalities can help museum visitors more naturally than using traditional audio/visual prerecorded guides. This presentation shows a userfriendly, multimodal virtual-guide system called MAGA, which is usable on multiple mobile devices (e.g., PDAs, Smartphones). An information retrieval service is accessible through a spoken-language interaction or an auto-localization service.

D203 – Speech in Banking
2:45 p.m - 3:45 p.m
Walter Rolandi Ph.D., Principal Usability Scientist - West Interactive
Marc Seltier, Communications Technologies Manager - HypoVerreinsbank Deutschland
Jeff Weiner, Senior Vice President, Telephone Channel - Bank of America
Jan Smith, Senior Program Manager - Nuance Communications


Customer Case Study - Click to learn more!Around the world, speech applications are being used to update account information and to transfer billions of dollars among accounts. Learn how two major banks have successfully deployed speech applications that revolutionized their operations. Speakers will describe solutions to problems encountered during development and deployment.

Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m

D204 – Touch to Voice: Learning from Older Consumers
4:15 p.m - 5:15 p.m
MODERATOR: Leonard Klie, Senior Editor, Speech Technology Magazine - Information Today, Inc.
Ryan Fox, Senior Vice President of Web/IVR Customer Service - Wachovia
Ava Baker, CEO & President - AJ Waters, LLC


Customer Case Study - Click to learn more!They said it couldn’t be done. In today’s high-tech environment as more and more older consumers interact with speech technology, businesses are trying to find ways to make the experience a pleasant one. Hear the latest research findings and how one organization is strategizing to find ways to engage older consumers.

Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m

TRACK E: TIPS & TECHNIQUES FOR IMPLEMENTING YOUR SPEECH APPLICATIONS
Wintergarden Room (6th floor)
Speech Technology at the Tipping Point
(Broadway Ballroom)

9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink


The market for speech technology topped the $1 billion mark in 2006, up 100% from 2 years ago. As adoption of speech systems increases in all parts of our lives, from our computers to our cars to customer relationships, it is clear that speech technology is at the tipping point. Hear from Malcolm Gladwell, who coined this phrase, about how he sees ideas, behaviors, and patterns moving through a population in much the same way as an epidemic. He posits that ideas can become contagious and reach a tipping point, just as a virus reaches critical mass. Learn what it takes to start a social epidemic and create change and how to push speech technology, and your businesses, toward the tipping point.

E201 – Tips & Techniques for Using Grammars
10:45 a.m - 12:00 p.m
MODERATOR: Dr. Daniel C Burnett, Director of Speech Technologies and Standards - Voxeo


Specifying grammars that cover the words spoken by users is one of the major keys to designing and implementing speech applications. Tuning grammars is a complex task and requires automated techniques to analyze large amounts of data. What techniques can designers and implementers use to build and tune grammars to handle unexpected events in caller responses such as background talk, side conversations, nonspeech events, restarts, and corrections? Can some of these techniques be automated?

Building Robust Grammars
Sunil Issar, Director, Architecture - Convergys Corporation

Download Presentation

Speech applications misrecognize many common events such as background talk, restarts, and corrections and treat them as valid responses with a high confidence score. This presentation discusses techniques for building robust grammars that handle these unexpected events and other reasons for frequent speech recognition errors and will also review experimental results using data from speech applications.

Building and Tuning Automated Directory Assistance Systems
Padma Ramesh, Senior VUI Developer - Nuance Communications
Krishnan Srinivasan, Senior Solutions Manager - Nuance Communications

Tuning grammars for large telcos and automating directory assistance services is a complex task and requires automated techniques to go over large amounts of untranscribed data. The key technical challenges relate to the integration of large vocabulary recognition of naturally spoken requests, disambiguation, and search and to developing automated approaches to building and tuning these grammars. In this presentation. hear two experts describe their experiences with building and tuning these grammars.

Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN


 

E202 – Speech Synthesizers on Steroids
1:30 p.m - 2:30 p.m
MODERATOR: Dr. Daniel C Burnett, Director of Speech Technologies and Standards - Voxeo


Text-to-speech synthesis is used to render dynamic data, such as newsfeeds and e-mail, and to avoid costly, time-consuming prerecording. The use of speech synthesis will increase as functionality, including pronunciation accuracy, natural timbre, intonation, and expressive voices that supply so much of an application’s persona, improves. Special attention will be given in this session to the challenging requirements of non-English-languages and new extensions to the Speech Synthesis Markup Language (SSML).

The Development of Chinese TTS Technology
Qiang Bai, Vice President - iFLYTEK

Download Presentation

Making a speech synthesis system for Chinese language has encountered many linguistic and phonetic problems that differ from Western languages. iFLYTEK has taken the language characteristics and constructed a Chinese TTS system with the help of the China national standard CSSML and the developing W3C SSML 1.1 specification. The 2008 Olympics and World Expo 2010 in Shanghai will boost the utilization of the Chinese TTS technology and the markup language for Chinese text.

New Applications & Contexts for Speech Technology
Davide Bonardo, Senior TTS Software Architect, Loquendo Technologies - Loquendo

Download Presentation

Davide Bonardo demonstrates the many TTS features available today in TTS products, how to use them to obtain effective prompts, and how to optimize TTS use. He will illustrate how to get the most out of the tools available in order to refine prompts for use in the widest variety of contexts.

E203 – Web Services & Speech
2:45 p.m - 3:45 p.m
MODERATOR: R.J. Auburn, Chief Technology Officer - Voxeo


A major problem facing speech application developers is integration with the rest of an enterprise’s IT infrastructures. Learn how to integrate speech user interfaces over a network with the rest of an enterprise’s IT infrastructure using service-oriented architecture (SOA) technology and Web services. The benefits of this approach include keeping your application portable and maintaining a clear separation of your call flow and your business logic.

Bridging IT and Telephony: Integrating Speech Applications with Enterprise IT Assets
Michael Codini, Chief Technical Officer & Co-Founder - VoiceObjects, Inc.

Download Presentation

One shortcoming of many speech applications is the lack of integration with the rest of an enterprise’s IT infrastructures. Web services integration between speech applications and other enterprise IT assets can remedy this situation. This presentation discusses service-oriented architecture (SOA) technology and Web services and the application of Web services to call center and other speech-enabled environments. See an example of a speech application that fully leverages CRM, BI, and other IT assets via Web services.

Expand VoiceXML & CCXML Using Web Services
Chris Passaretti, Senior Programmer/Analyst - Cablevision Systems Corporation

Download Presentation

This presentation discusses functionality beyond the VoiceXML/CCXML standards and suggests a standards-based Web service methodology. The benefits include keeping the application portable and a clear separation of the call flow and the business logic. Learn how Web services can help your speech application leverage other standards without comprising portability.

Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m

E204 – Deploying Speech Applications
4:15 p.m - 5:15 p.m
MODERATOR: Michael Perry, Director of Voice Self Service - Avaya
R.J. Auburn, Chief Technology Officer - Voxeo
Bob Cooper, CEO - Swampfox Inc.


Download Presentation

Even the most carefully planned deployments can go wrong. Experts in this session review lessons learned from years of customer deployments of applications with two to 20,000 ports. They discuss the features of Session Initiation Protocol (SIP), a lightweight, transport-independent protocol for messaging, and share tips on how to overcome the challenges of deploying a VoiceXML/CCXLM application into an SIP environment. Hear “R.J.’s Rules” for successful speech deployments, encapsulating lessons learned from processing more than a billion calls.

Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m





MarketPlace - Sponsored Links
ITIResearch.com
A collection of market research and reports for executive management and business & IT professionals
Silver
Bronze
Media