|
|
|
SpeechTek 2007 - Advance Program
August 18-20, 2008 • New York Marriott Marquis • New York, NY
|
|
|
|
|
|
TRACK A: THE YEAR OF LIVING VIRTUALLY: HOSTED & MANAGED SERVICES
|
|
Soho Room (7th floor)
|
Speech Technology at the Tipping Point
(Broadway Ballroom)
9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink
 |
The
market for speech technology topped the $1 billion mark in 2006, up
100% from 2 years ago. As adoption of speech systems increases in all
parts of our lives, from our computers to our cars to customer
relationships, it is clear that speech technology is at the tipping
point. Hear from Malcolm Gladwell, who coined this phrase, about how he
sees ideas, behaviors, and patterns moving through a population in much
the same way as an epidemic. He posits that ideas can become contagious
and reach a tipping point, just as a virus reaches critical mass. Learn
what it takes to start a social epidemic and create change and how to
push speech technology, and your businesses, toward the tipping point.
|
|
A201 – The Outsourcer’s Perspective
10:45 a.m - 12:00 p.m
MODERATOR: Mr Dan Miller, Senior Analyst - Opus Research, Inc. Lynn Jobe, Sales Director, Strategic Products - Intervoice Henry McCreary, Chief Architect and Director of Telecommunications - CSX Technology
|
Outsourcing
is a common practice and is often the best practice for costeffective,
high-quality customer service and self-service. In this session,
leading service providers and their customers introduce and describe
the criteria that influenced their decision to outsource and their
experiences with the team of application developers, carriers, and
hosted services providers.
|
|
Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN
|
A202 – Contrasting Service Provider Approaches
1:30 p.m - 2:30 p.m
MODERATOR: Clegg Ivey, VP, Mergers & Acquisitions, General Counsel - Voxeo John Hibel, Vice President of Marketing - Voxeo Ron Owens, Director, Multimedia Applications PSO - Nortel Tom Smith, Senior Manager - Verizon Digital Media Services Mike Moors, Director, Sales - Genesys
|
Nearly
every service provider offers savings in capital expense and contact
center operating costs. Increasingly, service providers differentiate
themselves through partnering strategies, technology in use,
application development and management resources, and support of
multiple modalities of customer care and self-service.
|
|
A203 – Meeting Business Challenges
2:45 p.m - 3:45 p.m
MODERATOR: Mr Dan Miller, Senior Analyst - Opus Research, Inc. Mark Abramson, CEO - Message Technologies Inc. Tim Moynihan, Vice President of Marketing, Enterprise Solutions - EMPIRIX Steven Pollock, Executive Vice President & Co-Founder - TuVox Michael X. Zirngibl, Founder & President - Angel.com Incorporated
|
This
has been the year of the end user. With so much emphasis on the
customer experience, being able to identify snags in self-service
applications at a highly granular level and to make corrections
dynamically is crucial. In this session, application framework
providers describe how they support application development that meets
customer service objectives.
|
|
Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m
|
A204 – View of the Global Future
4:15 p.m - 5:15 p.m
MODERATOR: Mr Dan Miller, Senior Analyst - Opus Research, Inc. Juan Ceballos, Chief Technology Officer - Ydilo Dr. Andrew Hunt, Vice President, Engineering - Holly Connects Susan Wilson, Director Business Development, DA Services - Nuance Communications
|
In
this session, the biggest stakeholders in global, outsourced
communications discuss how the distributed, virtualized contact center
will integrate speech automation with multimodal communications
(blending live agents with IM, chat, and video messaging) and what this
means for the future of customer service.
|
|
Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m
|
|
TRACK B: DESIGNING THE SPEECH CUSTOMER EXPERIENCE
|
|
Empire Room (7th floor)
|
Speech Technology at the Tipping Point
(Broadway Ballroom)
9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink
 |
The
market for speech technology topped the $1 billion mark in 2006, up
100% from 2 years ago. As adoption of speech systems increases in all
parts of our lives, from our computers to our cars to customer
relationships, it is clear that speech technology is at the tipping
point. Hear from Malcolm Gladwell, who coined this phrase, about how he
sees ideas, behaviors, and patterns moving through a population in much
the same way as an epidemic. He posits that ideas can become contagious
and reach a tipping point, just as a virus reaches critical mass. Learn
what it takes to start a social epidemic and create change and how to
push speech technology, and your businesses, toward the tipping point.
|
|
B201 – Para Continuar en Espanol
10:45 a.m - 12:00 p.m
MODERATOR: Eduardo Olvera, Senior User Interface Designer, Professional Services - Nuance Communications
|
The
world is getting smaller, and increasingly this is reflected in speech
applications that include more than one language. Enabling access to a
speech application in another language is not a simple matter of
translating the prompts and switching the recognition algorithm to the
other language. Experts in this session offer tips on adding a second
language to an existing application and how best to approach the design
of new multilingual applications.
|
The Use of Spanglish in Speech Systems Designed for Hispanics in the U.S.
Jose L Elizondo, Senior Manager, Multilingual VUI Design - Nuance Communications
What
type of Spanish is the best to use for the Hispanic population in the
U.S., who have backgrounds from more than 20 different countries. One
solution that is not discussed often enough and that upsets language
purists is mixing English and Spanish, or even creating hybrid words in
“Spanglish.” This presentation addresses linguistic facts about
Hispanics in the U.S. and the appropriateness of using mixed-language
phrases and Spanglish words.
|
Adding a Second Language to Your Speech Application
Aimee Piercy, User Interface Designer - Nuance Communications
There
are many challenges to adding a second language to your speech
application. Hear useful advice in this presentation to help you ensure
a smooth design and development process, including real-life, concrete
examples drawn from practical experience working on various Spanish
voice applications.
|
Insights to Multilingual Applications
Prabha Sundaram, Speech Specialist - Nortel
There
is a growing need to design and develop speech-enabled, self-care
applications that are bi- or multilingual (e.g., Spanglish, Hinglish
etc.). In these design projects, it is important to understand cultural
differences, analyze speaking styles, sentence constructs, etc. Learn
about challenges and solutions for tailoring and designing these
applications.
|
|
Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN
|
B202 – Getting the VUI Right When Recognition Goes Wrong
1:30 p.m - 2:30 p.m
MODERATOR: David C Martin, Managing Principal, Self Service Solutions EMEA, Professional Services - Avaya
|
Recognition
errors are a fact of life for speech systems. The job of a good voice
user interface is to make the errors as inconsequential as possible to
the end user. Learn how to handle confirmations to build user
confidence without becoming a burden, and hear about new approaches to
error-handling for natural language applications.
|
To Confirm, or Not to Confirm ... That Is the Question
Kristie Goss, Senior Analyst, VUI Designer - Convergys Corporation
Confirmations
are a critical piece to the VUI puzzle. This presentation investigates
various confirmation strategies and illustrates successful phrasing
techniques, including a natural-sounding way to confirm dynamic data.
Attendees will expand their confirmation toolkit and learn how to
improve caller satisfaction and decrease call duration in their own IVR
applications.
|
Fault Tolerance at the VUI Level
Daniel Padgett, Senior Speech Consultant - Voice Partners/VoxGen Jessica Hicks, Speech Consultant - Versay Solutions
Hear
about proven strategies for building fault-tolerant VUIs for smarter
speech applications. The speakers discuss the use of statistical
language models, error strategies tailored to specific call paths, and
the careful use of confirmations to minimize errors, improve
performance, and increase overall customer satisfaction.
|
|
B203 – Making It Personal
2:45 p.m - 3:45 p.m
MODERATOR: Tom Houwing, Director - voiceandvision B.V.
|
Speech
applications used to be conceived of as one-size-fits-all interactions
in which every user had the same experience. Increasingly,
organizations are capitalizing on data they already know about
customers to tailor the experience specifically for them. Whatever you
know about the customer can be used to predict the likely reason for
the call and present relevant information and services proactively.
Learn techniques for creating personalized interactions that make calls
seem efficient, productive, and surprisingly satisfying.
|
Creating a Personalized User Experience
Barb Mackraz, Principal - Mackraz Design, LLC
A
great application is not only smart and engaging, but also relevant.
This session looks at how you can draw on intelligence about a user to
adapt and tailor the experience for him. Topics include streamlining a
user interface and offering features proactively based on usage
patterns and predictions of intent—and logistical challenges you’ll
encounter. This approach to design is especially effective for mobile
applications in which a phone identifies a single user.
|
Developing Dynamic Personalized Contact Center Applications with Cisco
Cory Wright, Sr Marketing Manager, Customer Contact Business - Cisco
This
session shows how to create a personalized contact center application
using Cisco Call Studio, speech recognition, VoiceXML, and Web or
enterprise applications. Learn how the Cisco Customer Voice Portal and
interactive voice response technology built into Cisco IOS work
together to allow you to build and deliver self-service applications.
|
|
Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m
|
B204 – Watch Your Grammar
4:15 p.m - 5:15 p.m
MODERATOR: Dr. Juan E. Gilbert, IDEaS Professor & Chair, Division of Human-Centered Computing - Clemson University Judi Halperin, Principal Consultant and Team Lead, Global Speech Engineering, Contact Center Practice - Avaya Inc. Mr. David L Thomson, PMTS - AT&T Labs
|
Speech
applications are only as effective as the grammars they reference. As
such, grammars should be constrained as tightly as possible, and
grammar over-generation should be avoided at all costs. This session’s
speakers discuss various methods available to help constrain grammars,
as well as some grammar development pitfalls and the testing
methodologies that will help avoid them.
|
|
Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m
|
|
TRACK C: ADVANCED SPEECH TECHNOLOGY SYMPOSIUM
|
|
Shubert Room (6th floor)
|
Speech Technology at the Tipping Point
(Broadway Ballroom)
9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink
 |
The
market for speech technology topped the $1 billion mark in 2006, up
100% from 2 years ago. As adoption of speech systems increases in all
parts of our lives, from our computers to our cars to customer
relationships, it is clear that speech technology is at the tipping
point. Hear from Malcolm Gladwell, who coined this phrase, about how he
sees ideas, behaviors, and patterns moving through a population in much
the same way as an epidemic. He posits that ideas can become contagious
and reach a tipping point, just as a virus reaches critical mass. Learn
what it takes to start a social epidemic and create change and how to
push speech technology, and your businesses, toward the tipping point.
|
|
C201 – New Approaches to Dialog Design
10:45 a.m - 12:00 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC
|
As
designers are urged to create ever-more sophisticated self-service
applications, the pressure for evolving new techniques grows in
importance. New Eclipse-based graphical tools oriented around the
identification, definition, and reuse of hierarchical dialog patterns
and novel nonlinear call flows assisted by agents are described in this
dialog design session.
|
A Graphical Tool for Pattern-Based Dialog Design
Dominique Boucher, Lead Software Developer - Nu Echo Inc.
This
presentation shows an Eclipse-based, graphical environment for
developing speech applications that specifically addresses the problem
of capturing and expressing recurring dialog patterns. This tool
transforms the process of designing and implementing dialogs by
specifically orienting the design process around the identification,
definition, and reuse of hierarchical dialog patterns.
|
Non-Linear Call Flow Design
Clifford Harlow, Vice President, Client Services - Spoken Communications
Most
speech IVR applications are unable to skip utterances that they don’t
understand. In contrast, live agents can gather information out of
sequence or discern intent. By uniquely combining speech technology
with humans, callers can have a more natural, free-flowing self-service
experience because they are not locked into a rigid call flow.
|
Adaptive Voice Dialogs Based on Automatic Speaker Classification
Joachim Stegmann, Head, Advanced Voice Solutions - T-Systems Enterprise Services GmbH
This
presentation describes the technology and applications of automatic
speaker classification (e.g., age, gender, language, and emotions) in
voice portals. It shows how dialog parameters should be adapted to
achieve improved user acceptance in IVR systems. The first results from
pilot implementations within Deutsche Telekom prove the feasibility and
show advantages compared to conventional, non-adaptive systems.
|
|
Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN
|
C202 – Artificial Intelligence & VUI Design
1:30 p.m - 2:30 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC
|
The
growing sophistication of VUI designs demands the incorporation of new
technologies, including those borrowed from other disciplines. This
session focuses on the novel application of artificial intelligence
technology concurrently using a dialog engine and a problem-solving
engine. It also illustrates the use of natural language to understand
the semantics and context of any phrase being processed, making it much
easier to develop the answers.
|
Artificial Intelligence in Voice Self-Service Applications
Mahesh Rajagopalan, President & Co - Resolvity Dr. Jacek Jarmulak, Senior AI Scientist - Resolvity
This
presentation discusses how AI technologies may be used in voice
self-service applications to separate the product support logic from
the call flow logic, take advantage of the problem-solver’s
knowledgebase to develop dialogs, improve speech recognition, create
dynamic call flows, and provide an effective and efficient
troubleshooting experience. Learn about strengths and weaknesses,
rule-based systems, Bayesian inference, decision-trees, and
knowledgebases.
|
Improve Your VUI Design with an AI-Based Conversational Dialog Solution
Peter Trompetter, Vice President, Global Development - GyrusLogic
Natural
language understanding is an excellent augmentation to an existing or
new VUI for better automated call completion and customer satisfaction.
Hear about a solution that makes it easier to develop the natural
language application after understanding the semantics and context of
any phrase.
|
|
C203 – Advances in Video & Multimedia Application Design
2:45 p.m - 3:45 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC
|
The
availability of a robust 3G infrastructure throughout Europe and much
of Asia has released pent-up customer demand to add live video to
extend the utility of voice communications. This session illustrates
how video menus, pictures of products, live video clips, and video
commercials can be managed, as well as how sample speech/video-enabled
self-service applications for universities, travel, retail, and home
health can be developed. Also, the use of the Adobe Flash Player, a
popular standard for delivering rich Web content to develop multimedia
content, will be explained and illustrated.
|
Speech-Enabled Video Applications: A New Level of Customer Service
Dr. Valentine Matula, Director Multimedia Research - Avaya
Around
the world (including in the U.S. in 2007), many consumers have access
to live 2-way video. Learn how speech-enabled self-service applications
can become even more effective by showing the caller a visual display
or video at the same time that they use the speech application—menus,
pictures of products, live video clips, and video commercials. See
sample speech-enabled self-service and proactive contact/outbound
applications for universities, travel, retail, and home healthcare, and
hear about the process of application authoring.
|
Architecture for Web Multimodal Applications
Jan Sedivy, Manager, VTS - IBM
Learn
about extending the Adobe Flash Player with speech recognition. A
lightweight, embedded VoiceXML browser (VoiceXML 2.0-compatible) is
easy to control through XML protocol from Action Script to
speech-enable existing or new Flash applications. The VoiceXML is
controlled by a browser extension for the IE and Firefox browsers. The
browser uses the IBM ViaVoice Embedded Engine for speech recognition.
Hear the key aspects of the design and about the challenges faced
during the implementation.
|
|
Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m
|
C204 – Speech-to-Text Transcription
4:15 p.m - 5:15 p.m
MODERATOR: K.W.'Bill' Scholz, President - NewSpeech LLC
|
Recognition
technology has matured to the point that recorded telephone-quality
audio from unknown speakers can be accurately transcribed. Applications
such as speech-enabled e-mail have become highly needed in the mobile
environment because typing is not always practical when using hand-held
devices. Recent applications of speech-to-text for searching and
transcribing voice data will be illustrated for other applications,
including medical data transcription and the near-real-time conversion
of voice mail to text.
|
Technology & Applications Associated with Broadcast Transcription
Sara Basson, Program Director, Speech Transcription Strategy - IBM
As
speech transcription technology improves and evolves, more
opportunities emerge for captioning broadcast media. This presentation
outlines some remaining challenges, such as latencies and
understandability. It also addresses issues in combining speech
transcription with other natural language technologies, such as search,
translation, and named entity detection.
|
Are We Ready? A Look at the Latest Speech-to-Text Applications
Marie Meteer, Executive Director - Speech Technology Consortium
Speech-to-text
has steadily improved in accuracy during the past 2 decades, but the
question remains: “Is it good enough?” The answer lies not in the
technology, but in the applications. Using her experience with BBN’s
STT engine, Marie Meteer describes how STT performance affects a
variety of applications: where it works, where it fails, and where
supporting technologies can make the difference.
|
|
Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m
|
|
TRACK D: INNOVATIVE SPEECH APPLICATIONS
|
|
Majestic Room (6th floor)
|
Speech Technology at the Tipping Point
(Broadway Ballroom)
9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink
 |
The
market for speech technology topped the $1 billion mark in 2006, up
100% from 2 years ago. As adoption of speech systems increases in all
parts of our lives, from our computers to our cars to customer
relationships, it is clear that speech technology is at the tipping
point. Hear from Malcolm Gladwell, who coined this phrase, about how he
sees ideas, behaviors, and patterns moving through a population in much
the same way as an epidemic. He posits that ideas can become contagious
and reach a tipping point, just as a virus reaches critical mass. Learn
what it takes to start a social epidemic and create change and how to
push speech technology, and your businesses, toward the tipping point.
|
|
D201 – Speech Applications for Emergencies
10:45 a.m - 12:00 p.m
MODERATOR: Emmett Coin, Speech Scientist - ejTalk
|
This
session shows two case studies that illustrate how speech technology
can be used in emergency situations. The Italian police use call
steering to determine if the call is an emergency, and fire fighters
use advanced, multimodal, mobile services enabling on-the-move rescuers
to communicate and to share structured multimodal information resources
including audio, video, text, graphics, and location information.
|
Speech Technology: Improving Large-Scale Disaster Management
Klaus Schäefer, CFO - Fire Department of Dortmund, Germany Rainer Koch, Professor of Engineering - University Paderborn
Hear
how the Dortmund Fire Fighters (Germany) are using an advanced,
multimodal, mobile service for emergency teams to handle large-scale
rescue operations. The speakers will share their experience about how
innovative technologies allow workers on the move to communicate
naturally and bi-directionally and to share structured multimodal
information resources, including audio, video, text, graphics, and
location information.
|
Natural Language for Emergency Calls
Tom Hanson, Director of Product Management - Avaya
Natural
language processing analyzes incoming calls to determine if any
represent an emergency call. In this presentation, learn about the
innovative techniques used by the Italian police.
|
|
Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN
|
D202 – Synthetic Agents that Speak & Listen
1:30 p.m - 2:30 p.m
MODERATOR: James A. Larson, Vice President - Larson Technical Services
|
Synthetic
interviews enable callers to go beyond the picture or cultural object
to better understand the event or concept it represents. For instance,
callers may conduct synthetic interviews with individuals in
photographs of historical events in a museum or objects in a cultural
heritage site. Synthetic interviews represent a new way of interacting
with objects that are usually only viewed. The first case study in this
session describes a cell-phone application, and the second shows a
multimodal application that contains a chat-bot. When applied to the
world of advertising, synthetic agents will enable people to talk with
advertisements in magazines, on billboards, and other advertising media.
|
Synthetic Agents as Virtual Tour Guides: Accessing Cultural Heritage Information through Cell Phones
Matt Nickerson, Special Projects Librarian & Honors Program Direct - Southern Utah University
Many
museums around the world rent audio players to their visitors to
provide automated tours delivering prerecorded information about their
exhibits. One alternative to the rental paradigm is using the patron’s
own “digital sound player,” i.e., their personal cell phone. A
system-directed dialog using VoiceXML can allow patrons to obtain a
wide range of information through a menu-driven hierarchy or by
questioning a synthetic agent virtual tour guide, docent, or teacher.
|
MAGA: A Mobile Archaeological Guide at Agrigento
Antonio Gentile, Associate Professor, Department of Computer Engine - University of Palermo
PDAs
with ad hoc, built-in information retrieval and auto-localization
functionalities can help museum visitors more naturally than using
traditional audio/visual prerecorded guides. This presentation shows a
userfriendly, multimodal virtual-guide system called MAGA, which is
usable on multiple mobile devices (e.g., PDAs, Smartphones). An
information retrieval service is accessible through a spoken-language
interaction or an auto-localization service.
|
|
D203 – Speech in Banking
2:45 p.m - 3:45 p.m
Walter Rolandi Ph.D., Principal Usability Scientist - West Interactive Marc Seltier, Communications Technologies Manager - HypoVerreinsbank Deutschland Jeff Weiner, Senior Vice President, Telephone Channel - Bank of America Jan Smith, Senior Program Manager - Nuance Communications
|
Around
the world, speech applications are being used to update account
information and to transfer billions of dollars among accounts. Learn
how two major banks have successfully deployed speech applications that
revolutionized their operations. Speakers will describe solutions to
problems encountered during development and deployment.
|
|
Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m
|
D204 – Touch to Voice: Learning from Older Consumers
4:15 p.m - 5:15 p.m
MODERATOR: Leonard Klie, Senior Editor, Speech Technology Magazine - Information Today, Inc. Ryan Fox, Senior Vice President of Web/IVR Customer Service - Wachovia Ava Baker, CEO & President - AJ Waters, LLC
|
They
said it couldn’t be done. In today’s high-tech environment as more and
more older consumers interact with speech technology, businesses are
trying to find ways to make the experience a pleasant one. Hear the
latest research findings and how one organization is strategizing to
find ways to engage older consumers.
|
|
Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m
|
|
TRACK E: TIPS & TECHNIQUES FOR IMPLEMENTING YOUR SPEECH APPLICATIONS
|
|
Wintergarden Room (6th floor)
|
Speech Technology at the Tipping Point
(Broadway Ballroom)
9:00 a.m - 10:00 a.m
Malcom Gladwell, Author - The Tipping Point & Blink
 |
The
market for speech technology topped the $1 billion mark in 2006, up
100% from 2 years ago. As adoption of speech systems increases in all
parts of our lives, from our computers to our cars to customer
relationships, it is clear that speech technology is at the tipping
point. Hear from Malcolm Gladwell, who coined this phrase, about how he
sees ideas, behaviors, and patterns moving through a population in much
the same way as an epidemic. He posits that ideas can become contagious
and reach a tipping point, just as a virus reaches critical mass. Learn
what it takes to start a social epidemic and create change and how to
push speech technology, and your businesses, toward the tipping point.
|
|
E201 – Tips & Techniques for Using Grammars
10:45 a.m - 12:00 p.m
MODERATOR: Dr. Daniel C Burnett, Director of Speech Technologies and Standards - Voxeo
|
Specifying
grammars that cover the words spoken by users is one of the major keys
to designing and implementing speech applications. Tuning grammars is a
complex task and requires automated techniques to analyze large amounts
of data. What techniques can designers and implementers use to build
and tune grammars to handle unexpected events in caller responses such
as background talk, side conversations, nonspeech events, restarts, and
corrections? Can some of these techniques be automated?
|
Building Robust Grammars
Sunil Issar, Director, Architecture - Convergys Corporation
Speech
applications misrecognize many common events such as background talk,
restarts, and corrections and treat them as valid responses with a high
confidence score. This presentation discusses techniques for building
robust grammars that handle these unexpected events and other reasons
for frequent speech recognition errors and will also review
experimental results using data from speech applications.
|
Building and Tuning Automated Directory Assistance Systems
Padma Ramesh, Senior VUI Developer - Nuance Communications Krishnan Srinivasan, Senior Solutions Manager - Nuance Communications
Tuning
grammars for large telcos and automating directory assistance services
is a complex task and requires automated techniques to go over large
amounts of untranscribed data. The key technical challenges relate to
the integration of large vocabulary recognition of naturally spoken
requests, disambiguation, and search and to developing automated
approaches to building and tuning these grammars. In this presentation.
hear two experts describe their experiences with building and tuning
these grammars.
|
|
Keynote Luncheon - The Speed of Technology
12:00 p.m - 1:30 p.m
Kevin L. Childs, President - UCN
|
E202 – Speech Synthesizers on Steroids
1:30 p.m - 2:30 p.m
MODERATOR: Dr. Daniel C Burnett, Director of Speech Technologies and Standards - Voxeo
|
Text-to-speech
synthesis is used to render dynamic data, such as newsfeeds and e-mail,
and to avoid costly, time-consuming prerecording. The use of speech
synthesis will increase as functionality, including pronunciation
accuracy, natural timbre, intonation, and expressive voices that supply
so much of an application’s persona, improves. Special attention will
be given in this session to the challenging requirements of
non-English-languages and new extensions to the Speech Synthesis Markup
Language (SSML).
|
The Development of Chinese TTS Technology
Qiang Bai, Vice President - iFLYTEK
Making
a speech synthesis system for Chinese language has encountered many
linguistic and phonetic problems that differ from Western languages.
iFLYTEK has taken the language characteristics and constructed a
Chinese TTS system with the help of the China national standard CSSML
and the developing W3C SSML 1.1 specification. The 2008 Olympics and
World Expo 2010 in Shanghai will boost the utilization of the Chinese
TTS technology and the markup language for Chinese text.
|
New Applications & Contexts for Speech Technology
Davide Bonardo, Senior TTS Software Architect, Loquendo Technologies - Loquendo
Davide
Bonardo demonstrates the many TTS features available today in TTS
products, how to use them to obtain effective prompts, and how to
optimize TTS use. He will illustrate how to get the most out of the
tools available in order to refine prompts for use in the widest
variety of contexts.
|
|
E203 – Web Services & Speech
2:45 p.m - 3:45 p.m
MODERATOR: R.J. Auburn, Chief Technology Officer - Voxeo
|
A
major problem facing speech application developers is integration with
the rest of an enterprise’s IT infrastructures. Learn how to integrate
speech user interfaces over a network with the rest of an enterprise’s
IT infrastructure using service-oriented architecture (SOA) technology
and Web services. The benefits of this approach include keeping your
application portable and maintaining a clear separation of your call
flow and your business logic.
|
Bridging IT and Telephony: Integrating Speech Applications with Enterprise IT Assets
Michael Codini, Chief Technical Officer & Co-Founder - VoiceObjects, Inc.
One
shortcoming of many speech applications is the lack of integration with
the rest of an enterprise’s IT infrastructures. Web services
integration between speech applications and other enterprise IT assets
can remedy this situation. This presentation discusses service-oriented
architecture (SOA) technology and Web services and the application of
Web services to call center and other speech-enabled environments. See
an example of a speech application that fully leverages CRM, BI, and
other IT assets via Web services.
|
Expand VoiceXML & CCXML Using Web Services
Chris Passaretti, Senior Programmer/Analyst - Cablevision Systems Corporation
This
presentation discusses functionality beyond the VoiceXML/CCXML
standards and suggests a standards-based Web service methodology. The
benefits include keeping the application portable and a clear
separation of the call flow and the business logic. Learn how Web
services can help your speech application leverage other standards
without comprising portability.
|
|
Break — Visit the Exhibit Hall
3:45 p.m - 4:15 p.m
|
E204 – Deploying Speech Applications
4:15 p.m - 5:15 p.m
MODERATOR: Michael Perry, Director of Voice Self Service - Avaya R.J. Auburn, Chief Technology Officer - Voxeo Bob Cooper, CEO - Swampfox Inc.
|
Even
the most carefully planned deployments can go wrong. Experts in this
session review lessons learned from years of customer deployments of
applications with two to 20,000 ports. They discuss the features of
Session Initiation Protocol (SIP), a lightweight, transport-independent
protocol for messaging, and share tips on how to overcome the
challenges of deploying a VoiceXML/CCXLM application into an SIP
environment. Hear “R.J.’s Rules” for successful speech deployments,
encapsulating lessons learned from processing more than a billion calls.
|
|
Evening Attendee Reception in the Exhibit Hall
5:30 p.m - 7:00 p.m
|
ITIResearch.com
A collection of market research and reports for executive management and business & IT professionals
|
|
Silver
Bronze
Media
|