talk

AT&T WATSON(SM) Voice Recognition Technology & Speech API

Fri Apr 20 16:46:00 EDT 2012

AT&T WATSONSM is a general-purpose engine that performs a variety of speech and analysis tasks, including automatic speech recognition, natural language understanding (which includes machine learning), and dialog management tasks. WATSON also integrates AT&T Labs Natural Voices® text-to-speech conversion.

WATSON has been used within AT&T for IVR customers, including AT&T's VoiceTone® service, for over 20 years during which time the ASR algorithms, tools, and plug-in architecture have been refined to increase accuracy, convenience, and integration. Besides customer care IVR, AT&T WATSON has been used for speech analytics, speech translation (including the AT&T Translator app), mobile voice search of multimedia data, video search, voice remote, and voice mail to text. AT&T WATSON also supports Speak4it (local business search).

In June, APIs for AT&T WATSON speech recognition will be made available to developers who want to add voice recognition and transcription to their applications and merge voice recognition with applications, information databases, or specialized software that communicates with devices such as remote, video recorders.

The first APIs will support web search, local business search, Q&A, voice mail to text, SMS, AT&T's U-verse video programming guide, and a general-purpose dictation API, while providing vocabularies and language models focused on specific tasks. Also provided will be the Speech Kit SDK so developers can create software that captures spoken words and sends them into a network for transcription.