people

Mark C. Beutnagel

Beutnagel, Mark C.
180 Park Ave - Building 103
Florham Park, NJ
Subject matter expert in AT&T Natural Voices®, text to speech, SSML, text markup, text normaliztion, unit selection

Started at Bell Labs in 1986 in the text-to-speech group.  Initially handled system administration for the group.  Did DSP32 programming for synthesis and TTS software support.  Projects included programmable phone call automation with TTS responses, TTS on a DOS laptop (with no floating point or built-in audio device), and conversion of Email to Audix messages with TTS.

In 1996 became part of the new text to speech research group at AT&T Labs.   Merging old and new technologies, helped create and commercialize AT&T Natural Voices®.  Up to NV 3.0, worked with development groups to hand off research inprovments.  Since NV 4.0 have been responsible for releases and updates.  Also work with organizations using NV within AT&T and provide tier-3 support for Wizzard Software, which handles much of the external sales and licensing for NV.

For many years now, have monitored and maintained the heavily-used Natural Voices public demo, which demonstrates, educates and promotes our technology.  On average, more than 6000 people per  day use our site from all over the world. 

Projects
AT&T Natural VoicesTM Text-to-Speech, Natural Voices is AT&T's state-of-the-art text-to-speech product, taking text and producing natural-sounding, synthesized speech in a variety of voices and languages.

Technical Documents

Automatic Assessment of American English Lexical Stress using Machine Learning Algorithms
Yeon Kim, Mark Beutnagel
SLaTE-2011 workshop (Speech and Language Technology in Education),  2011.  [BIB]

Speech acts and dialog TTS
Ann Syrdal, Alistair Conkie, Yeon Kim, Mark Beutnagel
Seventh ISCA Speech Synthesis Workshop,  2010.  [BIB]

Patents

Methods And Apparatus For Rapid Acoustic Unit Selection From A Large Speech Corpus, Tue Nov 20 16:12:23 EST 2012
Methods And Apparatus For Rapid Acoustic Unit Selection From A Large Speech Corpus, Tue Dec 27 16:06:49 EST 2011
Method And System For Aligning Natural And Synthetic Video To Speech Synthesis, Tue Nov 30 15:05:08 EST 2010
Methods And Apparatus For Rapid Acoustic Unit Selection From A Large Speech Corpus, Tue Jul 20 15:04:13 EDT 2010
Methods and apparatus for rapid acoustic unit selection from a large speech corpus, Tue May 06 18:12:48 EDT 2008
Method and system for aligning natural and synthetic video to speech synthesis, Tue Apr 29 18:12:46 EDT 2008
Method and system for aligning natural and synthetic video to speech synthesis, Tue Sep 19 18:11:34 EDT 2006
Methods and apparatus for rapid acoustic unit selection from a large speech corpus, Tue Jul 25 18:11:26 EDT 2006
Advance TTS for facial animation, Tue Jul 11 18:11:24 EDT 2006
Employing Speech Models In Concatenative Speech Synthesis, Tue Sep 27 18:10:32 EDT 2005
Method and system for aligning natural and synthetic video to speech synthesis, Tue Mar 01 18:10:18 EST 2005
Integration Of Talking Heads And Text-To-Speech Synthesizers For visual TTS, Tue Jan 04 18:10:15 EST 2005
Methods and apparatus for rapid acoustic unit selection from a large speech corpus, Tue Mar 02 18:09:06 EST 2004
Method and apparatus for rapid acoustic unit selection from a large speech corpus, Tue Feb 24 18:09:05 EST 2004
Method And System For Aligning Natural And Synthetic Video To Speech Synthesis, Tue May 20 18:08:42 EDT 2003
Verbal, Fully Automatic Dictionary Updates By End-Users Of Speech Synthesis And Recognition Systems, Tue Jun 20 18:05:34 EDT 2000
graphviz

Connections

Graphviz