Bibliography for M. Rahim (mazin)
Robust Speech Recognition
- Goffin, V., Allauzen, C., Bocchieri, E., Hakkani-Tur, D., Ljolje, A., Parthasarathy, S., Rahim, M., Riccardi, G., Saraclar, M.,
``The AT&T Watson Speech Recognizer''
Proc. Int. Conf. Acoust., Speech & Sig. Proc., March 2005.
- Kim, H.-K., and Rahim, M.,
``Why speech recognizers make errors - A robustness view''
Int. Conf. Speech & Lang. Proc., Korea, October 2004.
- *Rahim, M., Riccardi, G., Saul, L., Wright, J., Buntschuh, B., and Gorin, A.,
``Robust numeric recognition in spoken language dialog''
Speech Communication, Vol 34, pp. 195-212, 2001.
- *Saul, L.,
Rahim, M., and Allen, J.,
``A statistical model for robust integration of narrowband cues in speech''
Computer Speech and Language, Vol 15, pp. 175-194, 2001.
- Furst, M., Allen, J., Saul, L. and
Rahim, M.
``Human speech perception in narrow bands''
Israel Society for Auditory Research, Tel Aviv, Israel, pp. 10-15, Oct. 2000.
- Allen, J., Furst, M., Saul, L. and
Rahim, M.
``Feature extraction in human speech recognition''
The Nature of Speech Recognition Workshop, Lake Tahoe, CA, pp. 23-27, Aug. 2000.
- Allen, J., Furst, M., Saul, L. and
Rahim, M.
``Feature extraction in human speech recognition''
The Nature of Speech Recognition Workshop, Utrecht, Netherlands, pp. 3-7, July 2000.
- Saul, L.,
Rahim, M. and Allen, J.,
``Learning from examples in critical bands of speech,''
Proc. IEEE ASR Workshop, Keystone, 1999.
- *Ephraim, Y. and
Rahim, M.,
``On second order statistics and linear estimation of cepstral coefficients,''
IEEE Transactions on Speech and Audio Processing,
vol. 7, no. 2, 1999,
pp. 162-176.
- Rahim, M.,
Riccardi, G.,
Wright, J.,
Buntschuh, B. and
Gorin, A.,
``Robust automatic speech recognition in a natural spoken dialog,''
Workshop on Robust Methods for Speech Recognition in Adverse Condition,
Tampere, Finland, 1999.
- *Surendran, A. C.,
Lee, C-H. and
Rahim, M.,
``Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition,''
IEEE Transactions on Speech and Audio Processing, 1999.
- *Lawrence, C., and
Rahim, M.,
``Integrated bias removal techniques for robust speech recognition,''
Computer Speech and Language,
pp. 283-298, 1999.
- Ephraim, Y. and
Rahim, M.,
``On second order statistics and linear estimation of cepstral coefficients,''
Proc. Int. Conf. Acoust., Speech, Signal Processing, no. SP29.2, 1998.
- Lawrence, C. and
Rahim, M.,
``Integrated bias removal techniques for robust speech recognition,''
Eurospeech '97,
Rhodes, Greece, 1997.
- Surendran, A.,
Lee, C. and
Rahim, M.,
``Unsupervised smooth training of feed-forward neural networks for mismatch compensation,''
Automatic Speech Recognition Workshop,
Santa Barbara, 1997.
- Chou, W.,
Rahim, M. and
Buhrke, E.,
``Signal conditioned minimum error rate training for continuous speech recognition,''
Proc. Euro-Speech '95, 1995.
- Chou, W.,
Seshadri, N. and
Rahim, M.,
``Trellis encoded vector quantization for robust speech recognition,''
Proc. ICSLP '96,
Philadelphia, Oct. 1996,
pp. 2001-2004.
- *Rahim, M. and
Juang, B-H.,
``Signal bias removal by maximum likelihood estimation for robust telephone speech recognition,''
IEEE Trans. Speech & Audio Proc.,
vol. IV, no. 1, Jan. 1996,
pp. 19-30.
- *Rahim, M.,
Juang, B-H.,
Chou, W. and
Buhrke, E.,
``Signal conditioning techniques for robust speech recognition,''
IEEE Signal Processing Letters,
vol. 3, 1996.
- Surendran, A. C.,
Lee, C-H. and
Rahim, M.,
``Maximum likelihood stochastic matching approach to non-linear equalization for robust speech recognition,''
Int. Conf. Speech & Lang. Proc.,
Philadelphia, 1996.
- Rahim, M. and Juang, B-H.,
``Signal bias removal for robust telephone based speech recognition in adverse environments,''
Proc. Int. Conf. Acoustic, Speech & Sig. Proc., April 1994.
Speech Data Mining
- *Douglas, S., Agarwal, D., Alonso, T., Bell, R., Gilbert, M., Swayne, D., and Volinsky, C.,
``Mining Customer Care Dialogs for "Daily News",''
Special issue on Data Mining of Speech, Audio and Dialog,
IEEE Trans. Speech & Audio Proc., September 2005.
- Feng, J., Srinivas, B., and Rahim, M.,
``An Evaluation Study of WebTalk Question/Answering,''
International Conference on Speech and Language Processing,
Korea, October 2004.
- Douglas, S., Agarwal, D., Alonso, T., Bell, R., Rahim, M., Swayne, D., and Volinsky, C.,
``Mining Customer Care Dialogs for "Daily News",''
International Conference on Speech and Language Processing,
Korea, October 2004.
- Feng, J., Bangalore, S., Rahim, M.,
``WebTalk: Mining websites for automatically building dialog systems'',
Proc. IEEE ASR Workshop, Virgin Islands, 2003.
- Feng, J., Bangalore, S., Rahim, M.,
``WebTalk: Towards automatically building dialog services by exploiting the content and structure of websites'',
Int. Conf. on World Wide Web, Budapest, May 2003.
Spoken Language Understanding and Applications
- *Gilbert, M., Wilpon, J., Stern, B., and Di Fabbrizio, G.,
``Intelligent Virtual Agents for Contact Center Automation''
IEEE Signal Processing Magazine, September 2005.
- *Gupta, N., Tur, G., Tur, D., Bangalore, S., Riccardi, G., and Gilbert, M.,
``The AT&T Spoken Language Understanding System,''
IEEE Trans. Speech & Audio Proc., To be published 2005.
- *Schapire, R., Rochery, M., Rahim, M., and Gupta, N.,
``Incorporating prior knowledge into boosting for Text and Speech Classification,''
IEEE Trans. Speech & Audio Proc., March 2005.
- Tur, D., Tur, G., Rahim, M. and Riccardi, G.,
``Unsupervised and Active Learning in Automatic Speech Recognition for Call Classification,''
Proc. Int. Conf. Acoustic, Speech & Sig. Proc., Montreal, May 2004.
- Tur, G., Rahim, M. and Tur, D.,
``Active Labeling for Spoken Language Understanding,''
Proc. Euro-Speech '03, Geneva, Sep 2003.
- S. Bangalore, N. Gupta, M. Rahim,
``Extracting Clauses for Spoken Language Understanding in Conversational Systems'',
International Conference on Speech and Language Processing,
Colorado, 2002.
- G. Di Fabbrizio, D. Dutton, N. Gupta, B. Hollister, M. Rahim, G. Riccardi, R. Schapire, J. Schroeter,
``The AT&T Help Desk'',
International Conference on Speech and Language Processing,
Colorado, 2002.
- R. Schapire, M. Rochery, M. Rahim and N. Gupta,
``Incorporating prior knowledge into boosting,''
In Machine Learning: Proceedings of the Nineteenth International Conference, 2002.
- M. Rochery, R. Schapire, M. Rahim, N. Gupta, G. Riccardi, S. Bangalore, H. Alshawi, S. Douglas,
``Combining Prior Knowledge and Boosting for Call Classification in Spoken Language Dialogue,''
Proc. Int. Conf. Acoust., Speech, Signal Processing, Orlando, 2002.
- M. Rochery, R. Schapire, M. Rahim, N. Gupta,
``BoosTexter for text categorization in spoken language dialogue''
Accepted to Automatic Speech Recognition and Understanding Workshop, 2001.
- M. Rahim, G. Di Fabbrizio, C. Kamm, M. Walker, A. Pokrovsky, P. Ruscitti, E. Levin, S. Lee, A. Syrdal, K. Schlosser,
``VOICE-IF: A Mixed-Initiative Spoken Dialogue System for AT&T Conference Services,''
Proc. European Conf. on Speech Communication and Technology,
2001.
- Levin, E., Narayanan, S., Pieraccini, R., Biatov, K., Bocchieri, E., Di Fabbrizio, G., Eckert, W., Lee, S., Pokrovsky, A., Rahim, M., Ruscitti, P. and Walker, M.,
``The AT&T DARPA Communicator Mixed-Initiative Spoken Dialog System,''
In Proceedings of the International Conference on Spoken Language Processing, 2000.
- Rahim, M., Pieraccini, R., Eckert, W., Levin, E., Di Fabbrizio, G., Riccardi, G., Kamm, C., and Narayanan, S.,
``A Spoken Dialogue System for Conference/Workshop Services,''
Proc. ICSLP '00, 2000.
- Rahim, M., Pieraccini, R., Eckert, W., Levin, E.,
Di Fabbrizio, G., Riccardi, G., Lin, C.-M. and Kamm, C.,
``W99 -- A Spoken Dialogue System for the ASRU'99 Workshop,''
Proc. IEEE ASR Workshop,
Keystone, 1999.
- Rahim, M.,
``Recognizing connected digits in a natural spoken dialog,''
Proc. Int. Conf. Acoust., Speech, Signal Processing, no. 2011, 1999.
Utterance Verification
- Rahim, M.,
``Utterance verification for the numeric language in a natural spoken dialogue,''
Proc. European Conf. on Speech Communication and Technology,
Budapest, 1999,
pp. 495-498.
- Modi, P. and
Rahim, M.,
``Discriminative utterance verification using multiple confidence measures,''
Eurospeech '97,
Rhodes, Greece, 1997.
- *Rahim, M.,
Lee, C-H. and
Juang, B-H.,
``Discriminative utterance verification for connected digits recognition,''
IEEE Trans. on Speech & Audio Proc.,
vol. 5, 1997,
pp. 266-277.
- *Rahim, M.,
Lee, C-H. and
Juang, B-H.,
``A study on robust utterance verification for connected digits recognition,''
J. Acoustical Society of America, 1997,
pp. 2892-2902.
- Rahim, M.,
Lee, C-H.,
Juang, B-H. and
Chou, W.,
``Discriminative utterance verification using minimum string verification error (MSVE) training,''
Proc. ICASSP '96,
Atlanta, May 1996,
pp. 3585-3588.
- Sukkar, R.,
Setlur, A.,
Rahim, M. and
Lee, C-H.,
``Utterance verification of keyword strings using word based minimum verification error (WB-MVE) training,''
Proc. ICASSP '96,
Atlanta, May 1996,
pp. 516-519.
- Rahim, M.,
Lee, C-H. and
Juang, B-H.,
``Discriminative utterance verification for connected digit recognition,''
EuroSpeech '95,
Madrid, Spain, Sept. 1995.
- Rahim, M.,
Lee, C-H. and
Juang, B-H.,
``Robust utterance verification for connected digit recognition,''
ICASSP '95,
Detroit, May 1995.
Acoustic Modeling
- Saul, L. and
Rahim, M.,
``Markov processes on curves for automatic speech recognition,''
Advances in Neural Information Processing Systems 11,
MIT Press, pp. 751-757, Cambridge, 1999.
- *Saul, L. and
Rahim, M.,
``Maximum likelihood and minimum classification error factor analysis for automatic speech recognition,''
IEEE Transactions on Speech and Audio Processing, Vol 8(2), pp. 115-125, 1999.
- Saul, L. and
Rahim, M.,
``Modeling the rate of speech by Markov process on curves,''
Proc. European Conf. on Speech Communication and Technology,
Budapest,
pp. 495-498, 1999.
- Rahim, M. and
Lee, C-H.,
``String-based minimum verification error (SB-MVE) training for flexible speech recognition,''
Computer Speech and Language,
vol. 11,
pp. 147-160, 1997.
- Rahim, M. and
Saul, L.,
``Minimum classification error factor analysis for automatic speech recognition,''
Automatic Speech Recognition Workshop,
Santa Barbara, 1997.
- Rahim, M.,
``A parallel environment model (PEM) for speech recognition and adaptation,''
Eurospeech '97,
Rhodes, Greece, 1997.
- Saul, L. and
Rahim, M.,
``Modeling acoustic correlations by factor analysis,''
NIPS '97, pp. 749-756, 1997.
- Rahim, M.,
Bengio, Y. and
LeCun, Y.,
``Discriminative feature and model design for automatic speech recognition,''
Eurospeech '97,
Rhodes, Greece, 1997.
- Rahim, M. and
Lee, C-H.,
``Simultaneous feature and HMM design using string-based minimum classification error training criterion,''
Proc. ICSLP '96,
Philadelphia, Oct. 1996.
- Rahim, M. and
Lee, C-H.,
``Joint ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training,''
Proc. WCNN '96,
San Diego, Sept. 1996.
- Rahim, M. and
Lee, C-H.,
``An integrated ANN-HMM speech recognition system based on minimum classification error training,''
Proc. IEEE ASR Workshop,
Snowbird, Dec. 1995.
- Salavedra, J.,
Jacobsen, C.,
Rahim, M.,
Zeljkovic, I. and
Wilpon, J. G.,
``Multi-lingual connected digits recognition,''
Proc. European Conf. Speech Communications,
vol. 3, 1995,
pp. 2119-2122.
- Buhrke, E.,
Cardin, R.,
Normandin, Y.,
Rahim, M. and
Wilpon, J.,
``Application of vector quantized hidden Markov models to the recognition of connected digit strings in the telephone network,''
Proc. Int. Conf. Acoust., Speech & Sig. Proc., April 1994.
- ^Rahim, M.,
``A self-learning neural tree network for phone recognition'' in
Artificial Neural Networks for Speech & Vision,
Chapman & Hall, 1994.
- Assaleh, K.,
Mammone, R.,
Rahim, M. and
Flanagan, J.,
``Speech recognition using the modulation model,''
Proc. IEEE ICASSP '93,
Minneapolis, MN, 27-30 April 1993,
pp. II664-II667.
- Rahim, M.,
``A self-learning neural tree network for recognition of speech features,''
Proc. Int. Conf. Acoust., Speech, and Sig. Proc.,
vol. 1, 1993.
- Rahim, M. G.,
``A neural tree architecture for phoneme classification with experiments on the TIMIT database,''
Proc. Int. Conf. Acoust., Speech & Sig. Proc.,
San Francisco, 1992.
Speech Synthesis
- ^Rahim, M.,
``Artificial neural networks for speech analysis/synthesis,''
Chapman & Hall Publication, June 1994.
- *Rahim, M. G.,
Goodyear, C.,
Kleijn, W. B.,
Schroeter, J. and
Sondhi, M. M.,
``On the use of neural networks in articulatory speech synthesis,''
J. Acoust. Soc. Am.,
vol. 93, no. 2, Feb. 1993,
pp. 1109-1121.
- Rahim, M. G.,
Kleijn, W. B.,
Schroeter, J. and
Goodyear, C. C.,
``Acoustic to articulatory mapping using an assembly of neural
networks,''
Proc. Int. Conf. Acoust., Speech, Sig. Proc.,
Toronto, 1991,
pp. 485-488.
- Schroeter, J.,
Gupta, S.,
Rahim, M. and
Sondhi, M. M.,
``Improving an articulatory speech mimic,''
Fortschritte der Akustik - DAGA '91,
Bad Honnef: DPG-GmbH., 1991.
- *Rahim, M. G. and
Goodyear, C. C.,
``Estimation of vocal tract filter parameters using a neural net,''
Speech Communication,
vol. 9,
North-Holland, 1990.
- Rahim, M. and
Goodyear, C. C.,
``Articulatory synthesis with the aid of a neural net,''
Proc. Int. Conf. Acoustic, Speech & Sig. Proc.,
Glasgow, May 1989.
- Rahim, M. and
Goodyear, C. C.,
``Parameter estimation for spectral matching in articulatory synthesis,''
Colloq. Spectral Estimation Techniques & Speech Proc.,
London, 1989.
Communication
- Ansari, A. and
Rahim, M.,
``Image compression using broad vector quantization,''
SPIE Conf.,
Orlando, 1992.
- *Rahim, M.,
Goodyear, C. C. and
Hughes, P. M.,
``Design of discriminator voice-band FSK data modems,''
IEE Proc. Circuits, Devices & Systems, Aug. 1989.
* Journal paper
^ Book or book chapter