Bibliography for M. Rahim (mazin)

    Robust Speech Recognition

  1. Goffin, V., Allauzen, C., Bocchieri, E., Hakkani-Tur, D., Ljolje, A., Parthasarathy, S., Rahim, M., Riccardi, G., Saraclar, M., ``The AT&T Watson Speech Recognizer'' Proc. Int. Conf. Acoust., Speech & Sig. Proc., March 2005.
  2. Kim, H.-K., and Rahim, M., ``Why speech recognizers make errors - A robustness view'' Int. Conf. Speech & Lang. Proc., Korea, October 2004.
  3. *Rahim, M., Riccardi, G., Saul, L., Wright, J., Buntschuh, B., and Gorin, A., ``Robust numeric recognition in spoken language dialog'' Speech Communication, Vol 34, pp. 195-212, 2001.
  4. *Saul, L., Rahim, M., and Allen, J., ``A statistical model for robust integration of narrowband cues in speech'' Computer Speech and Language, Vol 15, pp. 175-194, 2001.
  5. Furst, M., Allen, J., Saul, L. and Rahim, M. ``Human speech perception in narrow bands'' Israel Society for Auditory Research, Tel Aviv, Israel, pp. 10-15, Oct. 2000.
  6. Allen, J., Furst, M., Saul, L. and Rahim, M. ``Feature extraction in human speech recognition'' The Nature of Speech Recognition Workshop, Lake Tahoe, CA, pp. 23-27, Aug. 2000.
  9. Allen, J., Furst, M., Saul, L. and Rahim, M. ``Feature extraction in human speech recognition'' The Nature of Speech Recognition Workshop, Utrecht, Netherlands, pp. 3-7, July 2000.
  10. Saul, L., Rahim, M., and Allen, J., ``Learning from examples in critical bands of speech,'' Proc. IEEE ASR Workshop, Keystone, 1999.
  11. *Ephraim, Y. and Rahim, M., ``On second order statistics and linear estimation of cepstral coefficients,'' IEEE Transactions on Speech and Audio Processing, vol. 7, no. 2, 1999, pp. 162-176.
  12. Rahim, M., Riccardi, G., Wright, J., Buntschuh, B. and Gorin, A., ``Robust automatic speech recognition in a natural spoken dialog,'' Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, 1999.
  13. *Surendran, A. C., Lee, C-H. and Rahim, M., ``Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition,'' IEEE Transactions on Speech and Audio Processing, 1999.
  14. *Lawrence, C., and Rahim, M., ``Integrated bias removal techniques for robust speech recognition,'' Computer Speech and Language, pp. 283-298, 1999.
  15. Ephraim, Y. and Rahim, M., ``On second order statistics and linear estimation of cepstral coefficients,'' Proc. Int. Conf. Acoust., Speech, Signal Processing, no. SP29.2, 1998.
  16. Lawrence, C. and Rahim, M., ``Integrated bias removal techniques for robust speech recognition,'' Eurospeech '97, Rhodes, Greece, 1997.
  17. Surendran, A., Lee, C. and Rahim, M., ``Unsupervised smooth training of feed-forward neural networks for mismatch compensation,'' Automatic Speech Recognition Workshop, Santa Barbara, 1997.
  18. Chou, W., Rahim, M. and Buhrke, E., ``Signal conditioned minimum error rate training for continuous speech recognition,'' Proc. Euro-Speech '95, 1995.
  19. Chou, W., Seshadri, N. and Rahim, M., ``Trellis encoded vector quantization for robust speech recognition,'' Proc. ICSLP '96, Philadelphia, Oct. 1996, pp. 2001-2004.
  20. *Rahim, M. and Juang, B-H., ``Signal bias removal by maximum likelihood estimation for robust telephone speech recognition,'' IEEE Trans. Speech & Audio Proc., vol. IV, no. 1, Jan. 1996, pp. 19-30.
  21. *Rahim, M., Juang, B-H., Chou, W. and Buhrke, E., ``Signal conditioning techniques for robust speech recognition,'' IEEE Signal Processing Letters, vol. 3, 1996.
  22. Surendran, A. C., Lee, C-H. and Rahim, M., ``Maximum likelihood stochastic matching approach to non-linear equalization for robust speech recognition,'' Int. Conf. Speech & Lang. Proc., Phil., 1996.
  23. Rahim, M. and Juang, B-H., ``Signal bias removal for robust telephone based speech recognition in adverse environments,'' Proc. Int. Conf. Acoustic, Speech & Sig. Proc., April 1994.

    Speech Data Mining

  24. *Douglas, S., Agarwal, D., Alonso, T., Bell, R., Gilbert, M., Swayne, D., and Volinsky, C., ``Mining Customer Care Dialogs for "Daily News",'' Special issue on Data Mining of Speech, Audio and Dialog, IEEE Trans. Speech & Audio Proc., September 2005.
  25. Feng, J., Srinivas, B., and Rahim, M., ``An Evaluation Study of WebTalk Question/Answering,'' International Conference on Speech and Language Processing, Korea, October 2004.
  26. Douglas, S., Agarwal, D., Alonso, T., Bell, R., Rahim, M., Swayne, D., and Volinsky, C., ``Mining Customer Care Dialogs for "Daily News",'' International Conference on Speech and Language Processing, Korea, October 2004.
  27. Feng, J., Bangalore, S., Rahim, M., ``WebTalk: Mining websites for automatically building dialog systems'', Proc. IEEE ASR Workshop, Virgin Islands, 2003.
  28. Feng, J., Bangalore, S., Rahim, M., ``WebTalk: Towards automatically building dialog services by exploiting the content and structure of websites'', Int. Conf. on World Wide Web, Budapest, May 2003.

    Spoken Language Understanding and Applications

  29. *Gilbert, M., Wilpon, J., Stern, B., and Di Fabbrizio, G., ``Intelligent Virtual Agents for Contact Center Automation'' IEEE Signal Processing Magazine, September 2005.
  30. *Gupta, N., Tur, G., Tur, D., Bangalore, S., Riccardi, G., and Gilbert, M., ``The AT&T Spoken Language Understanding System,'' IEEE Trans. Speech & Audio Proc., To be published 2005.
  31. *Schapire, R., Rochery, M., Rahim, M., and Gupta, N., ``Incorporating prior knowledge into boosting for Text and Speech Classification,'' IEEE Trans. Speech & Audio Proc., March 2005.
  32. Tur, D., Tur, G., Rahim, M. and Riccardi, G., ``Unsupervised and Active Learning in Automatic Speech Recognition for Call Classification,'' Proc. Int. Conf. Acoustic, Speech & Sig. Proc., Montreal, May 2004.
  33. Tur, G., Rahim, M. and Tur, D., ``Active Labeling for Spoken Language Understanding,'' Proc. Euro-Speech '03, Geneva, Sep 2003.
  34. S. Bangalore, N. Gupta, M. Rahim, ``Extracting Clauses for Spoken Language Understanding in Conversational Systems'', International Conference on Speech and Language Processing, Colorado, 2002.
  35. G. Di Fabbrizio, D. Dutton, N. Gupta, B. Hollister, M. Rahim, G. Riccardi, R. Schapire, J. Schroeter, ``The AT&T Help Desk'', International Conference on Speech and Language Processing, Colorado, 2002.
  36. R. Schapire, M. Rochery, M. Rahim and N. Gupta, ``Incorporating prior knowledge into boosting,'' In Machine Learning: Proceedings of the Nineteenth International Conference, 2002.
  37. M. Rochery, R. Schapire, M. Rahim, N. Gupta, G. Riccardi, S. Bangalore, H. Alshawi, S. Douglas, ``Combining Prior Knowledge and Boosting for Call Classification in Spoken Language Dialogue,'' Proc. Int. Conf. Acoust., Speech, Signal Processing, Orlando, 2002.
  38. M. Rochery, R. Schapire, M. Rahim, N. Gupta, ``BoosTexter for text categorization in spoken language dialogue'' Accepted to Automatic Speech Recognition and Understanding Workshop, 2001.
  39. M. Rahim, G. Di Fabbrizio, C. Kamm, M. Walker, A. Pokrovsky, P. Ruscitti, E. Levin, S. Lee, A. Syrdal, K. Schlosser, ``VOICE-IF: A Mixed-Initiative Spoken Dialogue System for AT&T Conference Services,'' Proc. European Conf. on Speech Communication and Technology, 2001.
  40. Levin, E., Narayanan, S., Pieraccini, R., Biatov, K., Bocchieri, E., DiFabbrizio, G., Eckert, W., Lee, S., Pokrovsky, A., Rahim, M., Ruscitti, P. and Walker, M., ``The AT&T DARPA Communicator Mixed-Initiative Spoken Dialog System,'' In Proceedings of the International Conference on Spoken Language Processing, 2000.
  41. Rahim, M., Pieraccini, R., Eckert, W., Levin, E., Di Fabbrizio, G., Riccardi, G., Kamm, C., and Narayanan, S., ``A Spoken Dialogue System for Conference/Workshop Services,'' Proc. ICSLP '00, 2000.
  42. Rahim, M., Pieraccini, R., Eckert, W., Levin, E., Di Fabbrizio, G., Riccardi, G., Lin, C.-M. and Kamm, C., ``W99 -- A Spoken Dialogue System for the ASRU'99 Workshop,'' Proc. IEEE ASR Workshop, Keystone, 1999.
  43. Rahim, M., ``Recognizing connected digits in a natural spoken dialog,'' Proc. Int. Conf. Acoust., Speech, Signal Processing, no. 2011, 1999.

    Utterance Verification

  44. Rahim, M., ``Utterance verification for the numeric language in a natural spoken dialogue,'' Proc. European Conf. on Speech Communication and Technology, Budapest, 1999, pp. 495-498.
  45. Modi, P. and Rahim, M., ``Discriminative utterance verification using multiple confidence measures,'' Eurospeech '97, Rhodes, Greece, 1997.
  46. *Rahim, M., Lee, C-H. and Juang, B-H., ``Discriminative utterance verification for connected digits recognition,'' IEEE Trans. on Speech & Audio Proc., vol. 5, 1997, pp. 266-277.
  47. *Rahim, M., Lee, C-H. and Juang, B-H., ``A study on robust utterance verification for connected digits recognition,'' J. Acoustical Society of America, 1997, pp. 2892-2902.
  48. Rahim, M., Lee, C-H., Juang, B-H. and Chou, W., ``Discriminative utterance verification using minimum string verification error (MSVE) training,'' Proc. ICASSP '96, Atlanta, May 1996, pp. 3585-3588.
  49. Sukkar, R., Setlur, A., Rahim, M. and Lee, C-H., ``Utterance verification of keyword strings using word based minimum verification error (WB-MVE) training,'' Proc. ICASSP '96, Atlanta, May 1996, pp. 516-519.
  50. Rahim, M., Lee, C-H. and Juang, B-H., ``Discriminative utterance verification for connected digit recognition,'' EuroSpeech '95, Madrid, Spain, Sept. 1995.
  51. Rahim, M., Lee, C-H. and Juang, B-H., ``Robust utterance verification for connected digit recognition,'' ICASSP '95, Detroit, May 1995.

    Acoustic Modeling

  52. Saul, L. and Rahim, M., ``Markov processes on curves for automatic speech recognition,'' Advances in Neural Information Processing Systems 11, MIT Press, pp. 751-757, Cambridge, 1999.
  53. *Saul, L. and Rahim, M., ``Maximum likelihood and minimum classification error factor analysis for automatic speech recognition,'' IEEE Transactions on Speech and Audio Processing, Vol 8(2), pp. 115-125, 1999.
  54. Saul, L. and Rahim, M., ``Modeling the rate of speech by Markov process on curves,'' Proc. European Conf. on Speech Communication and Technology, Budapest, pp. 495-498, 1999.
  55. Rahim, M. and Lee, C-H., ``String-based minimum verification error (SB-MVE) training for flexible speech recognition,'' Computer Speech and Language, vol. 11, pp. 147-160, 1997.
  56. Rahim, M. and Saul, L., ``Minimum classification error factor analysis for automatic speech recognition,'' Automatic Speech Recognition Workshop, Santa Barbara, 1997.
  57. Rahim, M., ``A parallel environment model (PEM) for speech recognition and adaptation,'' Eurospeech '97, Rhodes, Greece, 1997.
  58. Saul, L. and Rahim, M., ``Modeling acoustic correlations by factor analysis,'' NIPS '97, pp. 749-756, 1997.
  59. Rahim, M., Bengio, Y. and LeCun, Y., ``Discriminative feature and model design for automatic speech recognition,'' Eurospeech '97, Rhodes, Greece, 1997.
  60. Rahim, M. and Lee, C-H., ``Simultaneous feature and HMM design using string-based minimum classification error training criterion,'' Proc. ICSLP '96, Phil., Oct. 1996.
  61. Rahim, M. and Lee, C-H., ``Joint ANN feature and HMM recognizer design using string-based minimum classification error (MCE) training,'' Proc. WCNN '96, San Diego, Sept. 1996.
  62. Rahim, M. and Lee, C-H., ``An integrated ANN-HMM speech recognition system based on minimum classification error training,'' Proc. IEEE ASR Workshop, Snowbird, Dec. 1995.
  63. Salavedra, J., Jacobsen, C., Rahim, M., Zeljkovic, I. and Wilpon, J. G., ``Multi-lingual connected digits recognition,'' Proc. European Conf. Speech Communications, vol. 3, 1995, pp. 2119-2122.
  64. Buhrke, E., Cardin, R., Normandin, Y., Rahim, M. and Wilpon, J., ``Application of vector quantized hidden Markov models to the recognition of connected digit strings in the telephone network,'' Proc. Int. Conf. Acoust., Speech & Sig. Proc., April 1994.
  65. ^Rahim, M., ``A self-learning neural tree network for phone recognition'' in Artificial Neural Networks for Speech & Vision, Chapman & Hall, 1994.
  66. Assaleh, K., Mammone, R., Rahim, M. and Flanagan, J., ``Speech recognition using the modulation model,'' Proc. IEEE ICASSP '93, Minneapolis, MN, 27-30 April 1993, pp. II664-II667.
  67. Rahim, M., ``A self-learning neural tree network for recognition of speech features,'' Proc. Int. Conf. Acoust., Speech, and Sig. Proc., vol. 1, 1993.
  68. Rahim, M. G., ``A neural tree architecture for phoneme classification with experiments on the TIMIT database,'' Proc. Int. Conf. Acoust., Speech & Sig. Proc., San Francisco, 1992.

    Speech Synthesis

  69. ^Rahim, M., ``Artificial neural networks for speech analysis/synthesis,'' Chapman & Hall Publication, June 1994.
  70. *Rahim, M. G., Goodyear, C., Kleijn, W. B., Schroeter, J. and Sondhi, M. M., ``On the use of neural networks in articulatory speech synthesis,'' J. Acoust. Soc. Am., vol. 93, no. 2, Feb. 1993, pp. 1109-1121.
  71. Rahim, M. G., Kleijn, W. B., Schroeter, J. and Goodyear, C. C., ``Acoustic to articulatory mapping using an assembly of neural networks,'' Proc. Int. Conf. Acoust., Speech, Sig. Proc., Toronto, 1991, pp. 485-488.
  72. Schroeter, J., Gupta, S., Rahim, M. and Sondhi, M. M., ``Improving an articulatory speech mimic,'' Fortschritte der Akustik - DAGA '91, Bad Honnef: DPG-GmbH., 1991.
  73. *Rahim, M. G. and Goodyear, C. C., ``Estimation of vocal tract filter parameter using a neural net,'' Speech Communication, vol. 9, North-Holland, 1990.
  74. Rahim, M. and Goodyear, C. C., ``Articulatory synthesis with the aid of a neural net,'' Proc. Int. Conf. Acoustic, Speech & Sig. Proc., Glasgow, May 1989.
  75. Rahim, M. and Goodyear, C. C., ``Parameter estimation for spectral matching in articulatory synthesis,'' Colloq. Spectral Estimation Techniques & Speech Proc., London, 1989.

    Communication

  76. Ansari, A. and Rahim, M., ``Image compression using broad vector quantization,'' SPIE Conf., Orlando, 1992.
  77. *Rahim, M., Goodyear, C. C. and Hughes, P. M., ``Design of discriminator voice-band FSK data modems,'' IEE Proc. Circuits, Devices & Systems, Aug. 1989.

* Journal paper ^ Book or book chapter