Toward Robust Speech Recognition and Understanding

被引:0
|
作者
Sadaoki Furui
机构
[1] Tokyo Institute of Technology,Department of Computer Science
关键词
speech recognition; speech understanding; robustness; adaptation; spontaneous speech; corpus; acoustic models; language models; dialogue; multi-modal; summarization;
D O I
暂无
中图分类号
学科分类号
摘要
The principal cause of speech recognition errors is a mismatch between trained acoustic/language models and input speech due to the limited amount of training data in comparison with the vast variation of speech. It is crucial to establish methods that are robust against voice variation due to individuality, the physical and psychological condition of the speaker, telephone sets, microphones, network characteristics, additive background noise, speaking styles, and other aspects. This paper overviews robust architecture and modeling techniques for speech recognition and understanding. The topics include acoustic and language modeling for spontaneous speech recognition, unsupervised adaptation of acoustic and language models, robust architecture for spoken dialogue systems, multi-modal speech recognition, and speech summarization. This paper also discusses the most important research problems to be solved in order to achieve ultimate robust speech recognition and understanding systems.
引用
收藏
页码:245 / 254
页数:9
相关论文
共 50 条
  • [31] Stochastic Matching for Robust Speech Recognition
    Sankar, Ananth
    Lee, Chin-Hui
    IEEE SIGNAL PROCESSING LETTERS, 1994, 1 (08) : 124 - 125
  • [32] Trajectory Modeling for Robust Speech Recognition
    Sim, KheChai
    2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : XXVII - XXVIII
  • [33] Robust speech recognition with dynamic synapses
    Liaw, JS
    Berger, TW
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 2175 - 2179
  • [34] Robust speech recognition in car environments
    Shozakai, M
    Nakamura, S
    Shikano, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 269 - 272
  • [35] Adaptive compensation for robust speech recognition
    Lee, CH
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 357 - 364
  • [36] Subband correlation and robust speech recognition
    McAuley, J
    Ming, J
    Stewart, D
    Hanna, P
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 956 - 964
  • [37] ULTRASONIC SENSING FOR ROBUST SPEECH RECOGNITION
    Srinivasan, Sundararajan
    Raj, Bhiksha
    Ezzat, Tony
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5102 - 5105
  • [38] TOWARD CONTINUOUS-SPEECH RECOGNITION
    VERHAEGHE, B
    BYTE, 1992, 17 (04): : 158 - 158
  • [39] Toward noise robustness speech recognition
    Namarvar, HH
    Liaw, J
    Berger, TW
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016
  • [40] Continuous Chinese speech recognition and understanding
    Gu, JH
    Liu, JM
    Shen, XQ
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 989 - 992