Robust speech recognition with time-varying filtering, interruptions, and noise

被引:4
|
作者
Lippmann, RP [1 ]
Carlson, BA [1 ]
机构
[1] MIT, Lincoln Lab, Lexington, MA 02173 USA
关键词
D O I
10.1109/ASRU.1997.659112
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech recognizers trained with quiet wide-band speech degrade dramatically with high-pass, low-pass, and notch filtering, with noise, and with interruptions of the speech input. A new and simple approach to compensate for these degradations is presented which uses met-filter-bank (MFB) magnitudes as input features and missing feature theory to dynamically modify the probability computations performed in Hidden Markov Model recognizers, When the identity of features missing due to filtering or masking is provided, recognition accuracy on a large talker-independent digit recognition task often rises from below 50% to above 95%, These promising results suggest future work to continuously estimate SNR's within MFB bands for dynamic adaptation of speech recognizers.
引用
收藏
页码:365 / 372
页数:8
相关论文
共 50 条
  • [1] Variational Model Composition for Robust Speech Recognition with Time-Varying Background Noise
    Kim, Wooil
    Hansen, John H. L.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2371 - 2374
  • [2] Robust filtering algorithm based on time-varying noise
    Shao, Hui
    Xiong, Zhi
    Xu, Jianxin
    Hua, Bing
    Han, Song
    [J]. AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 2016, 88 (01): : 189 - 196
  • [3] Recursive estimation of time-varying environments for robust speech recognition
    Zhao, YX
    Wang, SJ
    Yen, KC
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 225 - 228
  • [4] Variational noise model composition through model perturbation for robust speech recognition with time-varying background noise
    Kim, Wooil
    Hansen, John H. L.
    [J]. SPEECH COMMUNICATION, 2011, 53 (04) : 451 - 464
  • [5] Multiple-microphone time-varying filters for robust speech recognition
    Lai, CYK
    Aarabi, P
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 233 - 236
  • [6] Noise adaptive speech recognition in time-varying noise based on sequential Kullback proximal algorithm
    Yao, KS
    Paliwal, KK
    Nakamura, S
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 189 - 192
  • [7] Robust Particle Filtering With Time-Varying Model Uncertainty and Inaccurate Noise Covariance Matrix
    Li, Wenshuo
    Guo, Lei
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (11): : 7099 - 7108
  • [8] Optimal robust filtering with time-varying parameter uncertainty
    Bolzern, P
    Colaneri, P
    DeNicolao, G
    [J]. INTERNATIONAL JOURNAL OF CONTROL, 1996, 63 (03) : 557 - 576
  • [9] Linear Prediction Filtering on Cepstral Time Series for Noise-Robust Speech Recognition
    Hsieh, Hsin-Ju
    Jheng, Jhih-Hao
    Lin, Jung-shan
    Hung, Jeih-weih
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2016, : 311 - 312
  • [10] ROBUST TIME-VARYING PARAMETRIC MODELING OF VOICED SPEECH
    HA, P
    ANN, S
    [J]. SIGNAL PROCESSING, 1995, 42 (03) : 311 - 317