Signal conditioning techniques for robust speech recognition

被引:37
|
作者
Rahim, MG
Juang, BH
Chou, W
Buhrke, E
机构
[1] AT and T Bell Laboratories, Murray Hill
关键词
D O I
10.1109/97.489062
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Acoustic mismatch encountered in various training and testing conditions of hidden Markov model (HMM) based systems often causes severe degradation in speech recognition performance. For telephone based speech recognition tasks, acoustic mismatch can arise from various sources, such as variations in telephone handsets, ambient noises, and channel distortions, This paper presents three techniques for blind channel equalization, namely, cepstral mean subtraction (CMS), signal bias removal (SBR) and hierarchical signal bias removal (HSBR), Experimental results on various connected digits databases show a reduction in the digit error rate by 16%, 21%, and 28% when employing CMS, SBR, and HSBR, respectively. Our results also demonstrate that the HSBR technique outperforms SBR and CMS on every sub-data collection and exhibits consistent improvements even for short utterances.
引用
下载
收藏
页码:107 / 109
页数:3
相关论文
共 50 条
  • [31] Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End
    Tan, Qun Feng
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2418 - 2429
  • [32] Evaluation of Modulation Spectrum Equalization Techniques for Large Vocabulary Robust Speech Recognition
    Sun, Liang-che
    Hsu, Chang-wen
    Lee, Lin-shan
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1004 - 1007
  • [33] A Bayesian view on acoustic model-based techniques for robust speech recognition
    Maas, Roland
    Huemmer, Christian
    Sehr, Armin
    Kellermann, Walter
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015, : 1 - 16
  • [34] FEATURE EXTRACTION ALGORITHM USING NEW CEPSTRAL TECHNIQUES FOR ROBUST SPEECH RECOGNITION
    Korba, Mohamed Cherif Amara
    Bourouba, Houcine
    Djemili, Rafik
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2020, 33 (02) : 90 - 101
  • [35] A Bayesian view on acoustic model-based techniques for robust speech recognition
    Roland Maas
    Christian Huemmer
    Armin Sehr
    Walter Kellermann
    EURASIP Journal on Advances in Signal Processing, 2015
  • [36] Comparison of Estimation Techniques in Joint Uncertainty Decoding for Noise Robust Speech Recognition
    Xu, Haitian
    Chin, K. K.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2363 - 2366
  • [37] Noise robust isolated word recognition using speech feature enhancement techniques
    Ecole Nationale d'Ingénieurs de Sfax ENIS, Department of Génie Electrique, BP W, 3038 Sfax, Tunisia
    不详
    J. Appl. Sci., 2007, 24 (3935-3942):
  • [38] Speech parameters for the robust emotional speech recognition
    Kim W.-G.
    Journal of Institute of Control, Robotics and Systems, 2010, 16 (12) : 1137 - 1142
  • [39] Robust recognition of fast speech
    Lee, Ki-Seung
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (08) : 2456 - 2459
  • [40] Japanese speech databases for robust speech recognition
    Nakamura, A
    Matsunaga, S
    Shimizu, T
    Tonomura, M
    Sagisaka, Y
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2199 - 2202