Signal conditioning techniques for robust speech recognition

被引:37
|
作者
Rahim, MG
Juang, BH
Chou, W
Buhrke, E
机构
[1] AT and T Bell Laboratories, Murray Hill
关键词
D O I
10.1109/97.489062
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Acoustic mismatch encountered in various training and testing conditions of hidden Markov model (HMM) based systems often causes severe degradation in speech recognition performance. For telephone based speech recognition tasks, acoustic mismatch can arise from various sources, such as variations in telephone handsets, ambient noises, and channel distortions, This paper presents three techniques for blind channel equalization, namely, cepstral mean subtraction (CMS), signal bias removal (SBR) and hierarchical signal bias removal (HSBR), Experimental results on various connected digits databases show a reduction in the digit error rate by 16%, 21%, and 28% when employing CMS, SBR, and HSBR, respectively. Our results also demonstrate that the HSBR technique outperforms SBR and CMS on every sub-data collection and exhibits consistent improvements even for short utterances.
引用
收藏
页码:107 / 109
页数:3
相关论文
共 50 条
  • [21] Robust speech recognition using signal processing based on binaural perception
    Stern, RM
    Sullivan, TM
    ACUSTICA, 1996, 82 : S92 - S92
  • [22] A combination of discriminative and Maximum Likelihood techniques for noise robust speech recognition
    Laurila, K
    Vasilache, M
    Viikki, O
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 85 - 88
  • [23] A robust speech analysis in speech recognition
    Miyanaga, Y
    Gozen, S
    Ohtsuki, N
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 706 - 709
  • [24] Signal Acquisition and Processing Techniques for sEMG Based Silent Speech Recognition
    Meltzner, Geoffrey S.
    Colby, Glen
    Deng, Yunbin
    Heaton, James T.
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 4848 - 4851
  • [25] SPEECH RECOGNITION TECHNIQUES
    GRANT, PM
    ELECTRONICS & COMMUNICATION ENGINEERING JOURNAL, 1991, 3 (01): : 37 - 48
  • [26] Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    Rahim, MG
    Juang, BH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (01): : 19 - 30
  • [27] Noise Robust Acoustic Signal Processing using a Hybrid Approach for Speech Recognition
    Gupta, Divya
    Bansal, Poonam
    Choudhary, Kavita
    2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 489 - 492
  • [28] A DCT-based fast signal subspace technique for robust speech recognition
    Huang, J
    Zhao, YX
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (06): : 747 - 751
  • [29] SPEECH SEPARATION BASED ON SIGNAL-NOISE-DEPENDENT DEEP NEURAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Tu, Yan-Hui
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 61 - 65
  • [30] A novel robust feature of speech signal based on the mellin transform for speaker-independent speech recognition
    Chen, JD
    Xu, B
    Huang, TY
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 629 - 632