Signal conditioning techniques for robust speech recognition

被引:37
|
作者
Rahim, MG
Juang, BH
Chou, W
Buhrke, E
机构
[1] AT and T Bell Laboratories, Murray Hill
关键词
D O I
10.1109/97.489062
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Acoustic mismatch encountered in various training and testing conditions of hidden Markov model (HMM) based systems often causes severe degradation in speech recognition performance. For telephone based speech recognition tasks, acoustic mismatch can arise from various sources, such as variations in telephone handsets, ambient noises, and channel distortions, This paper presents three techniques for blind channel equalization, namely, cepstral mean subtraction (CMS), signal bias removal (SBR) and hierarchical signal bias removal (HSBR), Experimental results on various connected digits databases show a reduction in the digit error rate by 16%, 21%, and 28% when employing CMS, SBR, and HSBR, respectively. Our results also demonstrate that the HSBR technique outperforms SBR and CMS on every sub-data collection and exhibits consistent improvements even for short utterances.
引用
下载
收藏
页码:107 / 109
页数:3
相关论文
共 50 条
  • [1] Signal processing techniques for robust speech recognition
    Asano, Futoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 393 - 401
  • [2] A STUDY ON BIAS-BASED SPEECH SIGNAL CONDITIONING TECHNIQUES FOR IMPROVING THE ROBUSTNESS OF AUTOMATIC SPEECH RECOGNITION
    Chowdhury, Md Foezur Rahman
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 366 - +
  • [3] SIGNAL MODELING TECHNIQUES IN SPEECH RECOGNITION
    PICONE, JW
    PROCEEDINGS OF THE IEEE, 1993, 81 (09) : 1215 - 1247
  • [4] An algorithm for robust signal modelling in speech recognition
    Vergin, R
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 969 - 972
  • [5] Missing data techniques for robust speech recognition
    Cooke, M
    Morris, A
    Green, P
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 863 - 866
  • [6] PROLOG TO - SIGNAL MODELING TECHNIQUES IN SPEECH RECOGNITION
    SELF, K
    PROCEEDINGS OF THE IEEE, 1993, 81 (09) : 1214 - 1214
  • [7] Channel and speaker adaptation techniques for robust speech recognition
    Chen, Jingdong
    Yao, Lei
    Huang, Taiyi
    Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
  • [8] Integrated bias removal techniques for robust speech recognition
    Lawrence, C
    Rahim, M
    COMPUTER SPEECH AND LANGUAGE, 1999, 13 (03): : 283 - 298
  • [9] Techniques for robust speech recognition in noisy and reverberant conditions
    Brown, GJ
    Palomäki, KJ
    SPEECH SEPARATION BY HUMANS AND MACHINES, 2005, : 213 - 220
  • [10] Non-linear techniques for robust speech recognition
    Ge, Yubo
    Niu, Jing
    Ge, Lingnan
    Shirai, Katsuhiko
    CITSA 2007/CCCT 2007: INTERNATIONAL CONFERENCE ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS : INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL III, POST-CONFERENCE ISSUE, PROCEEDINGS, 2007, : 134 - +