Robust telephone speech recognition based on channel compensation

被引:3
|
作者
Han, JQ [1 ]
Gao, W [1 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China
关键词
channel compensation; speech recognition; robustness; modulation frequencies; signal-to-noise rate;
D O I
10.1016/S0031-3203(98)00113-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Channel compensation technique has been proved to be an effective approach for robust speech recognition. In this payer, we compare the performance of our proposed method RMFCC with those of the former channel compensation methods: CMS, two-level CMS and RASTA for robust telephone speech recognition. For all experiments, a Korean isolated 84-word-database consisting of SO speakers collected from local telephone line is adopted. Using RMFCC, a 39.8% reduction in word error rate is obtained relative to conventional HMM system. It is shown from the experiments that RMFCC, comparing with RASTA, reduces the computational complexity without losing accuracy, and is also better than CMS and two-level CMS on the performance. After discussion, we verify that it is an effective approach to suppress very low modulation frequencies by filtering for robust telephone speech recognition. (C) 1999 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1061 / 1067
页数:7
相关论文
共 50 条
  • [41] A novel channel estimate for noise robust speech recognition
    Vanderreydt, Geoffroy
    Demuynck, Kris
    [J]. COMPUTER SPEECH AND LANGUAGE, 2024, 86
  • [42] Bayesian channel equalisation and robust features for speech recognition
    Milner, BP
    Vaseghi, SV
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1996, 143 (04): : 223 - 231
  • [43] Channel Robust MFCCs for Continuous Speech Speaker Recognition
    Chougule, Sharada Vikram
    Chavan, Mahesh S.
    [J]. ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 557 - 568
  • [44] Dual channel based speech enhancement using novelty filter for robust speech recognition in automobile environment
    Beh, Jounghoon
    Baran, Robert H.
    Ko, Hanseok
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (02) : 583 - 589
  • [45] CASA Based Speech Separation for Robust Speech Recognition
    Han Runqiang
    Zhao Pei
    Gao Qin
    Zhang Zhiping
    Wu Hao
    Wu Xihong
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 77 - 80
  • [46] Conversational telephone speech recognition
    Gauvain, JL
    Lamel, L
    Schwenk, H
    Adda, G
    Chen, L
    Lefèvre, F
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 212 - 215
  • [47] Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    Rahim, MG
    Juang, BH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (01): : 19 - 30
  • [48] A new robust telephone speech recognition algorithm with the multi-model structures
    Liu, J
    Pan, SX
    Wang, ZY
    Xia, SH
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2000, 9 (02) : 169 - 174
  • [49] COMBINING EIGENVOICE SPEAKER MODELING AND VTS-BASED ENVIRONMENT COMPENSATION FOR ROBUST SPEECH RECOGNITION
    Ou, Zhijian
    Deng, Kan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4673 - 4676
  • [50] VTS feature compensation based on two-layer GMM structure for robust speech recognition
    Zhou, Lin
    Li, Haijing
    Chen, Ying
    Wu, Zhenyang
    Lu, Yong
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,