Robust telephone speech recognition based on channel compensation

Cited by: 3

Authors:

Han, JQ [1]

Gao, W [1]

Affiliations:

[1] Harbin Inst Technol, Dept Comp Sci & Engn, Harbin 150001, Peoples R China

Source:

PATTERN RECOGNITION | 1999 / Vol. 32 / No. 06

Keywords:

channel compensation; speech recognition; robustness; modulation frequencies; signal-to-noise rate;

DOI:

10.1016/S0031-3203(98)00113-7

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Channel compensation has been shown to be an effective approach to robust speech recognition. In this paper, we compare the performance of our proposed method, RMFCC, with that of earlier channel compensation methods (CMS, two-level CMS and RASTA) for robust telephone speech recognition. All experiments use a Korean isolated 84-word database consisting of SO speakers, collected over local telephone lines. With RMFCC, a 39.8% reduction in word error rate is obtained relative to a conventional HMM system. The experiments show that RMFCC, compared with RASTA, reduces computational complexity without losing accuracy, and that it also outperforms CMS and two-level CMS. After discussion, we verify that suppressing very low modulation frequencies by filtering is an effective approach to robust telephone speech recognition. (C) 1999 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
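
The abstract names CMS (cepstral mean subtraction) as one of the baseline methods. As a rough illustration of that baseline only, and not of the authors' RMFCC method, which this record does not describe, the sketch below subtracts the per-coefficient mean over the frames of one utterance. It assumes the textbook view that a fixed telephone channel adds a constant offset to every cepstral frame; the function name and the toy values are hypothetical.

```python
from typing import List


def cepstral_mean_subtraction(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-coefficient mean taken over all frames of one utterance.

    Removing the mean removes the zero-frequency (DC) component of each
    coefficient's trajectory over time, i.e. the lowest modulation frequency.
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Toy cepstral frames (hypothetical values, 3 frames x 2 coefficients).
clean = [[1.0, 2.0], [3.0, 0.0], [2.0, 4.0]]

# Assumption: a fixed channel shows up as a constant additive cepstral offset.
channel = [0.5, -1.5]
distorted = [[c + h for c, h in zip(f, channel)] for f in clean]

cms_clean = cepstral_mean_subtraction(clean)
cms_distorted = cepstral_mean_subtraction(distorted)
```

Under that assumption, `cms_clean` and `cms_distorted` come out identical, which is the sense in which mean subtraction compensates for a stationary channel.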

Pages: 1061-1067

Page count: 7
