Modeling sub-band correlation for noise-robust speech recognition

被引：0

作者：

McAuley, J ^{[1
]}

Ming, J ^{[1
]}

Hanna, P ^{[1
]}

Stewart, D ^{[1
]}

机构：

[1] Queens Univ Belfast, Sch Comp Sci, Belfast BT7 1NN, Antrim, North Ireland

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Sub-band data streams are assumed to be independent in many sub-band based speech recognition systems. However, the structure and operation of the human vocal tract suggests this assumption is unrealistic. A novel method is proposed to incorporate correlation into sub-band speech feature streams. In this method, all possible combinations of sub-bands are created and each combination is treated as a single frequency band by calculating a single feature vector for it. The resulting feature vectors capture information about every band in the combination as well as the dependency across the bands. Experiments conducted on the TIDigits database demonstrate significantly improved robustness in comparison to an independent sub-band system in the presence of both stationary and non-stationary noise.

引用

页码：1017 / 1020

页数：4

共 50 条

[41] A speech emphasis method for noise-robust speech recognition by using repetitive phrase
Hirai, Takanori
Kuroiwa, Shingo
Tsuge, Satoru
Ren, Fuji
Fattah, Mohamed Abdel
[J]. 2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1269 - +
[42] Wavelet based robust sub-band features for phoneme recognition
Farooq, O
Datta, S
[J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (03): : 187 - 193
[43] Noise-Robust Voice Conversion Using High-Quefrency Boosting via Sub-Band Cepstrum Conversion and Fusion
Miao, Xiaokong
Sun, Meng
Zhang, Xiongwei
Wang, Yimin
[J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
[44] Speech rate estimation via temporal correlation and selected sub-band correlation
Narayanan, S
Wang, D
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 413 - 416
[45] Sub-band Feature Statistics Compensation Techniques Based on Discrete Wavelet Transform for Robust Speech Recognition
Fan, Hao-Teng
Hung, Jeih-weih
[J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 586 - 589
[46] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
Sara Ahmadi
Seyed Mohammad Ahadi
Bert Cranen
Lou Boves
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014
[47] Improved model parameter compensation methods for noise-robust speech recognition
Chang, YH
Chung, YJ
Park, SU
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 561 - 564
[48] Novel frequency masking curves for noise-robust automatic speech recognition
Chen, Chia-Ping
Yeh, Ja-Zang
Wu, Bo-Feng
[J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2013, 36 (06) : 696 - 703
[49] A Noise-Robust Speech Recognition System Based on Wavelet Neural Network
Wang, Yiping
Zhao, Zhefeng
[J]. ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 392 - 397
[50] Noise-robust speech recognition by discriminative adaptation in parallel model combination
Chung, YJ
[J]. ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371

← 1 2 3 4 5 →