Modeling sub-band correlation for noise-robust speech recognition

被引:0
|
作者
McAuley, J [1 ]
Ming, J [1 ]
Hanna, P [1 ]
Stewart, D [1 ]
机构
[1] Queens Univ Belfast, Sch Comp Sci, Belfast BT7 1NN, Antrim, North Ireland
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Sub-band data streams are assumed to be independent in many sub-band based speech recognition systems. However, the structure and operation of the human vocal tract suggests this assumption is unrealistic. A novel method is proposed to incorporate correlation into sub-band speech feature streams. In this method, all possible combinations of sub-bands are created and each combination is treated as a single frequency band by calculating a single feature vector for it. The resulting feature vectors capture information about every band in the combination as well as the dependency across the bands. Experiments conducted on the TIDigits database demonstrate significantly improved robustness in comparison to an independent sub-band system in the presence of both stationary and non-stationary noise.
引用
收藏
页码:1017 / 1020
页数:4
相关论文
共 50 条
  • [41] A speech emphasis method for noise-robust speech recognition by using repetitive phrase
    Hirai, Takanori
    Kuroiwa, Shingo
    Tsuge, Satoru
    Ren, Fuji
    Fattah, Mohamed Abdel
    [J]. 2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1269 - +
  • [42] Wavelet based robust sub-band features for phoneme recognition
    Farooq, O
    Datta, S
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (03): : 187 - 193
  • [43] Noise-Robust Voice Conversion Using High-Quefrency Boosting via Sub-Band Cepstrum Conversion and Fusion
    Miao, Xiaokong
    Sun, Meng
    Zhang, Xiongwei
    Wang, Yimin
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [44] Speech rate estimation via temporal correlation and selected sub-band correlation
    Narayanan, S
    Wang, D
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 413 - 416
  • [45] Sub-band Feature Statistics Compensation Techniques Based on Discrete Wavelet Transform for Robust Speech Recognition
    Fan, Hao-Teng
    Hung, Jeih-weih
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 586 - 589
  • [46] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
    Sara Ahmadi
    Seyed Mohammad Ahadi
    Bert Cranen
    Lou Boves
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [47] Improved model parameter compensation methods for noise-robust speech recognition
    Chang, YH
    Chung, YJ
    Park, SU
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 561 - 564
  • [48] Novel frequency masking curves for noise-robust automatic speech recognition
    Chen, Chia-Ping
    Yeh, Ja-Zang
    Wu, Bo-Feng
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2013, 36 (06) : 696 - 703
  • [49] A Noise-Robust Speech Recognition System Based on Wavelet Neural Network
    Wang, Yiping
    Zhao, Zhefeng
    [J]. ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 392 - 397
  • [50] Noise-robust speech recognition by discriminative adaptation in parallel model combination
    Chung, YJ
    [J]. ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371