Modeling sub-band correlation for noise-robust speech recognition

被引:0
|
作者
McAuley, J [1 ]
Ming, J [1 ]
Hanna, P [1 ]
Stewart, D [1 ]
机构
[1] Queens Univ Belfast, Sch Comp Sci, Belfast BT7 1NN, Antrim, North Ireland
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Sub-band data streams are assumed to be independent in many sub-band based speech recognition systems. However, the structure and operation of the human vocal tract suggests this assumption is unrealistic. A novel method is proposed to incorporate correlation into sub-band speech feature streams. In this method, all possible combinations of sub-bands are created and each combination is treated as a single frequency band by calculating a single feature vector for it. The resulting feature vectors capture information about every band in the combination as well as the dependency across the bands. Experiments conducted on the TIDigits database demonstrate significantly improved robustness in comparison to an independent sub-band system in the presence of both stationary and non-stationary noise.
引用
收藏
页码:1017 / 1020
页数:4
相关论文
共 50 条
  • [1] Intra-frame cepstral sub-band weighting and histogram equalization for noise-robust speech recognition
    Hung J.-W.
    Fan H.-T.
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2013 (1)
  • [2] Sub-band weighted projection measure for sub-band speech recognition in noise
    Nasersharif, B.
    Akbari, A.
    [J]. ELECTRONICS LETTERS, 2006, 42 (14) : 829 - 831
  • [3] Noise Aware Sub-band Locality Preserving Projection for Robust Speech Recognition
    Karevan, Zahra
    Akbari, Ahmad
    Nasersharif, Babak
    [J]. ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING, AISP 2013, 2014, 427 : 203 - +
  • [4] Sub-band speech recognition
    Primor, D
    Furst-Yust, M
    [J]. 22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 10 - 12
  • [5] Sub-band level Histogram Equalization for Robust Speech Recognition
    Joshi, Vikas
    Bilgi, Raghavendra
    Umesh, S.
    Garcia, L.
    Benitez, C.
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1672 - +
  • [6] Maximum likelihood sub-band adaptation for robust speech recognition
    Zhu, DL
    Nakamura, S
    Paliwal, KK
    Wang, RH
    [J]. SPEECH COMMUNICATION, 2005, 47 (03) : 243 - 264
  • [7] Mel Sub-Band Filtering and Compression for Robust Speech Recognition
    Nasersharif, Babak
    Akbari, Ahmad
    Homayounpour, Mohammad Mehdi
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 105 - +
  • [8] Sub-band Modulation Spectrum Compensation for Robust Speech Recognition
    Tu, Wen-hsiang
    Huang, Sheng-Yuan
    Hung, Jeih-weih
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 261 - 265
  • [9] Modeling human auditory perception for noise-robust speech recognition
    Lee, SY
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : PL72 - PL74
  • [10] A probabilistic union model for sub-band based robust speech recognition
    Ming, J
    Smith, FJ
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1787 - 1790