SPARSE IMPUTATION FOR NOISE ROBUST SPEECH RECOGNITION USING SOFT MASKS

被引:4
|
作者
Gemmeke, J. F. [1 ]
Cranen, B. [1 ]
机构
[1] Radboud Univ Nijmegen, Dept Linguist, NL-6500 HD Nijmegen, Netherlands
关键词
Speech recognition; Robustness; Redundancy;
D O I
10.1109/ICASSP.2009.4960666
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In previous work we introduced a new missing data imputation method for ASR, dubbed sparse imputation. We showed that the method is capable of maintaining good recognition accuracies even at very low SNRs provided the number of mask estimation errors is sufficiently low. Especially at low SNRs, however, mask estimation is difficult and errors are unavoidable. In this paper, we try to reduce the impact of mask estimation errors by making soft decisions, i.e., estimating the probability that a feature is reliable. Using an isolated digit recognition task (using the ALTRORA-2 database), we demonstrate that using soft masks in our sparse imputation approach yields a substantial increase in recognition accuracy, most notably at low SNRs.
引用
收藏
页码:4645 / 4648
页数:4
相关论文
共 50 条
  • [31] Robust Speech Recognition Using Noise-Cluster HMM Interpolation
    Thatphithakkul, Nattanun
    Kruatrachue, Boontee
    Wutiwiwatchi, Chai
    Marukatat, Sanparith
    Boonpiam, Vataya
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 596 - +
  • [32] Robust speech recognition by using spectral subtraction with noise peak shifting
    Dai, Peng
    Soon, Ing Yann
    IET SIGNAL PROCESSING, 2013, 7 (08) : 684 - 692
  • [33] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
    Zhao, Yong
    Juang, Biing-Hwang
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4290 - 4293
  • [34] Noise robust speech recognition using subband-crosscorrelation analysis
    Kajita, S
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (10) : 1079 - 1086
  • [35] Spectrum enhancement with sparse coding for robust speech recognition
    He, Yongjun
    Sun, Guanglu
    Han, Jiqing
    DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70
  • [36] Speaker and Noise Factorization for Robust Speech Recognition
    Wang, Yongqiang
    Gales, Mark J. F.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
  • [37] Robust speech recognition for car environment noise
    Kokubo, H
    Amano, A
    Hataoka, N
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
  • [38] Articulatory Information for Noise Robust Speech Recognition
    Mitra, Vikramjit
    Nam, Hosung
    Espy-Wilson, Carol Y.
    Saltzman, Elliot
    Goldstein, Louis
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1913 - 1924
  • [39] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
    CUNG, HM
    NORMANDIN, Y
    SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276
  • [40] Robust noise suppression methods in speech recognition
    Cui, Yi
    Zhang, Dong
    Shi, Liangping
    Chen, Liyuan
    Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14