SPARSE IMPUTATION FOR NOISE ROBUST SPEECH RECOGNITION USING SOFT MASKS

被引:4
|
作者
Gemmeke, J. F. [1 ]
Cranen, B. [1 ]
机构
[1] Radboud Univ Nijmegen, Dept Linguist, NL-6500 HD Nijmegen, Netherlands
关键词
Speech recognition; Robustness; Redundancy;
D O I
10.1109/ICASSP.2009.4960666
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In previous work we introduced a new missing data imputation method for ASR, dubbed sparse imputation. We showed that the method is capable of maintaining good recognition accuracies even at very low SNRs provided the number of mask estimation errors is sufficiently low. Especially at low SNRs, however, mask estimation is difficult and errors are unavoidable. In this paper, we try to reduce the impact of mask estimation errors by making soft decisions, i.e., estimating the probability that a feature is reliable. Using an isolated digit recognition task (using the ALTRORA-2 database), we demonstrate that using soft masks in our sparse imputation approach yields a substantial increase in recognition accuracy, most notably at low SNRs.
引用
收藏
页码:4645 / 4648
页数:4
相关论文
共 50 条
  • [1] Feature Reconstruction using Sparse Imputation for Noise Robust Audio-Visual Speech Recognition
    Shen, Peng
    Tamura, Satoshi
    Hayamizu, Satoru
    [J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [2] Model-based clustered sparse imputation for noise robust speech recognition
    Goodarzi, Mohammad Mohsen
    Almasganj, Farshad
    [J]. SPEECH COMMUNICATION, 2016, 76 : 218 - 229
  • [3] Compressive Sensing for Missing Data Imputation in Noise Robust Speech Recognition
    Gemmeke, Jort Florent
    Van Hamme, Hugo
    Cranen, Bert
    Boves, Lou
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (02) : 272 - 287
  • [4] Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End
    Tan, Qun Feng
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2418 - 2429
  • [5] ROBUST ISOLATED SPEECH RECOGNITION USING BINARY MASKS
    Karadogan, Seliz Gulsen
    Larsen, Jan
    Pedersen, Michael Syskind
    Boldt, Jesper Bunsow
    [J]. 18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1988 - 1992
  • [6] Sparse imputation for large vocabulary noise robust ASR
    Gemmeke, Jort Florent
    Cranen, Bert
    Remes, Ulpu
    [J]. COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02): : 462 - 479
  • [7] Robust speech recognition from binary masks
    Narayanan, Arun
    Wang, DeLiang
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (05): : EL217 - EL222
  • [8] ROBUST SPEECH RECOGNITION FROM RATIO MASKS
    Wang, Zhong-Qiu
    Wang, DeLiang
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5720 - 5724
  • [9] Noise Robust Exemplar Matching Using Sparse Representations of Speech
    Yilmaz, Emre
    Gemmeke, Jort Florent
    Van Hamme, Hugo
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1306 - 1319
  • [10] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
    Sara Ahmadi
    Seyed Mohammad Ahadi
    Bert Cranen
    Lou Boves
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014