SPARSE IMPUTATION FOR NOISE ROBUST SPEECH RECOGNITION USING SOFT MASKS

被引：4

作者：

Gemmeke, J. F. ^{[1
]}

Cranen, B. ^{[1
]}

机构：

[1] Radboud Univ Nijmegen, Dept Linguist, NL-6500 HD Nijmegen, Netherlands

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

Speech recognition; Robustness; Redundancy;

D O I：

10.1109/ICASSP.2009.4960666

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In previous work we introduced a new missing data imputation method for ASR, dubbed sparse imputation. We showed that the method is capable of maintaining good recognition accuracies even at very low SNRs provided the number of mask estimation errors is sufficiently low. Especially at low SNRs, however, mask estimation is difficult and errors are unavoidable. In this paper, we try to reduce the impact of mask estimation errors by making soft decisions, i.e., estimating the probability that a feature is reliable. Using an isolated digit recognition task (using the ALTRORA-2 database), we demonstrate that using soft masks in our sparse imputation approach yields a substantial increase in recognition accuracy, most notably at low SNRs.

引用

页码：4645 / 4648

页数：4

共 50 条

[1] Feature Reconstruction using Sparse Imputation for Noise Robust Audio-Visual Speech Recognition
Shen, Peng
Tamura, Satoshi
Hayamizu, Satoru
[J]. 2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
[2] Model-based clustered sparse imputation for noise robust speech recognition
Goodarzi, Mohammad Mohsen
Almasganj, Farshad
[J]. SPEECH COMMUNICATION, 2016, 76 : 218 - 229
[3] Compressive Sensing for Missing Data Imputation in Noise Robust Speech Recognition
Gemmeke, Jort Florent
Van Hamme, Hugo
Cranen, Bert
Boves, Lou
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2010, 4 (02) : 272 - 287
[4] Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End
Tan, Qun Feng
Georgiou, Panayiotis G.
Narayanan, Shrikanth
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2418 - 2429
[5] ROBUST ISOLATED SPEECH RECOGNITION USING BINARY MASKS
Karadogan, Seliz Gulsen
Larsen, Jan
Pedersen, Michael Syskind
Boldt, Jesper Bunsow
[J]. 18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 1988 - 1992
[6] Sparse imputation for large vocabulary noise robust ASR
Gemmeke, Jort Florent
Cranen, Bert
Remes, Ulpu
[J]. COMPUTER SPEECH AND LANGUAGE, 2011, 25 (02): : 462 - 479
[7] Robust speech recognition from binary masks
Narayanan, Arun
Wang, DeLiang
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (05): : EL217 - EL222
[8] ROBUST SPEECH RECOGNITION FROM RATIO MASKS
Wang, Zhong-Qiu
Wang, DeLiang
[J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5720 - 5724
[9] Noise Robust Exemplar Matching Using Sparse Representations of Speech
Yilmaz, Emre
Gemmeke, Jort Florent
Van Hamme, Hugo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1306 - 1319
[10] Sparse coding of the modulation spectrum for noise-robust automatic speech recognition
Sara Ahmadi
Seyed Mohammad Ahadi
Bert Cranen
Lou Boves
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014

← 1 2 3 4 5 →