SPARSE IMPUTATION FOR NOISE ROBUST SPEECH RECOGNITION USING SOFT MASKS

被引：4

作者：

Gemmeke, J. F. ^{[1
]}

Cranen, B. ^{[1
]}

机构：

[1] Radboud Univ Nijmegen, Dept Linguist, NL-6500 HD Nijmegen, Netherlands

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

Speech recognition; Robustness; Redundancy;

D O I：

10.1109/ICASSP.2009.4960666

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In previous work we introduced a new missing data imputation method for ASR, dubbed sparse imputation. We showed that the method is capable of maintaining good recognition accuracies even at very low SNRs provided the number of mask estimation errors is sufficiently low. Especially at low SNRs, however, mask estimation is difficult and errors are unavoidable. In this paper, we try to reduce the impact of mask estimation errors by making soft decisions, i.e., estimating the probability that a feature is reliable. Using an isolated digit recognition task (using the ALTRORA-2 database), we demonstrate that using soft masks in our sparse imputation approach yields a substantial increase in recognition accuracy, most notably at low SNRs.

引用

页码：4645 / 4648

页数：4

共 50 条

[31] Robust Speech Recognition Using Noise-Cluster HMM Interpolation
Thatphithakkul, Nattanun
Kruatrachue, Boontee
Wutiwiwatchi, Chai
Marukatat, Sanparith
Boonpiam, Vataya
ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 596 - +
[32] Robust speech recognition by using spectral subtraction with noise peak shifting
Dai, Peng
Soon, Ing Yann
IET SIGNAL PROCESSING, 2013, 7 (08) : 684 - 692
[33] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
Zhao, Yong
Juang, Biing-Hwang
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4290 - 4293
[34] Noise robust speech recognition using subband-crosscorrelation analysis
Kajita, S
Takeda, K
Itakura, F
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1998, E81D (10) : 1079 - 1086
[35] Spectrum enhancement with sparse coding for robust speech recognition
He, Yongjun
Sun, Guanglu
Han, Jiqing
DIGITAL SIGNAL PROCESSING, 2015, 43 : 59 - 70
[36] Speaker and Noise Factorization for Robust Speech Recognition
Wang, Yongqiang
Gales, Mark J. F.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
[37] Robust speech recognition for car environment noise
Kokubo, H
Amano, A
Hataoka, N
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
[38] Articulatory Information for Noise Robust Speech Recognition
Mitra, Vikramjit
Nam, Hosung
Espy-Wilson, Carol Y.
Saltzman, Elliot
Goldstein, Louis
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1913 - 1924
[39] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
CUNG, HM
NORMANDIN, Y
SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276
[40] Robust noise suppression methods in speech recognition
Cui, Yi
Zhang, Dong
Shi, Liangping
Chen, Liyuan
Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14

← 1 2 3 4 5 →