Covariance Modelling for Noise-Robust Speech Recognition

Cited by: 0

Authors:

van Dalen, R. C. [1]

Gales, M. J. F. [1]

Affiliations:

[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

Source:

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008

Keywords:

Noise robust speech recognition; vector Taylor series; joint uncertainty decoding;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Model compensation is a standard way of improving speech recognisers' robustness to noise. Most model compensation techniques produce diagonal covariances. However, this fails to handle changes in the feature correlations due to the noise. This paper presents a scheme that allows full covariance matrices to be estimated. One problem is that full covariance matrix estimation will be more sensitive to approximations, like those for dynamic parameters, which are known to be crude. In this paper a linear transformation of a window of consecutive frames is used as the basis for dynamic parameter compensation. A second problem is that the resulting full covariance matrices slow down decoding. This is addressed by using predictive linear transforms that decorrelate the feature space, so that the decoder can then use diagonal covariance matrices. On a noise-corrupted Resource Management task, the proposed scheme outperformed the standard VTS compensation scheme.

Pages: 2000-2003

Number of pages: 4

