Adding Noise to Improve Noise Robustness in Speech Recognition

Cited by: 0

Authors:

Morales, Nicolas [1]

Gu, Liang [2]

Gao, Yuqing [2]

Affiliations:

[1] Univ Autonoma Madrid, HCTLab, Madrid, Spain

[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA

Source:

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007

Keywords:

acoustic noise; robustness; speech enhancement; speech recognition

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this work we explore a technique for increasing recognition accuracy on speech affected by corrupting noise of an undetermined nature, by the addition of a known and well-behaved noise (masking noise). The same type of noise used for masking is added to the training data, thus reducing the gap between training and test conditions, independent of the type of corrupting noise, or whether it is stationary or not. While still in an early development stage, the new approach shows consistent improvements in accuracy and robustness for a variety of conditions, where no use is made of a-priori knowledge of the corrupting noise. The approach is shown to be of particular interest to the case of cross-talk corrupting noise, a complicated situation in speech recognition for which the relative gain with the proposed approach is over 24%.
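The masking step described in the abstract can be illustrated with a minimal sketch. The abstract does not state which noise type or level the authors used, so the white Gaussian noise, the 15 dB signal-to-noise ratio, and the function names `add_masking_noise` and `measure_snr_db` below are all assumptions chosen for illustration, not the paper's method. The sketch only shows the general idea of adding the same kind of known noise to both training and test audio.

```python
import math
import random


def add_masking_noise(signal, snr_db, seed=0):
    """Return signal plus white Gaussian noise scaled to the requested SNR (dB).

    Hypothetical helper: the noise type and level are illustrative assumptions.
    """
    rng = random.Random(seed)
    # Average power of the input (speech, possibly already corrupted).
    signal_power = sum(x * x for x in signal) / len(signal)
    # Draw unit-variance noise, then measure its actual power.
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    noise_power = sum(n * n for n in noise) / len(noise)
    # Scale so that signal_power / scaled_noise_power == 10 ** (snr_db / 10).
    target_noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(target_noise_power / noise_power)
    return [x + gain * n for x, n in zip(signal, noise)]


def measure_snr_db(reference, noisy):
    """Measure the SNR (dB) of `noisy` relative to `reference`."""
    signal_power = sum(x * x for x in reference)
    noise_power = sum((y - x) ** 2 for x, y in zip(reference, noisy))
    return 10.0 * math.log10(signal_power / noise_power)


# Toy stand-in for a speech waveform: a 440 Hz tone sampled at 16 kHz.
speech = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(16000)]

# The same masking step is applied to training and test audio alike,
# with independent noise draws (different seeds) for each.
train_masked = add_masking_noise(speech, snr_db=15.0, seed=1)
test_masked = add_masking_noise(speech, snr_db=15.0, seed=2)
```

In a real system the test input would already contain the unknown corrupting noise before the masking noise is added; the sketch omits that step and any feature extraction or model training.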

Citation:

Pages: 861 / +

Number of pages: 2
