Adding Noise to Improve Noise Robustness in Speech Recognition

被引：0

作者：

Morales, Nicolas ^{[1
]}

Gu, Liang ^{[2
]}

Gao, Yuqing ^{[2
]}

机构：

[1] Univ Autonoma Madrid, HCTLab, Madrid, Spain

[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA

来源：

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年

关键词：

acoustic noise; robustness; speech enhancement; speech recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this work we explore a technique for increasing recognition accuracy on speech affected by corrupting noise of an undetermined nature, by the addition of a known and well-behaved noise (masking noise). The same type of noise used for masking is added to the training data, thus reducing the gap between training and test conditions, independent of the type of corrupting noise, or whether it is stationary or not. While still in an early development stage, the new approach shows consistent improvements in accuracy and robustness for a variety of conditions, where no use is made of a-priori knowledge of the corrupting noise. The approach is shown to be of particular interest to the case of cross-talk corrupting noise, a complicated situation in speech recognition for which the relative gain with the proposed approach is over 24%.

引用

页码：861 / +

页数：2

共 50 条

[1] Toward noise robustness speech recognition
Namarvar, HH
Liaw, J
Berger, TW
[J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016
[2] ADDING CONTROLLED AMOUNT OF NOISE TO IMPROVE RECOGNITION OF COMPRESSED AND SPECTRALLY DISTORTED SPEECH
Nouza, Jan
Cerva, Petr
Silovsky, Jan
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8046 - 8050
[3] Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition
Eickhoff, Patrick
Moeller, Matthias
Rosin, Theresa Pekarek
Twiefel, Johannes
Wermter, Stefan
[J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 376 - 388
[4] Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
Pervaiz, Ayesha
Hussain, Fawad
Israr, Huma
Tahir, Muhammad Ali
Raja, Fawad Riasat
Baloch, Naveed Khan
Ishmanov, Farruh
Zikria, Yousaf Bin
[J]. SENSORS, 2020, 20 (08)
[5] Noise Robustness of Tract Variables and their Application to Speech Recognition
Mitra, Vikramjit
Nam, Hosung
Espy-Wilson, Carol
Saltzman, Elliot
Goldstein, Louis
[J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2735 - +
[6] Improving Noise Robustness of Speech Emotion Recognition System
Juszkiewicz, Lukasz
[J]. INTELLIGENT DISTRIBUTED COMPUTING VII, 2014, 511 : 223 - 232
[7] Noise and speaker robustness in a Persian continuous speech recognition system
Veisi, Hadi
Sameti, Hossein
[J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 73 - 76
[8] Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
Hansen, JHL
[J]. SPEECH COMMUNICATION, 1996, 20 (1-2) : 151 - 173
[9] Adding noise to improve measurement
Andò, B
Graziani, S
[J]. IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2001, 4 (01) : 24 - 31
[10] A Curriculum Learning Method for Improved Noise Robustness in Automatic Speech Recognition
Braun, Stefan
Neil, Daniel
Liu, Shih-Chii
[J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 548 - 552

← 1 2 3 4 5 →