Adding Noise to Improve Noise Robustness in Speech Recognition

被引:0
|
作者
Morales, Nicolas [1 ]
Gu, Liang [2 ]
Gao, Yuqing [2 ]
机构
[1] Univ Autonoma Madrid, HCTLab, Madrid, Spain
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
关键词
acoustic noise; robustness; speech enhancement; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we explore a technique for increasing recognition accuracy on speech affected by corrupting noise of an undetermined nature, by the addition of a known and well-behaved noise (masking noise). The same type of noise used for masking is added to the training data, thus reducing the gap between training and test conditions, independent of the type of corrupting noise, or whether it is stationary or not. While still in an early development stage, the new approach shows consistent improvements in accuracy and robustness for a variety of conditions, where no use is made of a-priori knowledge of the corrupting noise. The approach is shown to be of particular interest to the case of cross-talk corrupting noise, a complicated situation in speech recognition for which the relative gain with the proposed approach is over 24%.
引用
收藏
页码:861 / +
页数:2
相关论文
共 50 条
  • [1] Toward noise robustness speech recognition
    Namarvar, HH
    Liaw, J
    Berger, TW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016
  • [2] ADDING CONTROLLED AMOUNT OF NOISE TO IMPROVE RECOGNITION OF COMPRESSED AND SPECTRALLY DISTORTED SPEECH
    Nouza, Jan
    Cerva, Petr
    Silovsky, Jan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8046 - 8050
  • [3] Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition
    Eickhoff, Patrick
    Moeller, Matthias
    Rosin, Theresa Pekarek
    Twiefel, Johannes
    Wermter, Stefan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 376 - 388
  • [4] Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
    Pervaiz, Ayesha
    Hussain, Fawad
    Israr, Huma
    Tahir, Muhammad Ali
    Raja, Fawad Riasat
    Baloch, Naveed Khan
    Ishmanov, Farruh
    Zikria, Yousaf Bin
    [J]. SENSORS, 2020, 20 (08)
  • [5] Noise Robustness of Tract Variables and their Application to Speech Recognition
    Mitra, Vikramjit
    Nam, Hosung
    Espy-Wilson, Carol
    Saltzman, Elliot
    Goldstein, Louis
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2735 - +
  • [6] Improving Noise Robustness of Speech Emotion Recognition System
    Juszkiewicz, Lukasz
    [J]. INTELLIGENT DISTRIBUTED COMPUTING VII, 2014, 511 : 223 - 232
  • [7] Noise and speaker robustness in a Persian continuous speech recognition system
    Veisi, Hadi
    Sameti, Hossein
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 73 - 76
  • [8] Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
    Hansen, JHL
    [J]. SPEECH COMMUNICATION, 1996, 20 (1-2) : 151 - 173
  • [9] Adding noise to improve measurement
    Andò, B
    Graziani, S
    [J]. IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2001, 4 (01) : 24 - 31
  • [10] A Curriculum Learning Method for Improved Noise Robustness in Automatic Speech Recognition
    Braun, Stefan
    Neil, Daniel
    Liu, Shih-Chii
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 548 - 552