Adding Noise to Improve Noise Robustness in Speech Recognition

被引:0
|
作者
Morales, Nicolas [1 ]
Gu, Liang [2 ]
Gao, Yuqing [2 ]
机构
[1] Univ Autonoma Madrid, HCTLab, Madrid, Spain
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
关键词
acoustic noise; robustness; speech enhancement; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work we explore a technique for increasing recognition accuracy on speech affected by corrupting noise of an undetermined nature, by the addition of a known and well-behaved noise (masking noise). The same type of noise used for masking is added to the training data, thus reducing the gap between training and test conditions, independent of the type of corrupting noise, or whether it is stationary or not. While still in an early development stage, the new approach shows consistent improvements in accuracy and robustness for a variety of conditions, where no use is made of a-priori knowledge of the corrupting noise. The approach is shown to be of particular interest to the case of cross-talk corrupting noise, a complicated situation in speech recognition for which the relative gain with the proposed approach is over 24%.
引用
收藏
页码:861 / +
页数:2
相关论文
共 50 条
  • [41] Enhancing adversarial robustness of quantum neural networks by adding noise layers
    Huang, Chenyi
    Zhang, Shibin
    [J]. NEW JOURNAL OF PHYSICS, 2023, 25 (08):
  • [42] Articulatory Information for Noise Robust Speech Recognition
    Mitra, Vikramjit
    Nam, Hosung
    Espy-Wilson, Carol Y.
    Saltzman, Elliot
    Goldstein, Louis
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1913 - 1924
  • [43] Speech Emotion Recognition under White Noise
    Huang, Chengwei
    Chen, Guoming
    Yu, Hua
    Bao, Yongqiang
    Zhao, Li
    [J]. ARCHIVES OF ACOUSTICS, 2013, 38 (04) : 457 - 463
  • [44] Benefits of amplification for speech recognition in background noise
    Turner, CW
    Henry, BA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (04): : 1675 - 1680
  • [45] Speaker and Noise Factorization for Robust Speech Recognition
    Wang, Yongqiang
    Gales, Mark J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
  • [46] Robust speech recognition for car environment noise
    Kokubo, H
    Amano, A
    Hataoka, N
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
  • [47] Toddlers' recognition of noise-vocoded speech
    Newman, Rochelle
    Chatterjee, Monita
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (01): : 483 - 494
  • [48] SPECTRAL ESTIMATION FOR NOISE ROBUST SPEECH RECOGNITION
    ERELL, A
    WEINTRAUB, M
    [J]. SPEECH AND NATURAL LANGUAGE, 1989, : 319 - 324
  • [49] A PROCEDURE FOR QUANTIFYING THE EFFECTS OF NOISE ON SPEECH RECOGNITION
    DIRKS, DD
    MORGAN, DE
    DUBNO, JR
    [J]. JOURNAL OF SPEECH AND HEARING DISORDERS, 1982, 47 (02): : 114 - 123
  • [50] The Application of Speech Recognition System in Noise Environment
    Niu, Gang
    Ren, Xinzhi
    Wu, Guoqing
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL PRE-OLYMPIC CONGRESS ON COMPUTER SCIENCE, VOL I: COMPUTER SCIENCE AND ENGINEERING, 2008, : 1 - 5