A perceptual masking approach for noise robust speech recognition

被引:8
|
作者
Maganti, Hari Krishna [1 ]
Matassoni, Marco [1 ]
机构
[1] Fdn Bruno Kessler CIT Irst, I-38123 Trento, Italy
关键词
ENHANCEMENT;
D O I
10.1186/1687-4722-2012-29
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article describes a modified technique for enhancing noisy speech to improve automatic speech recognition (ASR) performance. The proposed approach improves the widely used spectral subtraction which inherently suffers from the associated musical noise effects. Through a psychoacoustic masking and critical band variance normalization technique, the artifacts produced by spectral subtraction are minimized for improving the ASR accuracy. The popular advanced ETSI-2 front end is tested for comparison purposes. The performed speech recognition evaluations on the noisy standard AURORA-2 tasks show enhanced performance for all noise conditions.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A perceptual masking approach for noise robust speech recognition
    Hari Krishna Maganti
    Marco Matassoni
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [2] HISTOGRAM EQUALIZATION AND NOISE MASKING FOR ROBUST SPEECH RECOGNITION
    Zhang, Xueru
    Demuynck, Kris
    Van Hamme, Hugo
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4578 - 4581
  • [3] An engineering model of the masking for the noise-robust speech recognition
    Park, KY
    Lee, SY
    [J]. NEUROCOMPUTING, 2003, 52-4 : 615 - 620
  • [4] Psychoacoustic masking effect for noise robust speech recognition robot
    Miyanaga, Yoshikazu
    [J]. ISSCS 2019 - International Symposium on Signals, Circuits and Systems, 2019,
  • [5] Psychoacoustic Masking Effect for Noise Robust Speech Recognition Robot
    Miyanaga, Yoshikazu
    [J]. 2019 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS 2019), 2019,
  • [6] The perceptual wavelet feature for noise robust Vietnamese speech recognition
    Trung, Nguyen Quoc
    Nghia, Phung Trung
    [J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 255 - +
  • [7] A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks
    Li, Bo
    Sim, Khe Chai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1296 - 1305
  • [8] Novel frequency masking curves for noise-robust automatic speech recognition
    Chen, Chia-Ping
    Yeh, Ja-Zang
    Wu, Bo-Feng
    [J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2013, 36 (06) : 696 - 703
  • [9] Vector Taylor Series Expansion with Auditory Masking for Noise Robust Speech Recognition
    Das, Biswajit
    Panda, Ashish
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [10] Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition
    Gonzalez, Jose A.
    Gomez, Angel M.
    Peinado, Antonio M.
    Ma, Ning
    Barker, Jon
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (09) : 3731 - 3760