A perceptual masking approach for noise robust speech recognition

Cited by: 8

Authors:

Maganti, Hari Krishna [1]

Matassoni, Marco [1]

Affiliations:

[1] Fdn Bruno Kessler CIT Irst, I-38123 Trento, Italy

Source:

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2012

Keywords:

ENHANCEMENT;

DOI:

10.1186/1687-4722-2012-29

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

This article describes a modified technique for enhancing noisy speech to improve automatic speech recognition (ASR) performance. The proposed approach improves on the widely used spectral subtraction method, which inherently suffers from musical noise artifacts. Psychoacoustic masking and critical band variance normalization minimize the artifacts produced by spectral subtraction and thereby improve ASR accuracy. The popular advanced ETSI-2 front end is tested for comparison. Speech recognition evaluations on the standard noisy AURORA-2 tasks show improved performance for all noise conditions.
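For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of generic textbook power spectral subtraction for a single frame. It is not the authors' method: the psychoacoustic masking and critical band variance normalization stages are not shown. The function name `spectral_subtract`, the over-subtraction factor `alpha`, the spectral floor `beta`, and their default values are all illustrative assumptions.

```python
def spectral_subtract(noisy_power, noise_power, alpha=2.0, beta=0.01):
    """Generic power spectral subtraction for one frame (illustrative sketch).

    noisy_power: per-bin power of the noisy speech frame
    noise_power: per-bin noise power estimate (assumed to be given)
    alpha: over-subtraction factor (assumed default, not from the paper)
    beta: spectral floor as a fraction of the noisy power (assumed default)
    """
    if len(noisy_power) != len(noise_power):
        raise ValueError("spectra must have the same number of bins")

    cleaned = []
    for y, n in zip(noisy_power, noise_power):
        estimate = y - alpha * n   # subtract the scaled noise estimate
        floor = beta * y           # keep a small fraction of the noisy power
        cleaned.append(max(estimate, floor))  # never return negative power
    return cleaned


# Toy frame with four frequency bins and a flat noise estimate.
frame = [4.0, 1.0, 9.0, 0.5]
noise = [1.0, 1.0, 1.0, 1.0]
enhanced = spectral_subtract(frame, noise)
```

In this toy frame, the second and fourth bins fall below the scaled noise estimate and are clamped to the floor, while the first and third survive. Isolated surviving peaks of this kind, varying from frame to frame, are commonly described as the source of the musical noise that the abstract mentions.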


Pages: 9


[1] A perceptual masking approach for noise robust speech recognition
Hari Krishna Maganti
Marco Matassoni
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2012
[2] HISTOGRAM EQUALIZATION AND NOISE MASKING FOR ROBUST SPEECH RECOGNITION
Zhang, Xueru
Demuynck, Kris
Van Hamme, Hugo
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4578 - 4581
[3] An engineering model of the masking for the noise-robust speech recognition
Park, KY
Lee, SY
[J]. NEUROCOMPUTING, 2003, 52-4 : 615 - 620
[4] Psychoacoustic masking effect for noise robust speech recognition robot
Miyanaga, Yoshikazu
[J]. ISSCS 2019 - International Symposium on Signals, Circuits and Systems, 2019,
[5] Psychoacoustic Masking Effect for Noise Robust Speech Recognition Robot
Miyanaga, Yoshikazu
[J]. 2019 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS 2019), 2019,
[6] The perceptual wavelet feature for noise robust Vietnamese speech recognition
Trung, Nguyen Quoc
Nghia, Phung Trung
[J]. 2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 255 - +
[7] A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks
Li, Bo
Sim, Khe Chai
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1296 - 1305
[8] Novel frequency masking curves for noise-robust automatic speech recognition
Chen, Chia-Ping
Yeh, Ja-Zang
Wu, Bo-Feng
[J]. JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2013, 36 (06) : 696 - 703
[9] Vector Taylor Series Expansion with Auditory Masking for Noise Robust Speech Recognition
Das, Biswajit
Panda, Ashish
[J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[10] Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition
Gonzalez, Jose A.
Gomez, Angel M.
Peinado, Antonio M.
Ma, Ning
Barker, Jon
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (09) : 3731 - 3760
