Speech enhancement for non-stationary noise environments

Cited by: 467

Authors:

Cohen, I [1]

Berdugo, B [1]

Affiliations:

[1] Lamar Signal Proc Ltd, IL-20692 Yokneam Ilit, Israel

Source:

SIGNAL PROCESSING | 2001 / Vol. 81 / No. 11

Keywords:

DOI:

10.1016/S0165-1684(01)00128-1

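Chinese Library Classification (CLC) codes:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;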

Abstract:

In this paper, we present an optimally-modified log-spectral amplitude (OM-LSA) speech estimator and a minima controlled recursive averaging (MCRA) noise estimation approach for robust speech enhancement. The spectral gain function, which minimizes the mean-square error of the log-spectra, is obtained as a weighted geometric mean of the hypothetical gains associated with the speech presence uncertainty. The noise estimate is given by averaging past spectral power values, using a smoothing parameter that is adjusted by the speech presence probability in subbands. We introduce two distinct speech presence probability functions, one for estimating the speech and one for controlling the adaptation of the noise spectrum. The former is based on the time-frequency distribution of the a priori signal-to-noise ratio. The latter is determined by the ratio between the local energy of the noisy signal and its minimum within a specified time window. Objective and subjective evaluations under various environmental conditions confirm the superiority of the OM-LSA and MCRA estimators. Excellent noise suppression is achieved, while retaining weak speech components and avoiding the musical residual noise phenomena. © 2001 Elsevier Science B.V. All rights reserved.
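The two estimators described in the abstract can be illustrated with a minimal Python sketch for a single subband. It is not the authors' implementation: the function names, the parameter values (`alpha_d`, `alpha_p`, `delta`, `window`), and the use of raw frame power in place of a smoothed local energy are all assumptions made here for illustration. It shows only the structure stated in the abstract: a gain formed as a weighted geometric mean, and a recursive noise average whose smoothing parameter is driven by a speech presence probability derived from the energy-to-minimum ratio.

```python
from collections import deque


def omlsa_gain(gain_speech, gain_floor, p_speech):
    """Weighted geometric mean of two hypothetical gains.

    gain_speech: gain assumed under speech presence (illustrative input)
    gain_floor:  small gain assumed under speech absence (illustrative input)
    p_speech:    speech presence probability in [0, 1]
    """
    return (gain_speech ** p_speech) * (gain_floor ** (1.0 - p_speech))


def presence_from_minimum(local_energy, window_min, prev_p,
                          delta=5.0, alpha_p=0.2):
    """Soft presence estimate from the ratio of local energy to its minimum.

    delta and alpha_p are assumed values, not taken from the abstract.
    """
    # Compare against delta * window_min to avoid dividing by zero.
    indicator = 1.0 if local_energy > delta * window_min else 0.0
    return alpha_p * prev_p + (1.0 - alpha_p) * indicator


def update_noise(noise_psd, noisy_power, p_speech, alpha_d=0.95):
    """One recursive-averaging step of the noise estimate.

    The smoothing parameter approaches 1 as speech becomes more likely,
    so the estimate is held during speech and adapts during pauses.
    """
    alpha = alpha_d + (1.0 - alpha_d) * p_speech
    return alpha * noise_psd + (1.0 - alpha) * noisy_power


def track_noise(powers, window=8):
    """Run the noise tracker over a sequence of frame powers (one subband)."""
    recent = deque(maxlen=window)  # sliding window used for the minimum
    noise = powers[0]
    p = 0.0
    estimates = []
    for power in powers:
        recent.append(power)
        p = presence_from_minimum(power, min(recent), p)
        noise = update_noise(noise, power, p)
        estimates.append(noise)
    return estimates
```

For example, `track_noise([1.0] * 10 + [50.0] * 3 + [1.0] * 10)` holds the noise estimate close to 1.0 through the three loud frames, because the presence probability rises and pushes the smoothing parameter toward 1.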

Pages: 2403-2418

Number of pages: 16

50 items in total

[1] Single Channel Speech Enhancement for Mixed Non-stationary Noise Environments
Singh, Sachin
Tripathy, Manoj
Anand, R. S.
[J]. ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 545 - 555
[2] Sparse Hidden Markov Models for Speech Enhancement in Non-Stationary Noise Environments
Deng, Feng
Bao, Changchun
Kleijn, W. Bastiaan
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1973 - 1987
[3] Robust Speech Enhancement Techniques for ASR in Non-stationary Noise and Dynamic Environments
Liu, Gang
Dimitriadis, Dimitrios
Bocchieri, Enrico
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3016 - 3020
[4] SPARSE HMM-BASED SPEECH ENHANCEMENT METHOD FOR STATIONARY AND NON-STATIONARY NOISE ENVIRONMENTS
Deng, Feng
Bao, Chang-chun
Kleijn, W. Bastiaan
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5073 - 5077
[5] Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments
Malah, D
Cox, RV
Accardi, AJ
[J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 789 - 792
[6] Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments
Duan, Zhiyao
Mysore, Gautham J.
Smaragdis, Paris
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 594 - 597
[7] A Novel Expectation-Maximization Framework for Speech Enhancement in Non-Stationary Noise Environments
Lun, Daniel P. K.
Shen, Tak-Wai
Ho, K. C.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 335 - 346
[8] Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech
Norholm, Sidsel Marie
Jensen, Jesper Rindom
Christensen, Mads Græsbøll
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (04) : 645 - 658
[9] Speech Enhancement in Non-Stationary Noise Using Compressive Sensing
Sulong, Amart
Gunawan, Teddy Surya
Khalifa, Othman O.
Kartiwi, Mira
[J]. PROCEEDINGS OF 6TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING (ICCCE 2016), 2016, : 489 - 493
[10] Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement
Mai, Van-Khanh
Pastor, Dominique
Aissa-El-Bey, Abdeldjalil
Le-Bidan, Raphael
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (04) : 670 - 682