Single channel speech enhancement using iterative constrained NMF based adaptive wiener gain

被引：1

作者：

Yechuri, Sivaramakrishna ^{[1
]}

Vanambathina, Sunnydayal ^{[1
]}

机构：

[1] VIT AP Univ, SENSE, Amaravati, India

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2024年 / 83卷 / 09期

关键词：

NMF; Adaptive wiener gain; Inverse nakagami; Erlang; Inverse gamma; Students-t probability density functions; SDR; PESQ; STOI; NONNEGATIVE MATRIX FACTORIZATION; ALGORITHMS; EXTRACTION; MACHINE; FILTER;

D O I：

10.1007/s11042-023-16480-w

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a novel single channel speech enhancement algorithm using iterative constrained Non-negative matrix factorization (NMF) based adaptive Wiener gain for non-stationary noise. In the recent past, NMF-based Wiener filtering methods were used for speech enhancement. The Wiener filter performance depends on the adaptive gain factor value. The adaptive gain factor (alpha) value is constant regardless of noise type and signal to noise ratio (SNR), so it will affect speech enhancement performance. To overcome this, the adaptive factor value is calculated using a genetic algorithm (GA). Here, the GA adjusts the adaptive Wiener gain based on noise type and SNR level. The GA-based adaptive Wiener gain minimizes Wiener filter estimation errors and improves speech quality by adjusting the base vector weights of noise and speech. Additionally, we use the iterative constraints NMF (IC-NMF) method for calculating the priors from noisy speech magnitudes. We select the Erlang, Inverse Gamma, Students-t, and Inverse Nakagami distributions for speech priors and Gaussian distributions for noise priors. Noise and speech samples are well correlated with those distributions. This provides accurate estimation of the necessary statistics of these distributions to regularize the NMF criterion. So, we combine an iterative constrained NMF and a genetic algorithm-based adaptive Wiener filtering method for speech enhancement. The proposed method outperforms other benchmark algorithms in terms of source to distortion ratio (SDR), short-time objective intelligibility (STOI), and perceptual evaluation of speech quality (PESQ).

引用

页码：26233 / 26254

页数：22

共 50 条

[1] Single channel speech enhancement using iterative constrained NMF based adaptive wiener gain
Sivaramakrishna Yechuri
Sunnydayal Vanambathina
Multimedia Tools and Applications, 2024, 83 : 26233 - 26254
[2] Genetic Algorithm-Based Adaptive Wiener Gain for Speech Enhancement Using an Iterative Posterior NMF
Yechuri, Sivaramakrishna
Vanabathina, Sunny Dayal
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2023, 23 (06)
[3] Weibull and Nakagami speech priors based regularized NMF with adaptive wiener filter for speech enhancement
Jannu C.
Vanambathina S.D.
International Journal of Speech Technology, 2023, 26 (01) : 197 - 209
[4] Enhancement of Binaural Speech Using Codebook Constrained Iterative Binaural Wiener Filter
Cazi, Nadir
Sreenivas, T. V.
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1371 - 1374
[5] SPEECH ENHANCEMENT USING A FRAME ADAPTIVE GAIN FUNCTION FOR WIENER FILTERING
da Silva, Luiz Felipe
Bermudez, Jose C. M.
2011 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2011, : 389 - 392
[6] Glottal codebook constrained iterative Wiener filtering speech enhancement
Dai, Mingyang
Zhou, Yi
Xu, Boling
Shengxue Xuebao/Acta Acustica, 2003, 28 (01): : 21 - 27
[7] Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks
Bai, Zhigang
Bao, Changchun
Cui, Zihao
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
[8] Single-channel speech enhancement method using reconstructive NMF with spectrotemporal speech presence probabilities
Lee, Seongjae
Han, David K.
Ko, Hanseok
APPLIED ACOUSTICS, 2017, 117 : 257 - 262
[9] SINGLE-CHANNEL ENHANCEMENT OF CONVOLUTIVE NOISY SPEECH BASED ON A DISCRIMINATIVE NMF ALGORITHM
Chung, Hanwook
Plourde, Eric
Champagne, Benoit
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2302 - 2306
[10] Single Channel Blind Source Separation Based on NMF and Its Application to Speech Enhancement
Chen, Yongqiang
2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1066 - 1069

← 1 2 3 4 5 →