Single channel speech enhancement using iterative constrained NMF based adaptive wiener gain

被引:0
|
作者
Sivaramakrishna Yechuri
Sunnydayal Vanambathina
机构
[1] VIT-AP University,SENSE
来源
关键词
NMF; Adaptive wiener gain; Inverse nakagami; Erlang; Inverse gamma; Students-t probability density functions; SDR; PESQ; STOI;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a novel single channel speech enhancement algorithm using iterative constrained Non-negative matrix factorization (NMF) based adaptive Wiener gain for non-stationary noise. In the recent past, NMF-based Wiener filtering methods were used for speech enhancement. The Wiener filter performance depends on the adaptive gain factor value. The adaptive gain factor (α\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}) value is constant regardless of noise type and signal to noise ratio (SNR), so it will affect speech enhancement performance. To overcome this, the adaptive factor value is calculated using a genetic algorithm (GA). Here, the GA adjusts the adaptive Wiener gain based on noise type and SNR level. The GA-based adaptive Wiener gain minimizes Wiener filter estimation errors and improves speech quality by adjusting the base vector weights of noise and speech. Additionally, we use the iterative constraints NMF (IC-NMF) method for calculating the priors from noisy speech magnitudes. We select the Erlang, Inverse Gamma, Students-t, and Inverse Nakagami distributions for speech priors and Gaussian distributions for noise priors. Noise and speech samples are well correlated with those distributions. This provides accurate estimation of the necessary statistics of these distributions to regularize the NMF criterion. So, we combine an iterative constrained NMF and a genetic algorithm-based adaptive Wiener filtering method for speech enhancement. The proposed method outperforms other benchmark algorithms in terms of source to distortion ratio (SDR), short-time objective intelligibility (STOI), and perceptual evaluation of speech quality (PESQ).
引用
收藏
页码:26233 / 26254
页数:21
相关论文
共 50 条
  • [41] Speech enhancement for personal communication using an adaptive gain equalizer
    Westerlund, N
    Dahl, M
    Claesson, I
    SIGNAL PROCESSING, 2005, 85 (06) : 1089 - 1101
  • [42] A spectral conversion approach to the iterative wiener filter for speech enhancement
    Mouchtaris, A
    Van der Spiegel, J
    Mueller, P
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1971 - 1974
  • [43] A Two-step NMF Based Algorithm for Single Channel Speech Separation
    Wang, Shuo
    Wu, Wenjun
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1987 - 1990
  • [44] DNN TRAINING BASED ON CLASSIC GAIN FUNCTION FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 910 - 914
  • [45] A NOVEL SINGLE CHANNEL SPEECH ENHANCEMENT APPROACH BY COMBINING WIENER FILTER AND DICTIONARY LEARNING
    Tseng, Hung-Wei
    Vishnubhotla, Srikanth
    Hong, Mingyi
    Xiao, Jinjun
    Luo, Zhi-Quan
    Zhang, Tao
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8653 - 8657
  • [46] Enhancement of single channel speech quality and intelligibility in multiple noise conditions using wiener filter and deep CNN
    Hepsiba, D.
    Justin, Judith
    SOFT COMPUTING, 2022, 26 (23) : 13037 - 13047
  • [47] Enhancement of single channel speech quality and intelligibility in multiple noise conditions using wiener filter and deep CNN
    D. Hepsiba
    Judith Justin
    Soft Computing, 2022, 26 : 13037 - 13047
  • [48] Speech Enhancement Using A Critical Point Based Wiener Filter
    Lu, Meihui
    Zhou, Xuan
    Jaber, Nabih
    Hua, Kun
    Ali, Mahdi
    2017 ADVANCES IN WIRELESS AND OPTICAL COMMUNICATIONS (RTUWO), 2017, : 175 - 179
  • [49] An iterative posterior NMF method for speech enhancement in the presence of additive Gaussian noise
    Sunnydayal
    Kumar, Kishore
    Cruces, Sergio
    NEUROCOMPUTING, 2017, 230 : 312 - 315
  • [50] Single Channel Speech Enhancement by Frequency Domain Constrained Optimization and Temporal Masking
    Jin, Wen
    Scordilis, Michael
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1411 - 1414