Soft Decision Based Gaussian-Laplacian Combination Model for Noisy Speech Enhancement

被引:0
|
作者
OU Shifeng [1 ]
SONG Peng [2 ]
GAO Ying [1 ]
机构
[1] School of Science and Technology for Opto-electronic Information, Yantai University
[2] School of Computer and Control Engineering, Yantai University
基金
中国国家自然科学基金;
关键词
Speech enhancement; Soft decision; Speech distortion; Combination model;
D O I
暂无
中图分类号
TN912.3 [语音信号处理];
学科分类号
0711 ;
摘要
One of the key issues of noisy speech enhancement technique is to achieve appropriate statistical distributions to model the clean speech and noise signals accurately. Most of the existing algorithms try to employ a sole model assumption in transform domain, which, however, has been proven to being contrary with the fact. To address this problem, the statistical properties of clean speech as well as several noise signals are analyzed using actual data in Discrete cosine transform(DCT) domain, and the study indicates the statistic of clean speech DCT coefficients tending to fall somewhere in between the Gaussian and Laplacian distribution. Based on the results,a novel speech enhancement algorithm is proposed using Gaussian-Laplacian combination model, whose core is employing a linear combination of Gaussian and Laplacian distribution to model the statistic of clean speech DCT coefficients. The corresponding weights of either distribution to the combination model are adaptively adjusted in terms of the probability of each hypothesis, which is estimated based on a soft decision technique by using Bayesian theorem. Through a number of objective and subjective tests,we compare the performance of the proposed algorithm with other recent model based approaches and have found that our algorithm is superior to the related approaches at all testing environments.
引用
收藏
页码:827 / 834
页数:8
相关论文
共 50 条
  • [11] Noise robust face hallucination employing Gaussian-Laplacian mixture model
    Wang, Zhong-Yuan
    Han, Zhen
    Hu, Rui-Min
    Jiang, Jun-Jun
    [J]. NEUROCOMPUTING, 2014, 133 : 153 - 160
  • [12] New UWB receiver designs based on a Gaussian-Laplacian noise-plus-MAI model
    Beaulieu, N. C.
    Niranjayan, S.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-14, 2007, : 4128 - 4133
  • [13] Data Augmentation for Histopathological Images Based on Gaussian-Laplacian Pyramid Blending
    Mpinda Ataky, Steve Tsham
    de Matos, Jonathan
    Britto Jr, Alceu de S.
    Oliveira, Luiz E. S.
    Koerich, Alessandro L.
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [14] Improved Laplacian Factor Estimation for Noisy Speech Enhancement
    Ou, Shifeng
    Zhao, Xiaohui
    Gao, Ying
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 2911 - +
  • [15] Employing Laplacian-Gaussian densities for speech enhancement
    Gazor, S
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 297 - 300
  • [16] Speech enhancement employing Laplacian-Gaussian mixture
    Gazor, S
    Zhang, W
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 896 - 904
  • [17] Complex laplacian probability density function for noisy speech enhancement
    Chang, Joon-Hyuk
    [J]. IEICE ELECTRONICS EXPRESS, 2007, 4 (08): : 245 - 250
  • [18] Noisy speech recognition based on speech enhancement
    Wang, Xia
    Tang, Hongmei
    Zhao, Xiaoqun
    [J]. SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 713 - +
  • [19] Speech enhancement based on Laplacian-Gaussian model and simplified phase discrimination in Discrete Cosine Transform domain
    School of Electronics and Information, Suzhou University, Suzhou 215021, China
    不详
    [J]. Shengxue Xuebao, 2008, 3 (244-251):
  • [20] Model-based feature enhancement for noisy speech recognition
    Couvreur, C
    Van hamme, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722