Speech enhancement based on speech spectral complex Gaussian Mixture Model

Cited by: 0
Authors
Ding, GH [1]
Wang, X [1]
Cao, Y [1]
Ding, F [1]
Tang, YZ [1]
Affiliations
[1] Nokia Res Ctr, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a speech enhancement approach based on a complex Gaussian Mixture Model (GMM) of the speech spectrum. First, an algorithm for constructing the speech spectral GMM is introduced; it is based on a distance measure of the speech spectral Gaussian probability. Then a GMM-based noise estimation algorithm is proposed under the Maximum Likelihood criterion, using the Expectation-Maximization (EM) algorithm. Experimental results show that the GMM-based MMSE estimators, especially the GMM-based MMSE short-time spectral estimator, perform better than alternative speech enhancement algorithms, and that the proposed noise estimation algorithm further improves enhancement performance, especially at low SNRs.
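As a rough illustration of what a GMM-based MMSE spectral estimator can look like, the sketch below computes the posterior-weighted Wiener gain for a single complex spectral coefficient. The abstract does not give the paper's actual model, so everything here is an assumption: a zero-mean complex Gaussian mixture prior on clean speech, additive complex Gaussian noise with known variance, and the hypothetical function name `gmm_mmse_estimate` and its parameters.

```python
import math


def gmm_mmse_estimate(y, weights, speech_vars, noise_var):
    """MMSE estimate of a clean complex spectral coefficient.

    Assumed model (illustrative, not taken from the paper):
      clean speech X ~ sum_k weights[k] * CN(0, speech_vars[k])
      noise        N ~ CN(0, noise_var), independent of X
      observation  y = X + N
    Returns the estimate of X and the component posteriors.
    """
    power = abs(y) ** 2
    # Under component k the observation is CN(0, s_k + noise_var).
    log_liks = []
    for w, s in zip(weights, speech_vars):
        total = s + noise_var
        log_liks.append(math.log(w) - math.log(math.pi * total) - power / total)
    # Normalise in the log domain for numerical stability.
    peak = max(log_liks)
    unnorm = [math.exp(v - peak) for v in log_liks]
    norm = sum(unnorm)
    posteriors = [u / norm for u in unnorm]
    # Each component contributes its own Wiener gain s_k / (s_k + noise_var),
    # weighted by how likely that component is given y.
    gain = sum(p * s / (s + noise_var) for p, s in zip(posteriors, speech_vars))
    return gain * y, posteriors
```

With a single mixture component the estimator reduces to the ordinary Wiener gain; for example, `gmm_mmse_estimate(2 + 2j, [1.0], [1.0], 1.0)` scales the observation by 0.5. The noise variance is taken as given here, whereas the abstract describes estimating the noise by EM under the Maximum Likelihood criterion.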
Pages: 165 - 168
Number of pages: 4
Related papers
50 in total
  • [31] BROAD PHONEME CLASS BASED SPEECH ENHANCEMENT USING MIXTURE MAXIMUM MODEL
    Das, Amit
    Hansen, John H. L.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4762 - 4765
  • [32] Speech Enhancement Based on Deep Mixture of Distinguishing Experts
    Jia, Xupeng
    Li, Dongmei
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 684 - 689
  • [33] SPEECH ENHANCEMENT USING A JOINT MAP ESTIMATOR WITH GAUSSIAN MIXTURE MODEL FOR (NON-)STATIONARY NOISE
    Fodor, Balazs
    Fingscheidt, Tim
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4768 - 4771
  • [34] Spectral-domain speech enhancement for speech recognition
    You, Chang Huai
    Ma, Bin
    SPEECH COMMUNICATION, 2017, 94 : 30 - 41
  • [35] Enhancement of spectral contrast to speech using a sinusoidal model
    Aguilera, CM
    Navas, A
    Tejero, JC
    Gago, A
    ELECTRONICS LETTERS, 1999, 35 (23) : 1997 - 1998
  • [36] Gaussian mixture model based mutual information estimation between frequency bands in speech
    Nilsson, M
    Gustafsson, H
    Andersen, SV
    Kleijn, WB
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 525 - 528
  • [37] A study of speech recognition system based on the Hidden Markov Model with Gaussian-Mixture
    Ben Hazem, Zied
    Zouhir, Youssef
    Ouni, Kais
    2014 INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2014,
  • [38] Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement
    Ou Shifeng
    Song Peng
    Gao Ying
    CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (06) : 1214 - 1220
  • [39] Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement
    OU Shifeng
    SONG Peng
    GAO Ying
    Chinese Journal of Electronics, 2018, 27 (06) : 1214 - 1220
  • [40] SPEECH ENHANCEMENT BASED ON A SINUSOIDAL MODEL
    KATES, JM
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1994, 37 (02) : 449 - 464