Speech enhancement based on speech spectral complex Gaussian Mixture Model

被引：0

作者：

Ding, GH ^{[1
]}

Wang, X ^{[1
]}

Cao, Y ^{[1
]}

Ding, F ^{[1
]}

Tang, YZ ^{[1
]}

机构：

[1] Nokia Res Ctr, Beijing, Peoples R China

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a speech enhancement approach based on speech spectral complex Gaussian Mixture Model (GMM). First, a construction algorithm of speech spectral GMM is introduced and it is based on the distance measure of speech spectral Gaussian probability. Then a noise estimation algorithm based on the GMM is proposed in the Maximum Likelihood criterion using the Expectation-Maximum (EM) algorithm. Speech enhancement experimental results show that the GMM-based MMSE estimators, especially the GMM-based MMSE short-time spectral estimator, can afford better performance than alternative speech enhancement algorithms and the proposed noise estimation algorithm can improve the enhancement performance more, especially at low SNRs.

引用

页码：165 / 168

页数：4

共 50 条

[31] BROAD PHONEME CLASS BASED SPEECH ENHANCEMENT USING MIXTURE MAXIMUM MODEL
Das, Amit
Hansen, John H. L.
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4762 - 4765
[32] Speech Enhancement Based on Deep Mixture of Distinguishing Experts
Jia, Xupeng
Li, Dongmei
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 684 - 689
[33] SPEECH ENHANCEMENT USING A JOINT MAP ESTIMATOR WITH GAUSSIAN MIXTURE MODEL FOR (NON-)STATIONARY NOISE
Fodor, Balazs
Fingscheidt, Tim
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4768 - 4771
[34] Spectral-domain speech enhancement for speech recognition
You, Chang Huai
Ma, Bin
SPEECH COMMUNICATION, 2017, 94 : 30 - 41
[35] Enhancement of spectral contrast to speech using a sinusoidal model
Aguilera, CM
Navas, A
Tejero, JC
Gago, A
ELECTRONICS LETTERS, 1999, 35 (23) : 1997 - 1998
[36] Gaussian mixture model based mutual information estimation between frequency bands in speech
Nilsson, M
Gustafsson, H
Andersen, SV
Kleijn, WB
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 525 - 528
[37] A study of speech recognition system based on the Hidden Markov Model with Gaussian-Mixture
Ben Hazem, Zied
Zouhir, Youssef
Ouni, Kais
2014 INTERNATIONAL CONFERENCE ON ELECTRICAL SCIENCES AND TECHNOLOGIES IN MAGHREB (CISTEM), 2014,
[38] Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement
Ou Shifeng
Song Peng
Gao Ying
CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (06) : 1214 - 1220
[39] Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement
OU Shifeng
SONG Peng
GAO Ying
Chinese Journal of Electronics, 2018, 27 (06) : 1214 - 1220
[40] SPEECH ENHANCEMENT BASED ON A SINUSOIDAL MODEL
KATES, JM
JOURNAL OF SPEECH AND HEARING RESEARCH, 1994, 37 (02): : 449 - 464

← 1 2 3 4 5 →