NOISE SUPPRESSION WITH UNSUPERVISED JOINT SPEAKER ADAPTATION AND NOISE MIXTURE MODEL ESTIMATION

被引：0

作者：

Fujimoto, Masakiyo ^{[1
]}

Watanabe, Shinji ^{[1
]}

Nakatani, Tomohiro ^{[1
]}

机构：

[1] NTT Corp, NTT Commun Sci Labs, Seika, Kyoto 6190237, Japan

来源：

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年

关键词：

noise suppression; noise mixture model; speaker adaptation; MMSE estimation; CONTINUOUS SPEECH RECOGNITION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The estimation of an accurate noise model is a crucial problem for model-based noise suppression including a vector Taylor series (VTS)-based approach. The variation of the speaker characteristics is also a crucial factor as regards the model-based noise suppression. As a result, a speaker adaptation technique plays an important role in the model-based noise suppression. To deal with former problem, we have already proposed an unsupervised estimation method for a noise mixture model. Therefore, this paper proposes a joint processing method that simultaneously achieves speaker adaptation and noise mixture model estimation. This joint processing is realized by using minimum mean squared error (MMSE) estimates of clean speech and noise. Although VTS-based approach involves non-linear transformation, the MMSE estimates make it possible to flexibly estimate accurate parameters for the joint processing without the influences of non-linear VTS transformation. In the evaluation, the proposed method provided an improvement compared with results obtained using only noise mixture model estimation.

引用

页码：4713 / 4716

页数：4

共 50 条

[41] Robust Frequency Estimation Under Additive Mixture Noise
Chen, Yuan
Tian, Yulu
Zhang, Dingfan
Huang, Longting
Xu, Jingxin
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 1671 - 1684
[42] Unsupervised estimation of signal-dependent CCD camera noise
Bruno Aiazzi
Luciano Alparone
Stefano Baronti
Massimo Selva
Lorenzo Stefani
EURASIP Journal on Advances in Signal Processing, 2012
[43] Unsupervised estimation of signal-dependent CCD camera noise
Aiazzi, Bruno
Alparone, Luciano
Baronti, Stefano
Selva, Massimo
Stefani, Lorenzo
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
[44] Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA
Kato, M
Sugiyama, A
Serizawa, M
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2002, E85A (07) : 1710 - 1718
[45] Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA
Kato, M
Sugiyama, A
Serizawa, M
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (02): : 43 - 53
[46] Sound Source Localization Using Joint Bayesian Estimation With a Hierarchical Noise Model
Asano, Futoshi
Asoh, Hideki
Nakadai, Kazuhiro
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1953 - 1965
[47] Noise-robust open-set speaker recognition using noise-dependent Gaussian mixture classifier
Gong, YF
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 133 - 136
[48] UBM based speaker selection and model re-estimation for speaker adaptation
Wang, Jian
Guo, Jun
Liu, Gang
Lei, Jianjun
PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 856 - 860
[49] Message Passing Based Gaussian Mixture Model for DOA Estimation in Complex Noise Scenarios
Guan, Shanwen
Lu, Xinhua
Li, Ji
Luo, Xiaonan
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1379 - 1383
[50] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
Hariharan, R
Viikki, O
SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361

← 1 2 3 4 5 →