NOISE SUPPRESSION WITH UNSUPERVISED JOINT SPEAKER ADAPTATION AND NOISE MIXTURE MODEL ESTIMATION

被引:0
|
作者
Fujimoto, Masakiyo [1 ]
Watanabe, Shinji [1 ]
Nakatani, Tomohiro [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Seika, Kyoto 6190237, Japan
关键词
noise suppression; noise mixture model; speaker adaptation; MMSE estimation; CONTINUOUS SPEECH RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The estimation of an accurate noise model is a crucial problem for model-based noise suppression including a vector Taylor series (VTS)-based approach. The variation of the speaker characteristics is also a crucial factor as regards the model-based noise suppression. As a result, a speaker adaptation technique plays an important role in the model-based noise suppression. To deal with former problem, we have already proposed an unsupervised estimation method for a noise mixture model. Therefore, this paper proposes a joint processing method that simultaneously achieves speaker adaptation and noise mixture model estimation. This joint processing is realized by using minimum mean squared error (MMSE) estimates of clean speech and noise. Although VTS-based approach involves non-linear transformation, the MMSE estimates make it possible to flexibly estimate accurate parameters for the joint processing without the influences of non-linear VTS transformation. In the evaluation, the proposed method provided an improvement compared with results obtained using only noise mixture model estimation.
引用
收藏
页码:4713 / 4716
页数:4
相关论文
共 50 条
  • [41] Robust Frequency Estimation Under Additive Mixture Noise
    Chen, Yuan
    Tian, Yulu
    Zhang, Dingfan
    Huang, Longting
    Xu, Jingxin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 1671 - 1684
  • [42] Unsupervised estimation of signal-dependent CCD camera noise
    Bruno Aiazzi
    Luciano Alparone
    Stefano Baronti
    Massimo Selva
    Lorenzo Stefani
    EURASIP Journal on Advances in Signal Processing, 2012
  • [43] Unsupervised estimation of signal-dependent CCD camera noise
    Aiazzi, Bruno
    Alparone, Luciano
    Baronti, Stefano
    Selva, Massimo
    Stefani, Lorenzo
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [44] Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA
    Kato, M
    Sugiyama, A
    Serizawa, M
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2002, E85A (07) : 1710 - 1718
  • [45] Noise suppression with high speech quality based on weighted noise estimation and MMSE STSA
    Kato, M
    Sugiyama, A
    Serizawa, M
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (02): : 43 - 53
  • [46] Sound Source Localization Using Joint Bayesian Estimation With a Hierarchical Noise Model
    Asano, Futoshi
    Asoh, Hideki
    Nakadai, Kazuhiro
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1953 - 1965
  • [47] Noise-robust open-set speaker recognition using noise-dependent Gaussian mixture classifier
    Gong, YF
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 133 - 136
  • [48] UBM based speaker selection and model re-estimation for speaker adaptation
    Wang, Jian
    Guo, Jun
    Liu, Gang
    Lei, Jianjun
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 856 - 860
  • [49] Message Passing Based Gaussian Mixture Model for DOA Estimation in Complex Noise Scenarios
    Guan, Shanwen
    Lu, Xinhua
    Li, Ji
    Luo, Xiaonan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1379 - 1383
  • [50] An integrated study of speaker normalisation and HMM adaptation for noise robust speaker-independent speech recognition
    Hariharan, R
    Viikki, O
    SPEECH COMMUNICATION, 2002, 37 (3-4) : 349 - 361