Multichannel speech separation using adaptive parameterization of source PDFs

被引:0
|
作者
Kokkinakis, K [1 ]
Nandi, AK [1 ]
机构
[1] Univ Liverpool, Dept Elect Engn & Elect, Signal Proc & Commun Grp, Liverpool L69 3GJ, Merseyside, England
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Convolutive and temporally correlated mixtures of speech are tackled with an LP-based temporal pre-whitening stage combined with the natural gradient algorithm (NGA), to essentially perform spatial separation by maximizing entropy at the output of a nonlinear function. In the past, speech sources have been parameterized by the generalized Gaussian density (GGD) model, in which the exponent parameter directly relates to the exponent of the corresponding optimal nonlinear function. In this paper, we present an adaptive, source dependent estimation of this parameter, controlled exclusively by the statistics of the output source estimates. Comparative experimental results illustrate the inherent flexibility of the proposed method, as well as an overall increase in convergence speed and separation performance over existing approaches.
引用
收藏
页码:486 / 493
页数:8
相关论文
共 50 条
  • [1] Multichannel blind deconvolution for source separation in convolutive mixtures of speech
    Kokkinakis, K
    Nandi, AK
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 200 - 212
  • [2] ONLINE IVA WITH ADAPTIVE LEARNING FOR SPEECH SEPARATION USING VARIOUS SOURCE PRIORS
    Erateb, Suleiman
    Naqvi, Mohsen
    Chambers, Jonathon
    2017 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE (SSPD), 2017, : 74 - 78
  • [3] ADAPTIVE SPARSE SOURCE SEPARATION WITH APPLICATION TO SPEECH SIGNALS
    Azizi, Elham
    Mohimani, G. Hosein
    Babaie-Zadeh, Massoud
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 640 - 643
  • [4] AN EM ALGORITHM FOR JOINT SOURCE SEPARATION AND DIARISATION OF MULTICHANNEL CONVOLUTIVE SPEECH MIXTURES
    Kounades-Bastian, Dionyssos
    Girin, Laurent
    Alameda-Pineda, Xavier
    Gannot, Sharon
    Horaud, Radu
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 16 - 20
  • [5] A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction
    Souden, Mehrez
    Araki, Shoko
    Kinoshita, Keisuke
    Nakatani, Tomohiro
    Sawada, Hiroshi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1913 - 1928
  • [6] Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
    Na, Yueyue
    Wang, Ziteng
    Liu, Zhang
    Tian, Biao
    Fu, Qiang
    INTERSPEECH 2021, 2021, : 1144 - 1148
  • [7] Approximate maximum likelihood blind source separation with arbitrary source PDFS
    Ghogho, Mounir
    Swami, Ananthram
    Durrani, Tariq
    IEEE Signal Processing Workshop on Statistical Signal and Array Processing, SSAP, 2000, : 368 - 372
  • [8] Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function
    Li, Xiaofei
    Girin, Laurent
    Gannot, Sharon
    Horaud, Radu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (03) : 645 - 659
  • [9] Approximate maximum likelihood blind source separation with arbitrary source PDFS
    Ghogho, M
    Swami, A
    Durrani, T
    PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, : 368 - 372
  • [10] Combined estimation scheme for blind source separation with arbitrary source PDFs
    Zarzoso, V
    Nandi, AK
    Herrmann, F
    Millet-Roig, J
    ELECTRONICS LETTERS, 2001, 37 (02) : 132 - 133