Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation

被引:1
|
作者
Fontaine, Mathieu [1 ,2 ]
Sekiguchi, Kouhei [2 ]
Nugraha, Aditya Arie [2 ]
Bando, Yoshiaki [4 ]
Yoshii, Kazuyoshi [2 ,3 ]
机构
[1] Telecom Paris, LTCI, Inst Polytech Paris, Palaiseau, France
[2] Ctr Adv Intelligence Project AIP, RIKEN, Tokyo 1030027, Japan
[3] Kyoto Univ, Grad Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
[4] Natl Inst Adv Ind Sci & Technol, Koto Ku, Tokyo 1350064, Japan
关键词
Nonnegative matrix factorization; blind source separation; probabilistic framework; expectation-maximization; INDEPENDENT VECTOR ANALYSIS; SPEECH ENHANCEMENT; MODEL;
D O I
10.1109/TASLP.2022.3172631
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes heavy-tailed extensions of a state of the art versatile blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) from a unified point of view. The common way of deriving such an extension is to replace the multivariate complex Gaussian distribution in the likelihood function with its heavy-tailed generalization, e.g., the multivariate complex Student's t and leptokurtic generalized Gaussian distributions, and tailor-make the corresponding parameter optimization algorithm. Using a wider class of heavy-tailed distributions called a Gaussian scale mixture (GSM), i.e., a mixture of Gaussian distributions whose variances are perturbed by positive random scalars called impulse variables, we propose GSM-FastMNMF and develop an expectation-maximization algorithm that works even when the probability density function of the impulse variables have no analytical expressions. We show that existing heavy-tailed FastMNMF extensions are instances of GSM-FastMNMF and derive a new instance based on the generalized hyperbolic distribution that include the normal-inverse Gaussian, Student's t, and Gaussian distributions as the special cases. Our experiments show that the normal-inverse Gaussian FastMNMF outperforms the state-of-the-art FastMNMF extensions and ILRMA model in speech enhancement and separation in terms of the signal-to-distortion ratio.
引用
收藏
页码:1734 / 1748
页数:15
相关论文
共 50 条
  • [21] Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation
    Sekiguchi, Kouhei
    Bando, Yoshiaki
    Nugraha, Aditya Arie
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2610 - 2625
  • [22] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [23] Multichannel Blind Sound Source Separation Using Spatial Covariance Model With Level and Time Differences and Nonnegative Matrix Factorization
    Carabias-Orti, Julio Jose
    Nikunen, Joonas
    Virtanen, Tuomas
    Vera-Candeas, Pedro
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1512 - 1527
  • [24] Underdetermined Blind Source Separation Combining Tensor Decomposition and Nonnegative Matrix Factorization
    Xie, Yuan
    Xie, Kan
    Yang, Junjie
    Xie, Shengli
    [J]. SYMMETRY-BASEL, 2018, 10 (10):
  • [25] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [26] Orthogonal Nonnegative Matrix Factorization for Blind Image Separation
    Mirzal, Andri
    [J]. ADVANCES IN VISUAL INFORMATICS, 2013, 8237 : 25 - 35
  • [27] Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization
    Kitamura, Daichi
    Ono, Nobutaka
    Sawada, Hiroshi
    Kameoka, Hirokazu
    Saruwatari, Hiroshi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1626 - 1641
  • [28] Online Blind Source Separation Using Incremental Nonnegative Matrix Factorization with Volume Constraint
    Zhou, Guoxu
    Yang, Zuyuan
    Xie, Shengli
    Yang, Jun-Mei
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (04): : 550 - 560
  • [29] Blind source separation based on generalized gaussian model
    杨斌
    孔薇
    周越
    [J]. Journal of Harbin Institute of Technology(New series), 2007, (03) : 362 - 367
  • [30] Mixing Matrix Estimation in Blind Source Separation Based on Generalized Gaussian Mixture Modal
    Chen, Yongqiang
    Liu, Jun
    [J]. SMART TECHNOLOGIES FOR COMMUNICATION, 2012, 4 : 217 - 221