STUDENT'S T NONNEGATIVE MATRIX FACTORIZATION AND POSITIVE SEMIDEFINITE TENSOR FACTORIZATION FOR SINGLE-CHANNEL AUDIO SOURCE SEPARATION

被引:0
|
作者
Yoshii, Kazuyoshi [1 ]
Itoyama, Katsutoshi [1 ]
Goto, Masataka [2 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[2] Nat Inst Adv Ind Sci & Technol AIST, Tsukuba, Ibaraki, Japan
关键词
Source separation; nonnegative matrix factorization; positive semidefinite tensor factorization; t distribution;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a robust variant of nonnegative matrix factorization (NMF) based on complex Student's t distributions (t-NMF) for source separation of single-channel audio signals. The Itakura-Saito divergence NMF (Gaussian NMF) is justified for this purpose under an assumption that the complex spectra of source signals and those of the mixture signal are complex Gaussian distributed (the additivity of power spectra holds). In fact, however, the source spectra are often heavy-tailed distributed. When the source spectra are complex Cauchy distributed, for example, the mixture spectra are also complex Cauchy distributed (the additivity of amplitude spectra holds). Using the complex t distribution that includes the complex Gaussian and Cauchy distributions as its special cases, we propose t-NMF as a unified extension of Gaussian NMF and Cauchy NMF. Furthermore, we propose the corresponding variant of positive semidefinite tensor factorization based on multivariate complex t distributions (t-PSDTF). The experimental results showed that while t-NMF and t-PSDTF were comparative to Gaussian counterparts in terms of peak performance, they worked much better on average because they are insensitive to initialization and tend to avoid local optima.
引用
收藏
页码:51 / 55
页数:5
相关论文
共 50 条
  • [1] NONNEGATIVE TENSOR FACTORIZATION FOR SOURCE SEPARATION OF LOOPS IN AUDIO
    Smith, Jordan B. L.
    Goto, Masataka
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 171 - 175
  • [2] STUDENT'S T MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR BLIND SOURCE SEPARATION
    Kitamura, Koichi
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    [J]. 2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [3] Perceptual Single-Channel Audio Source Separation by Non-negative Matrix Factorization
    Kirbiz, Serap
    Gunsel, Bilge
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 654 - 657
  • [4] Hidden Markov Models as Priors for Regularized Nonnegative Matrix Factorization in Single-Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1534 - 1537
  • [5] Gaussian Mixture Gain Priors for Regularized Nonnegative Matrix Factorization in Single-Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1518 - 1521
  • [6] Single-Channel Source Separation Using Complex Matrix Factorization
    King, Brian J.
    Atlas, Les
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2591 - 2597
  • [7] Initialization of Nonnegative Matrix Factorization Dictionaries for Single Channel Source Separation
    Grais, Emad M.
    Erdogan, Hakan
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [8] A PERCEPTUALLY ENHANCED BLIND SINGLE-CHANNEL AUDIO SOURCE SEPARATION BY NON-NEGATIVE MATRIX FACTORIZATION
    Kirbiz, S.
    Gunsel, B.
    [J]. 18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 731 - 735
  • [9] Single-Channel Signal Separation Using Spectral Basis Correlation with Sparse Nonnegative Tensor Factorization
    P. Parathai
    N. Tengtrairat
    W. L. Woo
    Bin Gao
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 5786 - 5816
  • [10] Single-Channel Signal Separation Using Spectral Basis Correlation with Sparse Nonnegative Tensor Factorization
    Parathai, P.
    Tengtrairat, N.
    Woo, W. L.
    Gao, Bin
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (12) : 5786 - 5816