INFINITE PROBABILISTIC LATENT COMPONENT ANALYSIS FOR AUDIO SOURCE SEPARATION

被引:0
|
作者
Yoshii, Kazuyoshi [1 ,2 ]
Nakamura, Eita [1 ]
Itoyama, Katsutoshi [1 ]
Goto, Masataka [3 ]
机构
[1] Kyoto Univ, Kyoto, Japan
[2] RIKEN, Tokyo, Japan
[3] Natl Inst Adv Ind Sci & Technol, Tsukuba, Ibaraki, Japan
关键词
Source separation; nonparametric Bayes; probabilistic latent component analysis; Dirichlet process; Gibbs sampling; variational Bayes;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a statistical method of audio source separation based on a nonparametric Bayesian extension of probabilistic latent component analysis (PLCA). A major approach to audio source separation is to use nonnegative matrix factorization (NMF) that approximates the magnitude spectrum of a mixture signal at each frame as the weighted sum of fewer source spectra. Another approach is to use PLCA that regards the magnitude spectrogram as a two-dimensional histogram of "sound quanta" and classifies each quantum into one of sources. While NMF has a physically-natural interpretation, PLCA has been used successfully for music signal analysis. To enable PLCA to estimate the number of sources, we propose Dirichlet process PLCA (DP-PLCA) and derive two kinds of learning methods based on variational Bayes and collapsed Gibbs sampling. Unlike existing learning methods for nonparametric Bayesian NMF based on the beta or gamma processes (BP-NMF and GaP-NMF), our sampling method can efficiently search for the optimal number of sources without truncating the number of sources to be considered. Experimental results showed that DP-PLCA is superior to GaP-NMF in terms of source number estimation.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Audio source separation by source localization with Hilbert spectrum
    Molla, MKI
    Hirose, K
    Minematsu, N
    [J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 5734 - 5737
  • [42] Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation
    Makishima, Naoki
    Mogami, Shinichi
    Takamune, Norihiro
    Kitamura, Daichi
    Sumino, Hayato
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    Ono, Nobutaka
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (10) : 1601 - 1615
  • [43] Time-domain blind audio source separation using advanced component clustering and reconstruction
    Koldovsky, Zbynek
    Tichavsky, Petr
    [J]. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 216 - +
  • [44] Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
    Narisawa, Naoki
    Ikeshita, Rintaro
    Takamune, Norihiro
    Kitamura, Daichi
    Nakamura, Tomohiko
    Saruwatari, Hiroshi
    Nakatani, Tomohiro
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 326 - 330
  • [45] Independent Low-Rank Tensor Analysis for Audio Source Separation
    Yoshii, Kazuyoshi
    Kitamura, Koichi
    Bando, Yoshiaki
    Nakamura, Eita
    Kawahara, Tatsuya
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1657 - 1661
  • [46] Independent Deeply Learned Matrix Analysis for Multichannel Audio Source Separation
    Mogami, Shinichi
    Sumino, Hayato
    Kitamura, Daichi
    Takamune, Norihiro
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    Ono, Nobutaka
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1557 - 1561
  • [47] DNN-Based Frequency Component Prediction for Frequency-Domain Audio Source Separation
    Watanabe, Rui
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 805 - 809
  • [48] A Survey of Optimization Methods for Independent Vector Analysis in Audio Source Separation
    Guo, Ruiming
    Luo, Zhongqiang
    Li, Mingchun
    [J]. SENSORS, 2023, 23 (01)
  • [49] Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach
    Roffo, Giorgio
    Melzi, Simone
    Castellani, Umberto
    Vinciarelli, Alessandro
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1407 - 1415
  • [50] Infinite Sparse Factor Analysis for Blind Source Separation in Reverberant Environments
    Nagira, Kohei
    Otsuka, Takuma
    Okuno, Hiroshi G.
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 638 - 647