Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation

被引:52
|
作者
Makishima, Naoki [1 ]
Mogami, Shinichi [1 ]
Takamune, Norihiro [1 ]
Kitamura, Daichi [2 ]
Sumino, Hayato [1 ]
Takamichi, Shinnosuke [1 ]
Saruwatari, Hiroshi [1 ]
Ono, Nobutaka [3 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo 1138656, Japan
[2] Kagawa Coll, Natl Inst Technol, Takamatsu, Kagawa 7618058, Japan
[3] Tokyo Metropolitan Univ, Grad Sch Syst Design, Tokyo 1910065, Japan
关键词
Audio source separation; independent component analysis; deep neural networks; semi-supervised learning; BLIND SOURCE SEPARATION; COMPONENT ANALYSIS; FACTORIZATION;
D O I
10.1109/TASLP.2019.2925450
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new framework called independent deeply learned matrix analysis (IDLMA), which unifies a deep neural network (DNN) and independence-based multichannel audio source separation. IDLMA utilizes both pretrained DNN source models and statistical independence between sources for the separation, where the time-frequency structures of each source are iteratively optimized by a DNN while enhancing the estimation accuracy of the spatial demixing filters. As the source generative model, we introduce a complex heavy-tailed distribution to improve the separation performance. In addition, we address a semi-supervised situation; namely, a solo-recorded audio dataset can be prepared for only one source in the mixture signal. To solve the limited-data problem, we propose an appropriate data augmentation method to adapt the DNN source models to the observed signal, which enables IDLMA to work even in the semi-supervised situation. Experiments are conducted using music signals with a training dataset in both supervised and semi-supervised situations. The results show the validity of the proposed method in terms of the separation accuracy.
引用
收藏
页码:1601 / 1615
页数:15
相关论文
共 50 条
  • [1] Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
    Narisawa, Naoki
    Ikeshita, Rintaro
    Takamune, Norihiro
    Kitamura, Daichi
    Nakamura, Tomohiko
    Saruwatari, Hiroshi
    Nakatani, Tomohiro
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 326 - 330
  • [2] Independent Deeply Learned Matrix Analysis for Multichannel Audio Source Separation
    Mogami, Shinichi
    Sumino, Hayato
    Kitamura, Daichi
    Takamune, Norihiro
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    Ono, Nobutaka
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 1557 - 1561
  • [3] Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 331 - 335
  • [4] Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1226 - 1233
  • [5] PoP-IDLMA: Product-of-Prior Independent Deeply Learned Matrix Analysis for Multichannel Music Source Separation
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2680 - 2694
  • [6] Audio source separation based on independent component analysis
    Makino, S
    Araki, S
    Mukai, R
    Sawada, H
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 5, PROCEEDINGS, 2004, : 668 - 671
  • [7] Consistent independent low-rank matrix analysis for determined blind source separation
    Daichi Kitamura
    Kohei Yatabe
    [J]. EURASIP Journal on Advances in Signal Processing, 2020
  • [8] Consistent independent low-rank matrix analysis for determined blind source separation
    Kitamura, Daichi
    Yatabe, Kohei
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
  • [9] Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization
    Kitamura, Daichi
    Ono, Nobutaka
    Sawada, Hiroshi
    Kameoka, Hirokazu
    Saruwatari, Hiroshi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1626 - 1641
  • [10] Blind audio source separation based on independent component analysis
    Makino, Shoji
    Sawada, Hiroshi
    Araki, Shoko
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 843 - 843