Independent Deeply Learned Matrix Analysis for Multichannel Audio Source Separation

被引:0
|
作者
Mogami, Shinichi [1 ]
Sumino, Hayato [1 ]
Kitamura, Daichi [1 ]
Takamune, Norihiro [1 ]
Takamichi, Shinnosuke [1 ]
Saruwatari, Hiroshi [1 ]
Ono, Nobutaka [2 ]
机构
[1] Univ Tokyo, Tokyo, Japan
[2] Tokyo Metropolitan Univ, Hachioji, Tokyo, Japan
关键词
multichannel audio source separation; independent component analysis; deep neural networks;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we address a multichannel audio source separation task and propose a new efficient method called independent deeply learned matrix analysis (IDLMA). IDLMA estimates the demixing matrix in a blind manner and updates the time-frequency structures of each source using a pretrained deep neural network (DNN). Also, we introduce a complex Student's t-distribution as a generalized source generative model including both complex Gaussian and Cauchy distributions. Experiments are conducted using music signals with a training dataset, and the results show the validity of the proposed method in terms of separation accuracy and computational cost.
引用
收藏
页码:1557 / 1561
页数:5
相关论文
共 50 条
  • [1] Empirical Bayesian Independent Deeply Learned Matrix Analysis For Multichannel Audio Source Separation
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 331 - 335
  • [2] Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1226 - 1233
  • [3] Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation
    Makishima, Naoki
    Mogami, Shinichi
    Takamune, Norihiro
    Kitamura, Daichi
    Sumino, Hayato
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    Ono, Nobutaka
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (10) : 1601 - 1615
  • [4] Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation
    Narisawa, Naoki
    Ikeshita, Rintaro
    Takamune, Norihiro
    Kitamura, Daichi
    Nakamura, Tomohiko
    Saruwatari, Hiroshi
    Nakatani, Tomohiro
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 326 - 330
  • [5] PoP-IDLMA: Product-of-Prior Independent Deeply Learned Matrix Analysis for Multichannel Music Source Separation
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2680 - 2694
  • [6] SEPNET: A DEEP SEPARATION MATRIX PREDICTION NETWORK FOR MULTICHANNEL AUDIO SOURCE SEPARATION
    Inoue, Shota
    Kameoka, Hirokazu
    Li, Li
    Makino, Shoji
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 191 - 195
  • [7] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [8] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [9] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [10] Audio source separation based on independent component analysis
    Makino, S
    Araki, S
    Mukai, R
    Sawada, H
    [J]. 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 5, PROCEEDINGS, 2004, : 668 - 671