Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation

被引:0
|
作者
Narisawa, Naoki [1 ]
Ikeshita, Rintaro [2 ]
Takamune, Norihiro [1 ]
Kitamura, Daichi [3 ]
Nakamura, Tomohiko [1 ]
Saruwatari, Hiroshi [1 ]
Nakatani, Tomohiro [2 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
[3] Kagawa Coll, Natl Inst Technol, Takamatsu, Kagawa, Japan
关键词
audio source separation; independent component analysis; deep neural networks; inter-frequency correlation; ICA;
D O I
10.23919/EUSIPCO54536.2021.9616300
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address the determined audio source separation problem in the time-frequency domain. In independent deeply learned matrix analysis (IDLMA), it is assumed that the inter-frequency correlation of each source spectrum is zero, which is inappropriate for modeling nonstationary signals such as music signals. To account for the correlation between frequencies, independent positive semidefinite tensor analysis has been proposed. This unsupervised (blind) method, however, severely restrict the structure of frequency covariance matrices (FCMs) to reduce the number of model parameters. As an extension of these conventional approaches, we here propose a supervised method that models FCMs using deep neural networks (DNNs). It is difficult to directly infer FCMs using DNNs. Therefore, we also propose a new FCM model represented as a convex combination of a diagonal FCM and a rank-1 FCM. Our FCM model is flexible enough to not only consider inter-frequency correlation, but also capture the dynamics of time-varying FCMs of nonstationary signals. We infer the proposed FCMs using two DNNs: DNN for power spectrum estimation and DNN for time-domain signal estimation. An experimental result of separating music signals shows that the proposed method provides higher separation performance than IDLMA.
引用
收藏
页码:326 / 330
页数:5
相关论文
共 50 条
  • [31] Sparse Reverberant Audio Source Separation via Reweighted Analysis
    Arberet, Simon
    Vandergheynst, Pierre
    Carrillo, Rafael E.
    Thiran, Jean-Philippe
    Wiaux, Yves
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1391 - 1402
  • [32] INFINITE PROBABILISTIC LATENT COMPONENT ANALYSIS FOR AUDIO SOURCE SEPARATION
    Yoshii, Kazuyoshi
    Nakamura, Eita
    Itoyama, Katsutoshi
    Goto, Masataka
    [J]. 2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [33] Sampling-Frequency-Independent Convolutional Layer and its Application to Audio Source Separation
    Saito, Koichi
    Nakamura, Tomohiko
    Yatabe, Kohei
    Saruwatari, Hiroshi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2928 - 2943
  • [34] TENSOR BASED SINGULAR SPECTRUM ANALYSIS FOR NONSTATIONARY SOURCE SEPARATION
    Kouchaki, Samaneh
    Sanei, Saeid
    [J]. 2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [35] AN OVERVIEW OF INFORMED AUDIO SOURCE SEPARATION
    Liutkus, Antoine
    Durrieu, Jean-Louis
    Daudet, Laurent
    Richard, Gael
    [J]. 2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS), 2013,
  • [36] ON-THE-FLY AUDIO SOURCE SEPARATION
    El Badawy, Dalia
    Duong, Ngoc Q. K.
    Ozerov, Alexey
    [J]. 2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [37] MOTION INFORMED AUDIO SOURCE SEPARATION
    Parekh, Sanjeel
    Essid, Slim
    Ozerov, Alexey
    Duong, Ngoc Q. K.
    Perez, Patrick
    Richard, Gael
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6 - 10
  • [38] Audio source separation of convolutive mixtures
    Mitianoudis, N
    Davies, ME
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 489 - 497
  • [39] Joint Audio Inpainting and Source Separation
    Bilen, Cagdas
    Ozerov, Alexey
    Perez, Patrick
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 251 - 258
  • [40] The effect of source sparsity on independent vector analysis for blind source separation
    Gu, Jianjun
    Cheng, Longbiao
    Yao, Dingding
    Li, Junfeng
    Yan, Yonghong
    [J]. SIGNAL PROCESSING, 2023, 213