Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder

被引:18
|
作者
Seki, Shogo [1 ]
Kameoka, Hirokazu [2 ]
Li, Li [3 ]
Toda, Tomoki [4 ]
Takeda, Kazuya [5 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4640861, Japan
[2] NTT Corp, Atsugi, Kanagawa 2430198, Japan
[3] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[4] Nagoya Univ, Informat Technol Ctr, Nagoya, Aichi 4640861, Japan
[5] Nagoya Univ, Inst Innovat Future Soc, Nagoya, Aichi 4648603, Japan
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Underdetermined source separation; variational audoencoder; non-negative matrix factorization; AUDIO SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION;
D O I
10.1109/ACCESS.2019.2954120
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel non-negative matrix factorization (MNMF) is a powerful method for underdetermined audio source separation, which adopts the NMF concept to model and estimate the power spectrograms of the sound sources in a mixture signal. This concept is also used in independent low-rank matrix analysis (ILRMA), a special class of the MNMF formulated under determined conditions. While these methods work reasonably well for particular types of sound sources, one limitation is that they can fail to work for sources with spectrograms that do not comply with the NMF model. To address this limitation, an extension of ILRMA called the multichannel variational autoencoder (MVAE) method was recently proposed, where a conditional VAE (CVAE) is used instead of the NMF model for expressing source power spectrograms. This approach has performed impressively in determined source separation tasks thanks to the representation power of deep neural networks. While the original MVAE method was formulated under determined mixing conditions, this paper proposes a generalized version of it by combining the ideas of MNMF and MVAE so that it can also deal with underdetermined cases. We call this method the generalized MVAE (GMVAE) method. In underdetermined source separation and speech enhancement experiments, the proposed method performed better than baseline methods.
引用
收藏
页码:168104 / 168115
页数:12
相关论文
共 50 条
  • [21] Underdetermined source separation with structured source priors
    Vincent, E
    Rodet, X
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 327 - 334
  • [22] UNDERDETERMINED BLIND SOURCE SEPARATION BASED ON DISTRIBUTED COMPRESSED SENSING
    Xu, Yong
    Zhang, Yujie
    Li, Hongwei
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2015, 11 (02): : 447 - 462
  • [23] Underdetermined blind source separation algorithm based on A-DBSCAN
    Ji C.
    Mu W.
    Geng R.
    1600, Chinese Institute of Electronics (42): : 2676 - 2683
  • [24] Underdetermined blind separation based on source signals' number estimation
    Tan, Bei-Hai
    Xie, Sheng-Li
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2008, 30 (04): : 863 - 867
  • [25] Variational Autoencoder based Latent Factor Decoding of Multichannel EEG for Emotion Recognition
    Li, Xiang
    Zhao, Zhigang
    Song, Dawei
    Zhang, Yazhou
    Niu, Chunyang
    Zhang, Junwei
    Huo, Jidong
    Li, Jing
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 684 - 687
  • [26] Parameter-adaptive variational autoencoder for linear/nonlinear blind source separation
    Wei, Yuan-Hao
    Ni, Yi-Qing
    JOURNAL OF CIVIL STRUCTURAL HEALTH MONITORING, 2024, : 1161 - 1184
  • [27] Algorithm for source recovery in underdetermined blind source separation based on plane pursuit
    FU Weihong
    WEI Juan
    LIU Naian
    CHEN Jiehu
    Journal of Systems Engineering and Electronics, 2018, 29 (02) : 223 - 228
  • [28] Source Recovery in Underdetermined Blind Source Separation Based on Artificial Neural Network
    Weihong Fu
    Bin Nong
    Xinbiao Zhou
    Jun Liu
    Changle Li
    中国通信, 2018, 15 (01) : 140 - 154
  • [29] Source Recovery in Underdetermined Blind Source Separation Based on Artificial Neural Network
    Fu, Weihong
    Nong, Bin
    Zhou, Xinbiao
    Liu, Jun
    Li, Changle
    CHINA COMMUNICATIONS, 2018, 15 (01) : 140 - 154
  • [30] Underdetermined Joint Blind Source Separation based on Tensor Decomposition
    Zou, Liang
    Wang, Z. Jane
    Chen, Xun
    Ji, Xiangyang
    2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,