Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder

被引:18
|
作者
Seki, Shogo [1 ]
Kameoka, Hirokazu [2 ]
Li, Li [3 ]
Toda, Tomoki [4 ]
Takeda, Kazuya [5 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4640861, Japan
[2] NTT Corp, Atsugi, Kanagawa 2430198, Japan
[3] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan
[4] Nagoya Univ, Informat Technol Ctr, Nagoya, Aichi 4640861, Japan
[5] Nagoya Univ, Inst Innovat Future Soc, Nagoya, Aichi 4648603, Japan
来源
IEEE ACCESS | 2019年 / 7卷
关键词
Underdetermined source separation; variational audoencoder; non-negative matrix factorization; AUDIO SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION;
D O I
10.1109/ACCESS.2019.2954120
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel non-negative matrix factorization (MNMF) is a powerful method for underdetermined audio source separation, which adopts the NMF concept to model and estimate the power spectrograms of the sound sources in a mixture signal. This concept is also used in independent low-rank matrix analysis (ILRMA), a special class of the MNMF formulated under determined conditions. While these methods work reasonably well for particular types of sound sources, one limitation is that they can fail to work for sources with spectrograms that do not comply with the NMF model. To address this limitation, an extension of ILRMA called the multichannel variational autoencoder (MVAE) method was recently proposed, where a conditional VAE (CVAE) is used instead of the NMF model for expressing source power spectrograms. This approach has performed impressively in determined source separation tasks thanks to the representation power of deep neural networks. While the original MVAE method was formulated under determined mixing conditions, this paper proposes a generalized version of it by combining the ideas of MNMF and MVAE so that it can also deal with underdetermined cases. We call this method the generalized MVAE (GMVAE) method. In underdetermined source separation and speech enhancement experiments, the proposed method performed better than baseline methods.
引用
收藏
页码:168104 / 168115
页数:12
相关论文
共 50 条
  • [31] Gaussian Processes for Underdetermined Source Separation
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gaeel
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (07) : 3155 - 3167
  • [32] Underdetermined Blind Source Separation with Variational Mode Decomposition for Compound Roller Bearing Fault Signals
    Tang, Gang
    Luo, Ganggang
    Zhang, Weihua
    Yang, Caijin
    Wang, Huaqing
    SENSORS, 2016, 16 (06)
  • [33] Algorithm for source recovery in underdetermined blind source separation based on plane pursuit
    Fu Weihong
    Wei Juan
    Liu Naian
    Chen Jiehu
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2018, 29 (02) : 223 - 228
  • [34] Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization
    Oh, Son-hook
    Kim, Jung-Han
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): : 120 - 130
  • [35] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
    Lopez, A. Ramirez
    Ono, N.
    Remes, U.
    Palomaki, K.
    Kurimo, M.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
  • [36] Source Recovery of Underdetermined Blind Source Separation Based on Modified Shortest Path Algorithm
    Wang, Chuanchuan
    Zeng, Yonghu
    Wang, Liandong
    PROCEEDINGS OF 2019 IEEE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY (ICEICT 2019), 2019, : 215 - 220
  • [37] Underdetermined Blind Source Separation Based on Relaxed Sparsity Condition of Sources
    Peng, Dezhong
    Xiang, Yong
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (02) : 809 - 814
  • [38] Nonnegative Mixture for Underdetermined Blind Source Separation Based on a Tensor Algorithm
    Ge, Sunan
    Han, Jie
    Han, Min
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (09) : 2935 - 2950
  • [39] Underdetermined Blind Source Separation of Adjacent Satellite Interference Based on Sparseness
    Chengjie Li
    Lidong Zhu
    Zhongqiang Luo
    中国通信, 2017, 14 (04) : 140 - 149
  • [40] Underdetermined blind source separation method based on independent component analysis
    Ordnance Engineering College, Shijiazhuang 050003, China
    不详
    J Vib Shock, 2013, 7 (30-33):