Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder

被引：18

作者：

Seki, Shogo ^{[1
]}

Kameoka, Hirokazu ^{[2
]}

Li, Li ^{[3
]}

Toda, Tomoki ^{[4
]}

Takeda, Kazuya ^{[5
]}

机构：

[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4640861, Japan

[2] NTT Corp, Atsugi, Kanagawa 2430198, Japan

[3] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058573, Japan

[4] Nagoya Univ, Informat Technol Ctr, Nagoya, Aichi 4640861, Japan

[5] Nagoya Univ, Inst Innovat Future Soc, Nagoya, Aichi 4648603, Japan

来源：

IEEE ACCESS | 2019年 / 7卷

关键词：

Underdetermined source separation; variational audoencoder; non-negative matrix factorization; AUDIO SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION;

D O I：

10.1109/ACCESS.2019.2954120

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper deals with a multichannel audio source separation problem under underdetermined conditions. Multichannel non-negative matrix factorization (MNMF) is a powerful method for underdetermined audio source separation, which adopts the NMF concept to model and estimate the power spectrograms of the sound sources in a mixture signal. This concept is also used in independent low-rank matrix analysis (ILRMA), a special class of the MNMF formulated under determined conditions. While these methods work reasonably well for particular types of sound sources, one limitation is that they can fail to work for sources with spectrograms that do not comply with the NMF model. To address this limitation, an extension of ILRMA called the multichannel variational autoencoder (MVAE) method was recently proposed, where a conditional VAE (CVAE) is used instead of the NMF model for expressing source power spectrograms. This approach has performed impressively in determined source separation tasks thanks to the representation power of deep neural networks. While the original MVAE method was formulated under determined mixing conditions, this paper proposes a generalized version of it by combining the ideas of MNMF and MVAE so that it can also deal with underdetermined cases. We call this method the generalized MVAE (GMVAE) method. In underdetermined source separation and speech enhancement experiments, the proposed method performed better than baseline methods.

引用

页码：168104 / 168115

页数：12

共 50 条

[31] Gaussian Processes for Underdetermined Source Separation
Liutkus, Antoine
Badeau, Roland
Richard, Gaeel
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (07) : 3155 - 3167
[32] Underdetermined Blind Source Separation with Variational Mode Decomposition for Compound Roller Bearing Fault Signals
Tang, Gang
Luo, Ganggang
Zhang, Weihua
Yang, Caijin
Wang, Huaqing
SENSORS, 2016, 16 (06)
[33] Algorithm for source recovery in underdetermined blind source separation based on plane pursuit
Fu Weihong
Wei Juan
Liu Naian
Chen Jiehu
JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2018, 29 (02) : 223 - 228
[34] Underdetermined blind source separation using normalized spatial covariance matrix and multichannel nonnegative matrix factorization
Oh, Son-hook
Kim, Jung-Han
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (02): : 120 - 130
[35] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
Lopez, A. Ramirez
Ono, N.
Remes, U.
Palomaki, K.
Kurimo, M.
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
[36] Source Recovery of Underdetermined Blind Source Separation Based on Modified Shortest Path Algorithm
Wang, Chuanchuan
Zeng, Yonghu
Wang, Liandong
PROCEEDINGS OF 2019 IEEE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY (ICEICT 2019), 2019, : 215 - 220
[37] Underdetermined Blind Source Separation Based on Relaxed Sparsity Condition of Sources
Peng, Dezhong
Xiang, Yong
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (02) : 809 - 814
[38] Nonnegative Mixture for Underdetermined Blind Source Separation Based on a Tensor Algorithm
Ge, Sunan
Han, Jie
Han, Min
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (09) : 2935 - 2950
[39] Underdetermined Blind Source Separation of Adjacent Satellite Interference Based on Sparseness
Chengjie Li
Lidong Zhu
Zhongqiang Luo
中国通信, 2017, 14 (04) : 140 - 149
[40] Underdetermined blind source separation method based on independent component analysis
Ordnance Engineering College, Shijiazhuang 050003, China
不详
J Vib Shock, 2013, 7 (30-33):

← 1 2 3 4 5 →