Multichannel Singing Voice Separation by Deep Neural Network Informed DOA Constrained CMNMF

被引:0
|
作者
Munoz-Montoro, Antonio J. [1 ]
Politis, Archontis [2 ]
Drossos, Konstantinos [2 ]
Carabias-Orti, Julio J. [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen, Spain
[2] Tampere Univ, Audio Res Grp, Tampere, Finland
基金
欧洲研究理事会;
关键词
Multichannel Source Separation; Singing Voice; Deep Learning; CMNMF; Spatial Audio; SPATIAL COVARIANCE MODEL; AUDIO SOURCE SEPARATION; NONNEGATIVE MATRIX;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This work addresses the problem of multichannel source separation combining two powerful approaches, multichannel spectral factorization with recent monophonic deep learning (DL) based spectrum inference. Individual source spectra at different channels are estimated with a Masker-Denoiser twin network, able to model long-term temporal patterns of a musical piece. The monophonic source spectrograms are used within a spatial covariance mixing model based on complex-valued multichannel non-negative matrix factorization (CMNMF) that predicts the spatial characteristics of each source. The proposed framework is evaluated on the task of singing voice separation with a large multichannel dataset. Experimental results show that our joint DL+CMNMF method outperforms both the individual monophonic DL-based separation and the multichannel CMNMF baseline methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Ambisonics domain Singing Voice Separation combining Deep Neural Network and Direction Aware Multichannel NMF
    Munoz-Montoro, Antonio J.
    Carabias-Orti, Julio J.
    Vera-Candeas, Pedro
    IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2021,
  • [2] Singing Voice Separation Based on Deep Regression Neural Network
    Yang, Shuqian
    Zhang, Wei-Qiang
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,
  • [3] PROXIMAL DEEP RECURRENT NEURAL NETWORK FOR MONAURAL SINGING VOICE SEPARATION
    Yuan, Weitao
    Wang, Shengbei
    Li, Xiangrui
    Unoki, Masashi
    Wang, Wenwu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 286 - 290
  • [4] FC-U2-Net: A Novel Deep Neural Network for Singing Voice Separation
    Ni, Xin
    Ren, Jia
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 489 - 494
  • [5] Discriminative Training of Complex-valued Deep Recurrent Neural Network for Singing Voice Separation
    Lee, Yuan-Shan
    Yu, Kuo
    Chen, Sih-Huei
    Wang, Jia-Ching
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1327 - 1335
  • [6] Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy
    Kin Wah Edward Lin
    B. T. Balamurali
    Enyan Koh
    Simon Lui
    Dorien Herremans
    Neural Computing and Applications, 2020, 32 : 1037 - 1050
  • [7] Singing voice separation using a deep convolutional neural network trained by ideal binary mask and cross entropy
    Lin, Kin Wah Edward
    Balamurali, B. T.
    Koh, Enyan
    Lui, Simon
    Herremans, Dorien
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (04): : 1037 - 1050
  • [8] Improving singing voice separation using attribute-aware deep network
    Swaminathan, Rupak Vignesh
    Lerch, Alexander
    2019 INTERNATIONAL WORKSHOP ON MULTILAYER MUSIC REPRESENTATION AND PROCESSING (MMRP 2019), 2019, : 60 - 65
  • [9] VOCAL ACTIVITY INFORMED SINGING VOICE SEPARATION WITH THE IKALA DATASET
    Chan, Tak-Shing
    Yeh, Tzu-Chun
    Fan, Zhe-Cheng
    Chen, Hung-Wei
    Sui, Li
    Yang, Yi-Hsuan
    Jang, Roger
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 718 - 722
  • [10] Informed Group-Sparse Representation for Singing Voice Separation
    Chan, Tak-Shing T.
    Yang, Yi-Hsuan
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (02) : 156 - 160