END-TO-END NON-NEGATIVE AUTOENCODERS FOR SOUND SOURCE SEPARATION

被引:0
|
作者
Venkataramani, Shrikant [1 ]
Tzinis, Efthymios [2 ]
Sinaragdis, Paris [2 ,3 ]
机构
[1] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL USA
[3] Adobe Res, San Jose, CA USA
基金
美国国家科学基金会;
关键词
Non-negative autoencoder; non-negative matrix factorization; source separation; single-channel audio separation; end-to-end; deep learning; MATRIX FACTORIZATION;
D O I
10.1109/icassp40776.2020.9053588
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Discriminative models for source separation have recently been shown to produce impressive results. However, when operating on sources outside of the training set, these models can not perform as well and are cumbersome to update. Classical methods like Non-negative Matrix Factorization (NMF) provide modular approaches to source separation that can be easily updated to adapt to new mixture scenarios. In this paper, we generalize NMF to develop end-to-end non-negative auto-encoders and demonstrate how they can be used for source separation. Our experiments indicate that these models deliver comparable separation performance to discriminative approaches, while retaining the modularity of NMF and the modeling flexibility of neural networks.
引用
收藏
页码:116 / 120
页数:5
相关论文
共 50 条
  • [1] Monophonic Sound Source Separation by Non-negative Sparse Autoencoders
    Zen, Keiki
    Suzuki, Masahiro
    Sato, Haruhiko
    Oyama, Satoshi
    Kurihara, Masahiro
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3623 - 3626
  • [2] END-TO-END TRAINABLE WEAKLY NON-NEGATIVE FACTORIZATION
    Kobayashi, Takumi
    Watanabe, Kenji
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 490 - 494
  • [3] END-TO-END SOUND SOURCE SEPARATION CONDITIONED ON INSTRUMENT LABELS
    Slizovskaia, Olga
    Kim, Leo
    Haro, Gloria
    Gomez, Emilia
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 306 - 310
  • [4] Shifted non-negative matrix factorisation for sound source separation
    FitzGerald, Derry
    Cranitch, Matt
    Coyle, Eugene
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 1061 - 1065
  • [5] LaSNet: An end-to-end network based on steering vector filter for sound source localization and separation
    Yang, Xiaokang
    Zhang, Hongcheng
    Lu, Yufei
    A, Ying
    Ren, Guangyi
    Wei, Jianguo
    Wang, Xianliang
    Li, Wei
    APPLIED ACOUSTICS, 2023, 212
  • [6] DBNET: DOA-DRIVEN BEAMFORMING NETWORK FOR END-TO-END REVERBERANT SOUND SOURCE SEPARATION
    Aroudi, Ali
    Braun, Sebastian
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 211 - 215
  • [7] Sound source separation using shifted Non-negative tensor factorisation
    FitzGerald, Derry
    Cranitch, Matt
    Coyle, Eugene
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5511 - 5514
  • [8] End-to-end music source separation: is it possible in the waveform domain?
    Lluis, Francesc
    Pons, Jordi
    Serra, Xavier
    INTERSPEECH 2019, 2019, : 4619 - 4623
  • [9] END-TO-END SOURCE SEPARATION WITH ADAPTIVE FRONT-ENDS
    Venkataramani, Shrikant
    Casebeer, Jonah
    Smaragdis, Paris
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 684 - 688
  • [10] CUSTOM SIZED NON-NEGATIVE MATRIX FACTOR DECONVOLUTION FOR SOUND SOURCE SEPARATION
    Becker, Julian M.
    Rohlfing, Christian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,