Blind source separation with optimal transport non-negative matrix factorization

被引:0
|
作者
Antoine Rolet
Vivien Seguy
Mathieu Blondel
Hiroshi Sawada
机构
[1] Graduate School of Informatics,
[2] Kyoto University,undefined
[3] Yoshida Honmachi,undefined
[4] NTT Communication Science Laboratories,undefined
关键词
NMF; Speech; BSS; Optimal transport;
D O I
暂无
中图分类号
学科分类号
摘要
Optimal transport as a loss for machine learning optimization problems has recently gained a lot of attention. Building upon recent advances in computational optimal transport, we develop an optimal transport non-negative matrix factorization (NMF) algorithm for supervised speech blind source separation (BSS). Optimal transport allows us to design and leverage a cost between short-time Fourier transform (STFT) spectrogram frequencies, which takes into account how humans perceive sound. We give empirical evidence that using our proposed optimal transport, NMF leads to perceptually better results than NMF with other losses, for both isolated voice reconstruction and speech denoising using BSS. Finally, we demonstrate how to use optimal transport for cross-domain sound processing tasks, where frequencies represented in the input spectrograms may be different from one spectrogram to another.
引用
收藏
相关论文
共 50 条
  • [1] Blind source separation with optimal transport non-negative matrix factorization
    Rolet, Antoine
    Seguy, Vivien
    Blondel, Mathieu
    Sawada, Hiroshi
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2018,
  • [2] New algorithms for non-negative matrix factorization in applications to blind source separation
    Cichocki, Andrzej
    Zdunek, Rafal
    Amari, Shun-ichi
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 5479 - 5482
  • [3] SOURCE SEPARATION WITH SCATTERING NON-NEGATIVE MATRIX FACTORIZATION
    Bruna, Joan
    Sprechmann, Pablo
    LeCun, Yann
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1876 - 1880
  • [4] Blind Source Separation on Non-Contact Heartbeat Detection by Non-Negative Matrix Factorization Algorithms
    Ye, Chen
    Toyoda, Kentaroh
    Ohtsuki, Tomoaki
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2020, 67 (02) : 482 - 494
  • [5] BLIND AUDIO SOURCE SEPARATION OF STEREO MIXTURES USING BAYESIAN NON-NEGATIVE MATRIX FACTORIZATION
    Mirzaei, S.
    Van Hamme, H.
    Norouzi, Y.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 621 - 625
  • [6] Source Separation Based on Non-Negative Matrix Factorization of the Synchrosqueezing Transform
    Singh, Neha
    Meignen, Sylvain
    Oberlin, Thomas
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 1910 - 1914
  • [7] Sparsity Promoted Non-Negative Matrix Factorization for Source Separation and Detection
    Wang, Yanlin
    Li, Yun
    Ho, K. C.
    Zare, A.
    Skubic, M.
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 640 - 645
  • [8] Non-negative Matrix Factorization-Based Blind Source Separation for Non-contact Heartbeat Detection
    Ye, Chen
    Toyoda, Kentaroh
    Ohtsuki, Tomoaki
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [9] Perceptually Weighted Non-negative Matrix Factorization for Blind Single-Channel Music Source Separation
    Kirbiz, S.
    Gunsel, B.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 226 - 229
  • [10] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
    Kirbiz, S.
    Gunsel, B.
    DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658