ITAKURA-SAITO NONNEGATIVE MATRIX FACTORIZATION WITH GROUP SPARSITY

Authors
Lefevre, Augustin
Bach, Francis
Fevotte, Cedric
Keywords
Blind source separation; audio signal processing; unsupervised learning; nonnegative matrix factorization; sparsity priors
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose an unsupervised inference procedure for audio source separation. Components in nonnegative matrix factorization (NMF) are grouped automatically into audio sources via a penalized maximum likelihood approach. The penalty term we introduce favors sparsity at the group level and is motivated by the assumption that the local amplitudes of the sources are independent. Our algorithm extends multiplicative updates for NMF; moreover, we propose a test statistic to tune hyperparameters in our model and illustrate its adequacy on synthetic data. Results on real audio tracks show that our sparsity prior makes it possible to identify audio sources without knowledge of their spectral properties.
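As a rough illustration of the kind of algorithm the abstract describes, the sketch below runs the standard multiplicative updates for Itakura-Saito NMF and adds a group-level penalty term to the denominator of the update for H. The penalty form `lam * log(a + ||H_g||_1)`, the function names, the parameters (`groups`, `lam`, `a`), and the column normalization of W are all assumptions made for illustration. The abstract does not specify them, so this should not be read as the authors' exact procedure.

```python
import numpy as np


def is_divergence(V, V_hat):
    """Itakura-Saito divergence between V and V_hat, summed over all entries."""
    R = V / V_hat
    return float(np.sum(R - np.log(R) - 1.0))


def group_is_nmf(V, n_components, groups, lam=0.0, a=1.0,
                 n_iter=200, seed=0, eps=1e-12):
    """Sketch of IS-NMF with an assumed group-sparsity penalty on H.

    V      : (F, N) strictly positive power spectrogram.
    groups : list of lists of component indices (hypothetical grouping).
    lam, a : assumed penalty weight and offset for lam * log(a + ||H_g||_1).
    """
    rng = np.random.default_rng(seed)
    F, N = V.shape
    W = rng.random((F, n_components)) + eps
    H = rng.random((n_components, N)) + eps

    for _ in range(n_iter):
        # Gradient of the assumed penalty: lam / (a + sum of the group's
        # activations in each frame), shared by all rows of the group.
        pen = np.zeros_like(H)
        for g in groups:
            pen[g, :] = lam / (a + H[g, :].sum(axis=0, keepdims=True))

        # Multiplicative update for H (penalty gradient in the denominator).
        V_hat = W @ H + eps
        H *= (W.T @ (V / V_hat**2)) / (W.T @ (1.0 / V_hat) + pen + eps)

        # Multiplicative update for W (unpenalized).
        V_hat = W @ H + eps
        W *= ((V / V_hat**2) @ H.T) / ((1.0 / V_hat) @ H.T + eps)

        # Fix the scale ambiguity: unit-sum columns of W, rescale H so that
        # the product W @ H is unchanged.
        scale = W.sum(axis=0, keepdims=True) + eps
        W /= scale
        H *= scale.T

    return W, H
```

With `lam=0.0` this reduces to plain Itakura-Saito NMF. For `lam > 0`, frames where a group's summed activation is small receive a larger penalty gradient, which tends to push entire groups toward zero in those frames. Normalizing W keeps the penalty from being trivially reduced by shrinking H and inflating W.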
Pages: 21-24
Page count: 4