Temporal annotation-based audio source separation using weighted nonnegative matrix factorization

被引:0
|
作者
Duong, Ngoc Q. K. [1 ]
Ozerov, Alexey [1 ]
Chevallier, Louis [1 ]
机构
[1] Technicolor, 975 Ave Champs Blanes,CS 17616, F-35576 Cesson Sevigne, France
关键词
User-guided audio source separation; temporal annotation; nonnegative matrix factorization (NMF); weighted NMF;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We consider an emerging user-guided audio source separation approach based on the temporal annotation of the source activity along the mixture. In this baseline algorithm nonnegative matrix factorization (NMF) is usually used as spectral model for audio sources. In this paper we propose two weighting strategies incorporated in the NMF formulation so as to better exploit the annotation. We then derive the corresponding multiplicative update (MU) rules for the parameter estimation. The proposed approach was objectively evaluated within the fourth community- based Signal Separation Evaluation Campaign (SiSEC 2013) and shown to outperform the baseline algorithm, while obtaining comparable result to some other state-of-the-art methods.
引用
收藏
页码:220 / 224
页数:5
相关论文
共 50 条
  • [1] Audio Source Separation Based on Nonnegative Matrix Factorization with Graph Harmonic Structure
    Ichita, Tomohiro
    Kyochi, Seisuke
    Imoto, Keisuke
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1148 - 1152
  • [2] BAYESIAN MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR AUDIO SOURCE SEPARATION AND LOCALIZATION
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 551 - 555
  • [3] Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
    Ozerov, Alexey
    Fevotte, Cedric
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 550 - 563
  • [4] Ray-Space-Based Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Pezzoli, Mirco
    Carabias-Orti, Julio Jose
    Cobos, Maximo
    Antonacci, Fabio
    Sarti, Augusto
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 369 - 373
  • [5] Supervised Audio Source Separation Based on Nonnegative Matrix Factorization with Cosine Similarity Penalty
    Iwase, Yuta
    Kitamura, Daichi
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (06) : 906 - 913
  • [6] REVERBERANT AUDIO SOURCE SEPARATION USING PARTIALLY PRE-TRAINED NONNEGATIVE MATRIX FACTORIZATION
    Fakhry, Mahmoud
    Svaizer, Piergiorgio
    Omologo, Maurizio
    [J]. 2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 273 - 277
  • [7] SCORE INFORMED AUDIO SOURCE SEPARATION USING CONSTRAINED NONNEGATIVE MATRIX FACTORIZATION AND SCORE SYNTHESIS
    Fritsch, Joachim
    Plumbley, Mark D.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 888 - 891
  • [8] Beamspace-Domain Multichannel Nonnegative Matrix Factorization for Audio Source Separation
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (01) : 43 - 46
  • [9] Audio Source Separation in Reverberant Environments Using β-Divergence-Based Nonnegative Factorization
    Fakhry, Mahmoud
    Svaizer, Piergiorgio
    Omologo, Maurizio
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1462 - 1476
  • [10] NONNEGATIVE TENSOR FACTORIZATION FOR SOURCE SEPARATION OF LOOPS IN AUDIO
    Smith, Jordan B. L.
    Goto, Masataka
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 171 - 175