A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION

被引:0
|
作者
Bouvier, Damien [1 ]
Obin, Nicolas [1 ]
Liuni, Marco [1 ]
Roebel, Axel [1 ]
机构
[1] UPMC, IRCAM, CNRS, UMR STMS IRCAM, Paris, France
关键词
speech separation; non-negative matrix factorization; source/filter model; constraints; NONNEGATIVE MATRIX FACTORIZATION; PARTS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a constrained source/filter model for semi-supervised speech separation based on non-negative matrix factorization (NMF). The objective is to inform NMF with prior knowledge about speech, providing a physically meaningful speech separation. To do so, a source/filter model (indicated as Instantaneous Mixture Model or IMM) is integrated in the NMF. Furthermore, constraints are added to the IMM-NMF, in order to control the NMF behaviour during separation, and to enforce its physical meaning. In particular, a speech specific constraint-based on the source/filter coherence of speech - and a method for the automatic adaptation of constraints' weights during separation are presented. Also, the proposed source/filter model is semi-supervised: during training, one filter basis is estimated for each phoneme of a speaker; during separation, the estimated filter bases are then used in the constrained source/filter model. An experimental evaluation for speech separation was conducted on the TIMIT speakers database mixed with various environmental background noises from the QUT-NOISE database. This evaluation showed that the use of adaptive constraints increases the performance of the source/filter model for speaker-dependent speech separation, and compares favorably to fully-supervised speech separation.
引用
收藏
页码:131 / 135
页数:5
相关论文
共 50 条
  • [1] NMF-BASED INFORMED SOURCE SEPARATION
    Rohlfing, Christian
    Becker, Julian M.
    Wien, Mathias
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 474 - 478
  • [2] USING SCORE-INFORMED CONSTRAINTS FOR NMF-BASED SOURCE SEPARATION
    Ewert, Sebastian
    Mueller, Meinard
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 129 - 132
  • [3] Extended Semantic Initialization for NMF-based Audio Source Separation
    Rohlfing, Christian
    Becker, Julian M.
    2015 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2015, : 95 - 100
  • [4] NMF-based Target Source Separation Using Deep Neural Network
    Kang, Tae Gyoon
    Kwon, Kisoo
    Shin, Jong Won
    Kim, Nam Soo
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (02) : 229 - 233
  • [5] NMF-BASED SOURCE SEPARATION UTILIZING PRIOR KNOWLEDGE ON ENCODING VECTOR
    Kwon, Kisoo
    Shin, Jong Won
    Kim, Nam Soo
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 479 - 483
  • [6] Multichannel Audio Source Separation Exploiting NMF-Based Generic Source Spectral Model in Gaussian Modeling Framework
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Cong-Phuong Nguyen
    Quoc-Cuong Nguyen
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 547 - 557
  • [7] Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
    Jaeuk Byun
    Jong Won Shin
    中国通信, 2019, 16 (09) : 177 - 186
  • [8] Initialization for NMF-Based Audio Source Separation Using Priors on Encoding Vectors
    Byun, Jacuk
    Shin, Jong Won
    CHINA COMMUNICATIONS, 2019, 16 (09) : 177 - 186
  • [9] Prediction of NMF-based Wiener Filter for Speech Enhancement Using Deep Neural Networks
    Bai, Zhigang
    Bao, Changchun
    Cui, Zihao
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [10] OPTIMAL COST FUNCTION AND MAGNITUDE POWER FOR NMF-BASED SPEECH SEPARATION AND MUSIC INTERPOLATION
    King, Brian
    Fevotte, Cedric
    Smaragdis, Paris
    2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2012,