A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION

Cited by: 0
Authors
Bouvier, Damien [1]
Obin, Nicolas [1]
Liuni, Marco [1]
Roebel, Axel [1]
Affiliations
[1] UPMC, IRCAM, CNRS, UMR STMS IRCAM, Paris, France
Keywords
speech separation; non-negative matrix factorization; source/filter model; constraints; NONNEGATIVE MATRIX FACTORIZATION; PARTS;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper introduces a constrained source/filter model for semi-supervised speech separation based on non-negative matrix factorization (NMF). The objective is to inform NMF with prior knowledge about speech, so that the separation is physically meaningful. To do so, a source/filter model (referred to as the Instantaneous Mixture Model, or IMM) is integrated into the NMF. Constraints are then added to the IMM-NMF to control the behaviour of the NMF during separation and to enforce its physical meaning. In particular, a speech-specific constraint, based on the source/filter coherence of speech, and a method for automatically adapting the constraint weights during separation are presented. The proposed source/filter model is semi-supervised: during training, one filter basis is estimated for each phoneme of a speaker; during separation, the estimated filter bases are used in the constrained source/filter model. An experimental evaluation of speech separation was conducted on the TIMIT speakers database mixed with various environmental background noises from the QUT-NOISE database. The evaluation showed that adaptive constraints increase the performance of the source/filter model for speaker-dependent speech separation, and that the model compares favorably to fully-supervised speech separation.
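The semi-supervised idea in the abstract (speech bases learned in training and held fixed, noise bases estimated during separation) can be illustrated with a generic sketch. This is not the paper's IMM-NMF: it assumes a plain Euclidean cost with standard multiplicative updates, omits the source/filter decomposition, the constraints, and the adaptive weights, and uses random matrices in place of trained phoneme filter bases. The function name `semi_supervised_nmf` and all parameters are hypothetical.

```python
import numpy as np

def semi_supervised_nmf(V, W_speech, n_noise=4, n_iter=200, eps=1e-12, seed=0):
    """Factor V ~= [W_speech, W_noise] @ H while keeping W_speech fixed.

    Assumption: Euclidean cost with multiplicative updates, not the
    constrained IMM-NMF described in the abstract.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    k_speech = W_speech.shape[1]
    # Noise bases and all activations start from random positive values.
    W_noise = rng.random((n_freq, n_noise)) + eps
    H = rng.random((k_speech + n_noise, n_frames)) + eps
    for _ in range(n_iter):
        W = np.hstack([W_speech, W_noise])
        # Update the activations of both speech and noise bases.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        # Update the noise bases only; the speech bases stay as trained.
        H_noise = H[k_speech:]
        W_noise *= (V @ H_noise.T) / (W @ H @ H_noise.T + eps)
    W = np.hstack([W_speech, W_noise])
    # Wiener-like soft mask: share of the model explained by the speech part.
    speech_mask = (W_speech @ H[:k_speech]) / (W @ H + eps)
    return speech_mask * V, W_noise, H

# Toy usage with synthetic data (stand-ins for magnitude spectrograms).
rng = np.random.default_rng(1)
W_speech = rng.random((16, 3))   # hypothetical pre-trained speech bases
V = W_speech @ rng.random((3, 20)) + 0.1 * rng.random((16, 20))
speech_est, W_noise, H = semi_supervised_nmf(V, W_speech)
```

Holding `W_speech` fixed is what makes the factorization semi-supervised: only the noise dictionary and the activations adapt to the mixture.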
Pages: 131-135
Page count: 5