A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION

被引:0
|
作者
Bouvier, Damien [1 ]
Obin, Nicolas [1 ]
Liuni, Marco [1 ]
Roebel, Axel [1 ]
机构
[1] UPMC, IRCAM, CNRS, UMR STMS IRCAM, Paris, France
关键词
speech separation; non-negative matrix factorization; source/filter model; constraints; NONNEGATIVE MATRIX FACTORIZATION; PARTS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a constrained source/filter model for semi-supervised speech separation based on non-negative matrix factorization (NMF). The objective is to inform NMF with prior knowledge about speech, providing a physically meaningful speech separation. To do so, a source/filter model (indicated as Instantaneous Mixture Model or IMM) is integrated in the NMF. Furthermore, constraints are added to the IMM-NMF, in order to control the NMF behaviour during separation, and to enforce its physical meaning. In particular, a speech specific constraint-based on the source/filter coherence of speech - and a method for the automatic adaptation of constraints' weights during separation are presented. Also, the proposed source/filter model is semi-supervised: during training, one filter basis is estimated for each phoneme of a speaker; during separation, the estimated filter bases are then used in the constrained source/filter model. An experimental evaluation for speech separation was conducted on the TIMIT speakers database mixed with various environmental background noises from the QUT-NOISE database. This evaluation showed that the use of adaptive constraints increases the performance of the source/filter model for speaker-dependent speech separation, and compares favorably to fully-supervised speech separation.
引用
收藏
页码:131 / 135
页数:5
相关论文
共 50 条
  • [21] Generalized Constraints for NMF with Application to Informed Source Separation
    Rohlfing, Christian
    Becker, Julian M.
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 597 - 601
  • [22] Weibull and Nakagami speech priors based regularized NMF with adaptive wiener filter for speech enhancement
    Jannu C.
    Vanambathina S.D.
    International Journal of Speech Technology, 2023, 26 (01) : 197 - 209
  • [23] REGULARIZED NMF-BASED SPEECH ENHANCEMENT WITH SPECTRAL COMPONENTS MODELED BY GAUSSIAN MIXTURES
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [24] DOES INHARMONICITY IMPROVE AN NMF-BASED PIANO TRANSCRIPTION MODEL ?
    Rigaud, Francois
    Falaize, Antoine
    David, Bertrand
    Daudet, Laurent
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 11 - 15
  • [25] Compositional model for speech denoising based on source/filter speech representation and smoothness/sparseness noise constraints
    Cabanas-Molero, P.
    Martinez-Munoz, D.
    Vera-Candeas, P.
    Canadas-Quesada, F. J.
    Ruiz-Reyes, N.
    SPEECH COMMUNICATION, 2016, 78 : 84 - 99
  • [26] An Improved Bayesian NMF-Based Speech Enhancement Method Using Multivariate Laplace Distribution
    Zhang, Liwei
    Zhang, Xiongwei
    Zou, Xia
    Min, Gang
    2014 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2014,
  • [27] Single Channel Blind Source Separation Based on NMF and Its Application to Speech Enhancement
    Chen, Yongqiang
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1066 - 1069
  • [28] Combined Multi-channel NMF-based Robust Beamforming for Noisy Speech Recognition
    Mimura, Masato
    Bando, Yoshiaki
    Shimada, Kazuki
    Sakai, Shinsuke
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2451 - 2455
  • [29] A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users
    Dekkers, Gert
    van Waterschoot, Toon
    Vanrumste, Bart
    Van Den Broeck, Bert
    Gemmeke, Jort F.
    Van Hamme, Hugo
    Karsmakers, Peter
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 746 - 750
  • [30] COMPLEX NMF UNDER PHASE CONSTRAINTS BASED ON SIGNAL MODELING: APPLICATION TO AUDIO SOURCE SEPARATION
    Magron, Paul
    Badeau, Roland
    David, Bertrand
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 46 - 50