A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION

被引：0

作者：

Bouvier, Damien ^{[1
]}

Obin, Nicolas ^{[1
]}

Liuni, Marco ^{[1
]}

Roebel, Axel ^{[1
]}

机构：

[1] UPMC, IRCAM, CNRS, UMR STMS IRCAM, Paris, France

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年

关键词：

speech separation; non-negative matrix factorization; source/filter model; constraints; NONNEGATIVE MATRIX FACTORIZATION; PARTS;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper introduces a constrained source/filter model for semi-supervised speech separation based on non-negative matrix factorization (NMF). The objective is to inform NMF with prior knowledge about speech, providing a physically meaningful speech separation. To do so, a source/filter model (indicated as Instantaneous Mixture Model or IMM) is integrated in the NMF. Furthermore, constraints are added to the IMM-NMF, in order to control the NMF behaviour during separation, and to enforce its physical meaning. In particular, a speech specific constraint-based on the source/filter coherence of speech - and a method for the automatic adaptation of constraints' weights during separation are presented. Also, the proposed source/filter model is semi-supervised: during training, one filter basis is estimated for each phoneme of a speaker; during separation, the estimated filter bases are then used in the constrained source/filter model. An experimental evaluation for speech separation was conducted on the TIMIT speakers database mixed with various environmental background noises from the QUT-NOISE database. This evaluation showed that the use of adaptive constraints increases the performance of the source/filter model for speaker-dependent speech separation, and compares favorably to fully-supervised speech separation.

引用

页码：131 / 135

页数：5

共 50 条

[21] Generalized Constraints for NMF with Application to Informed Source Separation
Rohlfing, Christian
Becker, Julian M.
2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 597 - 601
[22] Weibull and Nakagami speech priors based regularized NMF with adaptive wiener filter for speech enhancement
Jannu C.
Vanambathina S.D.
International Journal of Speech Technology, 2023, 26 (01) : 197 - 209
[23] REGULARIZED NMF-BASED SPEECH ENHANCEMENT WITH SPECTRAL COMPONENTS MODELED BY GAUSSIAN MIXTURES
Chung, Hanwook
Plourde, Eric
Champagne, Benoit
2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
[24] DOES INHARMONICITY IMPROVE AN NMF-BASED PIANO TRANSCRIPTION MODEL ?
Rigaud, Francois
Falaize, Antoine
David, Bertrand
Daudet, Laurent
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 11 - 15
[25] Compositional model for speech denoising based on source/filter speech representation and smoothness/sparseness noise constraints
Cabanas-Molero, P.
Martinez-Munoz, D.
Vera-Candeas, P.
Canadas-Quesada, F. J.
Ruiz-Reyes, N.
SPEECH COMMUNICATION, 2016, 78 : 84 - 99
[26] An Improved Bayesian NMF-Based Speech Enhancement Method Using Multivariate Laplace Distribution
Zhang, Liwei
Zhang, Xiongwei
Zou, Xia
Min, Gang
2014 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2014,
[27] Single Channel Blind Source Separation Based on NMF and Its Application to Speech Enhancement
Chen, Yongqiang
2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1066 - 1069
[28] Combined Multi-channel NMF-based Robust Beamforming for Noisy Speech Recognition
Mimura, Masato
Bando, Yoshiaki
Shimada, Kazuki
Sakai, Shinsuke
Yoshii, Kazuyoshi
Kawahara, Tatsuya
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2451 - 2455
[29] A multi-channel speech enhancement framework for robust NMF-based speech recognition for speech-impaired users
Dekkers, Gert
van Waterschoot, Toon
Vanrumste, Bart
Van Den Broeck, Bert
Gemmeke, Jort F.
Van Hamme, Hugo
Karsmakers, Peter
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 746 - 750
[30] COMPLEX NMF UNDER PHASE CONSTRAINTS BASED ON SIGNAL MODELING: APPLICATION TO AUDIO SOURCE SEPARATION
Magron, Paul
Badeau, Roland
David, Bertrand
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 46 - 50

← 1 2 3 4 5 →