Monaural speech segregation based on fusion of source-driven with model-driven techniques

被引：19

作者：

Radfar, Mohammad H.

Dansereau, Richard M.

Sayadiyan, Abolghasem

机构：

[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada

[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran 15875 4413, Iran

来源：

SPEECH COMMUNICATION | 2007年 / 49卷 / 06期

基金：

加拿大自然科学与工程研究理事会;

关键词：

speech processing; monaural speech segregation; CASA; speech coding; harmonic modelling; vector quantization; envelope extraction; multi-pitch tracking; MIXMAX estimator;

D O I：

10.1016/j.specom.2007.04.007

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper by exploiting the prevalent methods in speech coding and synthesis, a new single channel speech segregation technique is presented. The technique integrates a model-driven method with a source-driven method to take advantage of both individual approaches and reduce their pitfalls significantly. We apply harmonic modelling in which the pitch and spectrum envelope are the main components for the analysis and synthesis stages. Pitch values of two speakers are obtained by using a source-driven method. The spectrum envelope, is obtained by using a new model-driven technique consisting of four components: a trained codebook of the vector quantized envelopes (VQ-based separation), a mixture-maximum approximation (MIXMAX), minimum mean square error estimator (MMSE), and a harmonic synthesizer. In contrast with previous model-driven techniques, this approach is speaker independent and can separate out the unvoiced regions as well as suppress the crosstalk effect which both are the drawbacks of source-driven or equivalently computational auditory scene analysis (CASA) models. We compare our fused model with both model- and source-driven techniques by conducting subjective and objective experiments. The results show that although for the speaker-dependent case, model-based separation delivers the best quality, for a speaker independent scenario the integrated model outperforms the individual approaches. This result supports the idea that the human auditory system takes on both grouping cues (e.g., pitch tracking) and a priori knowledge (e.g., trained quantized envelopes) to segregate speech signals. (C) 2007 Elsevier B.V. All rights reserved.

引用

页码：464 / 476

页数：13

共 50 条

[41] Model-driven architecture based security analysis
Mili, Saoussen
Nguyen, Nga
Chelouah, Rachid
SYSTEMS ENGINEERING, 2021, 24 (05) : 307 - 321
[42] A Model-Driven Visualization System Based on DVDL
Du, Yi
Ren, Lei
Zhou, Yuanchun
Li, Jianhui
CHALLENGES AND OPPORTUNITY WITH BIG DATA, 2017, 10228 : 11 - 24
[43] Model-Driven Engineering Based on Attribute Grammars
Calegari, Daniel
Viera, Marcos
PROGRAMMING LANGUAGES, SBLP 2015, 2015, 9325 : 112 - 127
[44] Model-Driven Prototyping Based Requirements Elicitation
Fu, Jicheng
Bastani, Farokh B.
Yen, I-Ling
INNOVATIONS FOR REQUIREMENTS ANALYSIS: FROM STAKEHOLDERS' NEEDS TO FORMAL DESIGNS, 2008, 5320 : 43 - 61
[45] Towards a Model-driven based Security Framework
Abdallah, Rouwaida
Yakymets, Nataliya
Lanusse, Agnes
MODELSWARD 2015 PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, 2015, : 639 - 645
[46] Hyperspectral and multispectral image fusion: When model-driven meet data-driven strategies
Yan, Hao-Fang
Zhao, Yong-Qiang
Chan, Jonathan Cheung-Wai
Kong, Seong G.
EI-Bendary, Nashwa
Reda, Mohamed
INFORMATION FUSION, 2025, 116
[47] Variational acceleration of fission source iteration for subcritical source-driven systems
20211910321949
(1) Energy Institute, Istanbul Technical University, Istanbul, Turkey, 1600, (Japan Atomic Energy Agency, JAEA):
[48] DM-Fusion: Deep Model-Driven Network for Heterogeneous Image Fusion
Xu, Guoxia
He, Chunming
Wang, Hao
Zhu, Hu
Ding, Weiping
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 10071 - 10085
[49] Systematic review of matching techniques used in model-driven methodologies
Somogyi, Ferenc Attila
Asztalos, Mark
SOFTWARE AND SYSTEMS MODELING, 2020, 19 (03): : 693 - 720
[50] Systematic review of matching techniques used in model-driven methodologies
Ferenc Attila Somogyi
Mark Asztalos
Software and Systems Modeling, 2020, 19 : 693 - 720

← 1 2 3 4 5 →