Monaural speech segregation based on fusion of source-driven with model-driven techniques

被引:19
|
作者
Radfar, Mohammad H.
Dansereau, Richard M.
Sayadiyan, Abolghasem
机构
[1] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran 15875 4413, Iran
基金
加拿大自然科学与工程研究理事会;
关键词
speech processing; monaural speech segregation; CASA; speech coding; harmonic modelling; vector quantization; envelope extraction; multi-pitch tracking; MIXMAX estimator;
D O I
10.1016/j.specom.2007.04.007
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper by exploiting the prevalent methods in speech coding and synthesis, a new single channel speech segregation technique is presented. The technique integrates a model-driven method with a source-driven method to take advantage of both individual approaches and reduce their pitfalls significantly. We apply harmonic modelling in which the pitch and spectrum envelope are the main components for the analysis and synthesis stages. Pitch values of two speakers are obtained by using a source-driven method. The spectrum envelope, is obtained by using a new model-driven technique consisting of four components: a trained codebook of the vector quantized envelopes (VQ-based separation), a mixture-maximum approximation (MIXMAX), minimum mean square error estimator (MMSE), and a harmonic synthesizer. In contrast with previous model-driven techniques, this approach is speaker independent and can separate out the unvoiced regions as well as suppress the crosstalk effect which both are the drawbacks of source-driven or equivalently computational auditory scene analysis (CASA) models. We compare our fused model with both model- and source-driven techniques by conducting subjective and objective experiments. The results show that although for the speaker-dependent case, model-based separation delivers the best quality, for a speaker independent scenario the integrated model outperforms the individual approaches. This result supports the idea that the human auditory system takes on both grouping cues (e.g., pitch tracking) and a priori knowledge (e.g., trained quantized envelopes) to segregate speech signals. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:464 / 476
页数:13
相关论文
共 50 条
  • [31] Model-Driven Architecture
    Venegas Loor, Leopoldo Vinicio
    REVISTA SAN GREGORIO, 2014, (08): : 64 - 72
  • [32] Model-driven engineering
    Schmidt, DC
    COMPUTER, 2006, 39 (02) : 25 - 31
  • [33] Model-driven development
    Mellor, SJ
    Clark, AN
    Futagami, T
    IEEE SOFTWARE, 2003, 20 (05) : 14 - 18
  • [34] Going model-driven
    Coulter, D
    CONTROL AND INSTRUMENTATION, 1997, 29 (09): : 27 - 28
  • [35] RF ion source-driven IEC design and operation
    Miley, GH
    Yang, Y
    Webber, J
    Shaban, Y
    Momota, H
    FUSION SCIENCE AND TECHNOLOGY, 2005, 47 (04) : 1233 - 1237
  • [36] SPECIAL ISSUE ON MODEL-DRIVEN SERVICE ENGINEERING: BENEFITS OF APPLYING MODEL-DRIVEN TECHNIQUES TO SERVICE ENGINEERING GUEST EDITORS' INTRODUCTION
    De Castro, Valeria
    Manuel Vara, Juan
    Van Den Heuvel, Willem-Jan
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2011, 20 (02) : 137 - 142
  • [37] Putting performance engineering into model-driven engineering: Model-driven performance engineering
    Fritzsche, Mathias
    Johannes, Jendrik
    MODELS IN SOFTWARE ENGINEERING, 2008, 5002 : 164 - +
  • [38] Comparison of model-driven architecture and software factories in the context of Model-Driven Development
    Demir, Ahmet
    Joint Meeting of the Fourth Workshop on Model-Based Development of Computer-Based Systems and Third International Workshop on Model-Based Methodologies for Pervasive and Embedded Software, Proceedings, 2006, : 75 - 83
  • [39] Theory of neutron fluctuations in source-driven subcritical systems
    Pazsit, I
    Yamane, Y
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1998, 403 (2-3): : 431 - 441
  • [40] Source-driven and Resonance-driven Harmonic Interaction between PV Inverters and the Grid
    Rogalla, Soenke
    Ackermann, Florian
    Bihler, Nicolas
    Moghadam, Hasanali
    Stalter, Olivier
    2016 IEEE 43RD PHOTOVOLTAIC SPECIALISTS CONFERENCE (PVSC), 2016, : 1399 - 1404