Source-Filter-Based Single-Channel Speech Separation Using Pitch Information

被引:38
|
作者
Stark, Michael [1 ]
Wohlmayr, Michael [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
基金
奥地利科学基金会;
关键词
Single-channel speech separation (SCSS); multi-pitch estimation; source-filter representation; ALGORITHM; TRACKING;
D O I
10.1109/TASL.2010.2047419
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.
引用
下载
收藏
页码:242 / 255
页数:14
相关论文
共 50 条
  • [41] Optimum Mixture Estimator for single-channel Speech Separation
    Mowlaee, Pejman
    Sayadiyan, Abolghassem
    Sheikhan, Mansour
    2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 543 - +
  • [42] Single-channel Blind Source Separation Based on Cyclic Spectrum Estimation
    He, Jiai
    Liu, Linzhi
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 1525 - 1529
  • [43] Decomposition of Radiated Disturbances Based on Single-channel Blind Source Separation
    Cao, Bin
    Lu, Jiajun
    Gu, Yixing
    Ren, Jinjing
    Jing, Shenhui
    PROCEEDINGS OF THE 2020 INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (EMC EUROPE), 2020,
  • [44] A WATERMARKING-BASED METHOD FOR SINGLE-CHANNEL AUDIO SOURCE SEPARATION
    Parvaix, Mathieu
    Girin, Laurent
    Brossier, Jean-Marc
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 101 - +
  • [45] Informed Single-Channel Speech Separation Using HMM-GMM User-Generated Exemplar Source
    Wang, Qi
    Woo, W. L.
    Dlay, S. S.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 2087 - 2100
  • [46] Single-channel Music/Speech Separation Using Non-linear Masks
    Mowlaee, P.
    Sayadian, A.
    Sheikhan, M.
    Fallah, M.
    2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 782 - +
  • [47] NEURAL SOURCE-FILTER-BASED WAVEFORM MODEL FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
    Wang, Xin
    Takaki, Shinji
    Yamagishi, Junichi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5916 - 5920
  • [48] A VQ-based Single-Channel Audio Separation for Music/Speech Mixtures
    Asgari, Meysam
    Fallah, Mahdi
    Mehrizi, Elahe Abouie
    Mostafavi, Ali
    UKSIM 2009: ELEVENTH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION, 2009, : 223 - +
  • [49] Single-channel Speech Separation Using Dictionary-updated Orthogonal Matching Pursuit and Temporal Structure Information
    Haiyan Guo
    Xiaoxiong Li
    Lin Zhou
    Zhenyang Wu
    Circuits, Systems, and Signal Processing, 2015, 34 : 3861 - 3882
  • [50] Deep clustering-based single-channel speech separation and recent advances
    Aihara, Ryo
    Wichern, Gordon
    Le Roux, Jonathan
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2020, 41 (02) : 465 - 471