Source-Filter-Based Single-Channel Speech Separation Using Pitch Information

被引:38
|
作者
Stark, Michael [1 ]
Wohlmayr, Michael [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
基金
奥地利科学基金会;
关键词
Single-channel speech separation (SCSS); multi-pitch estimation; source-filter representation; ALGORITHM; TRACKING;
D O I
10.1109/TASL.2010.2047419
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.
引用
下载
收藏
页码:242 / 255
页数:14
相关论文
共 50 条
  • [21] New Results on Single-Channel Speech Separation Using Sinusoidal Modeling
    Mowlaee, Pejman
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1265 - 1277
  • [22] Single-channel speech separation using sequential discriminative dictionary learning
    Ye, Zhongfu, 1600, Elsevier B.V., Netherlands (106):
  • [23] Single-channel speech separation using sequential discriminative dictionary learning
    Xu, Yangfei
    Bao, Guangzhao
    Xu, Xu
    Ye, Zhongfu
    SIGNAL PROCESSING, 2015, 106 : 134 - 140
  • [24] SINGLE-CHANNEL SPEECH SEPARATION AND RECOGNITION USING LOOPY BELIEF PROPAGATION
    Rennie, Steven J.
    Hershey, John R.
    Olsen, Peder A.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3845 - 3848
  • [25] SINGLE-CHANNEL SPEECH SEPARATION BASED ON ROBUST SPARSE BAYESIAN LEARNING
    Wang, Zhe
    Bi, Guoan
    Li, Xiumei
    2017 13TH IEEE INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2017, : 113 - 117
  • [26] Single-Channel Speech Separation Based on Deep Clustering with Local Optimization
    Fu, Taotao
    Yu, Ge
    Guo, Lili
    Wang, Yan
    Liang, Ji
    2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 44 - 49
  • [27] Catalog-Based Single-Channel Speech-Music Separation
    Demir, Cemil
    Cemgil, A. Taylan
    Saraclar, Murat
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2786 - +
  • [28] Single-channel phaseless blind source separation
    Humera Hameed
    Ali Ahmed
    Ubaid U. Fayyaz
    Telecommunication Systems, 2022, 80 : 469 - 475
  • [29] Single-channel phaseless blind source separation
    Hameed, Humera
    Ahmed, Ali
    Fayyaz, Ubaid U.
    TELECOMMUNICATION SYSTEMS, 2022, 80 (03) : 469 - 475
  • [30] Speech separation from background of music based on single-channel recording
    Jin, Xue-Cheng
    Wang, Zeng-Fu
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 278 - +