Source-Filter-Based Single-Channel Speech Separation Using Pitch Information

被引:38
|
作者
Stark, Michael [1 ]
Wohlmayr, Michael [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
基金
奥地利科学基金会;
关键词
Single-channel speech separation (SCSS); multi-pitch estimation; source-filter representation; ALGORITHM; TRACKING;
D O I
10.1109/TASL.2010.2047419
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.
引用
收藏
页码:242 / 255
页数:14
相关论文
共 50 条
  • [1] SINGLE-CHANNEL SPEECH SEPARATION INTEGRATING PITCH INFORMATION BASED ON A MULTI TASK LEARNING FRAMEWORK
    Li, Xiang
    Liu, Rui
    Song, Tao
    Wu, Xihong
    Chen, Jing
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7279 - 7283
  • [2] A PITCH-AWARE APPROACH TO SINGLE-CHANNEL SPEECH SEPARATION
    Wang, Ke
    Soong, Frank
    Xie, Lei
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 296 - 300
  • [3] Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers
    Prasanna Kumar M.K.
    Kumaraswamy R.
    [J]. International Journal of Speech Technology, 2017, 20 (1) : 109 - 125
  • [4] Single-channel speech separation using combined EMD and speech-specific information
    Prasanna Kumar M.K.
    Kumaraswamy R.
    [J]. International Journal of Speech Technology, 2017, 20 (4) : 1037 - 1047
  • [5] Single-Channel Speech Separation Using Phase-Based Methods
    Lee, Yun-Kyung
    Lee, In Sung
    Kwon, Oh-Wook
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2453 - 2459
  • [6] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
    Lopez, A. Ramirez
    Ono, N.
    Remes, U.
    Palomaki, K.
    Kurimo, M.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
  • [7] A Pitch State Dependent Dictionary Design Method for Single-Channel Speech Separation
    Guo, Haiyan
    Yang, Zhen
    Zhang, Linghua
    Ye, Lei
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [8] Single-channel speech separation based on modulation frequency
    Gu, Lingyun
    Stern, Richard M.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 25 - 28
  • [9] Sparsity-based phase spectrum compensation for single-channel speech source separation
    Jeon, Kwang Myung
    Kim, Hong Kook
    [J]. DIGITAL SIGNAL PROCESSING, 2020, 97
  • [10] Using Cyclic Noise as the Source Signal for Neural Source-Filter-based Speech Waveform Model
    Wang, Xin
    Yamagishi, Junichi
    [J]. INTERSPEECH 2020, 2020, : 1992 - 1996