Source-Filter-Based Single-Channel Speech Separation Using Pitch Information

被引：38

作者：

Stark, Michael ^{[1
]}

Wohlmayr, Michael ^{[1
]}

Pernkopf, Franz ^{[1
]}

机构：

[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 02期

基金：

奥地利科学基金会;

关键词：

Single-channel speech separation (SCSS); multi-pitch estimation; source-filter representation; ALGORITHM; TRACKING;

D O I：

10.1109/TASL.2010.2047419

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.

引用

下载

页码：242 / 255

页数：14

共 50 条

[1] SINGLE-CHANNEL SPEECH SEPARATION INTEGRATING PITCH INFORMATION BASED ON A MULTI TASK LEARNING FRAMEWORK
Li, Xiang
Liu, Rui
Song, Tao
Wu, Xihong
Chen, Jing
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7279 - 7283
[2] Single-channel speech separation based on pitch state and interframe correlation
Guo, Haiyan
Li, Xiaoxiong
Li, Nijun
Zhou, Lin
Wu, Zhenyang
Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2014, 44 (06): : 1099 - 1104
[3] A PITCH-AWARE APPROACH TO SINGLE-CHANNEL SPEECH SEPARATION
Wang, Ke
Soong, Frank
Xie, Lei
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 296 - 300
[4] Single-channel speech separation using empirical mode decomposition and multi pitch information with estimation of number of speakers
Prasanna Kumar M.K.
Kumaraswamy R.
International Journal of Speech Technology, 2017, 20 (01) : 109 - 125
[5] Single-channel speech separation using combined EMD and speech-specific information
Prasanna Kumar M.K.
Kumaraswamy R.
International Journal of Speech Technology, 2017, 20 (4) : 1037 - 1047
[6] Single-Channel Speech Separation Using Phase-Based Methods
Lee, Yun-Kyung
Lee, In Sung
Kwon, Oh-Wook
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2453 - 2459
[7] DESIGNING MULTICHANNEL SOURCE SEPARATION BASED ON SINGLE-CHANNEL SOURCE SEPARATION
Lopez, A. Ramirez
Ono, N.
Remes, U.
Palomaki, K.
Kurimo, M.
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 469 - 473
[8] A Pitch State Dependent Dictionary Design Method for Single-Channel Speech Separation
Guo, Haiyan
Yang, Zhen
Zhang, Linghua
Ye, Lei
2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
[9] Single-channel speech separation based on modulation frequency
Gu, Lingyun
Stern, Richard M.
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 25 - 28
[10] Sparsity-based phase spectrum compensation for single-channel speech source separation
Jeon, Kwang Myung
Kim, Hong Kook
DIGITAL SIGNAL PROCESSING, 2020, 97

← 1 2 3 4 5 →