Source-Filter-Based Single-Channel Speech Separation Using Pitch Information

被引：38

作者：

Stark, Michael ^{[1
]}

Wohlmayr, Michael ^{[1
]}

Pernkopf, Franz ^{[1
]}

机构：

[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 02期

基金：

奥地利科学基金会;

关键词：

Single-channel speech separation (SCSS); multi-pitch estimation; source-filter representation; ALGORITHM; TRACKING;

D O I：

10.1109/TASL.2010.2047419

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.

引用

下载

页码：242 / 255

页数：14

共 50 条

[21] New Results on Single-Channel Speech Separation Using Sinusoidal Modeling
Mowlaee, Pejman
Christensen, Mads Graesboll
Jensen, Soren Holdt
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1265 - 1277
[22] Single-channel speech separation using sequential discriminative dictionary learning
Ye, Zhongfu, 1600, Elsevier B.V., Netherlands (106):
[23] Single-channel speech separation using sequential discriminative dictionary learning
Xu, Yangfei
Bao, Guangzhao
Xu, Xu
Ye, Zhongfu
SIGNAL PROCESSING, 2015, 106 : 134 - 140
[24] SINGLE-CHANNEL SPEECH SEPARATION AND RECOGNITION USING LOOPY BELIEF PROPAGATION
Rennie, Steven J.
Hershey, John R.
Olsen, Peder A.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3845 - 3848
[25] SINGLE-CHANNEL SPEECH SEPARATION BASED ON ROBUST SPARSE BAYESIAN LEARNING
Wang, Zhe
Bi, Guoan
Li, Xiumei
2017 13TH IEEE INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION (ICCA), 2017, : 113 - 117
[26] Single-Channel Speech Separation Based on Deep Clustering with Local Optimization
Fu, Taotao
Yu, Ge
Guo, Lili
Wang, Yan
Liang, Ji
2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 44 - 49
[27] Catalog-Based Single-Channel Speech-Music Separation
Demir, Cemil
Cemgil, A. Taylan
Saraclar, Murat
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2786 - +
[28] Single-channel phaseless blind source separation
Humera Hameed
Ali Ahmed
Ubaid U. Fayyaz
Telecommunication Systems, 2022, 80 : 469 - 475
[29] Single-channel phaseless blind source separation
Hameed, Humera
Ahmed, Ali
Fayyaz, Ubaid U.
TELECOMMUNICATION SYSTEMS, 2022, 80 (03) : 469 - 475
[30] Speech separation from background of music based on single-channel recording
Jin, Xue-Cheng
Wang, Zeng-Fu
18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 278 - +

← 1 2 3 4 5 →