Source-Filter-Based Single-Channel Speech Separation Using Pitch Information

被引：38

作者：

Stark, Michael ^{[1
]}

Wohlmayr, Michael ^{[1
]}

Pernkopf, Franz ^{[1
]}

机构：

[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 02期

基金：

奥地利科学基金会;

关键词：

Single-channel speech separation (SCSS); multi-pitch estimation; source-filter representation; ALGORITHM; TRACKING;

D O I：

10.1109/TASL.2010.2047419

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.

引用

下载

页码：242 / 255

页数：14

共 50 条

[41] Optimum Mixture Estimator for single-channel Speech Separation
Mowlaee, Pejman
Sayadiyan, Abolghassem
Sheikhan, Mansour
2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 543 - +
[42] Single-channel Blind Source Separation Based on Cyclic Spectrum Estimation
He, Jiai
Liu, Linzhi
2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 1525 - 1529
[43] Decomposition of Radiated Disturbances Based on Single-channel Blind Source Separation
Cao, Bin
Lu, Jiajun
Gu, Yixing
Ren, Jinjing
Jing, Shenhui
PROCEEDINGS OF THE 2020 INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY (EMC EUROPE), 2020,
[44] A WATERMARKING-BASED METHOD FOR SINGLE-CHANNEL AUDIO SOURCE SEPARATION
Parvaix, Mathieu
Girin, Laurent
Brossier, Jean-Marc
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 101 - +
[45] Informed Single-Channel Speech Separation Using HMM-GMM User-Generated Exemplar Source
Wang, Qi
Woo, W. L.
Dlay, S. S.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 2087 - 2100
[46] Single-channel Music/Speech Separation Using Non-linear Masks
Mowlaee, P.
Sayadian, A.
Sheikhan, M.
Fallah, M.
2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 782 - +
[47] NEURAL SOURCE-FILTER-BASED WAVEFORM MODEL FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
Wang, Xin
Takaki, Shinji
Yamagishi, Junichi
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5916 - 5920
[48] A VQ-based Single-Channel Audio Separation for Music/Speech Mixtures
Asgari, Meysam
Fallah, Mahdi
Mehrizi, Elahe Abouie
Mostafavi, Ali
UKSIM 2009: ELEVENTH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION, 2009, : 223 - +
[49] Single-channel Speech Separation Using Dictionary-updated Orthogonal Matching Pursuit and Temporal Structure Information
Haiyan Guo
Xiaoxiong Li
Lin Zhou
Zhenyang Wu
Circuits, Systems, and Signal Processing, 2015, 34 : 3861 - 3882
[50] Deep clustering-based single-channel speech separation and recent advances
Aihara, Ryo
Wichern, Gordon
Le Roux, Jonathan
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2020, 41 (02) : 465 - 471

← 1 2 3 4 5 →