Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping

Cited by: 0

Authors:

Li, Ming [1]

Cao, Chuan [1]

Wang, Di [1]

Lu, Ping [1]

Fu, Qiang [1]

Yan, Yonghong [1]

Affiliations:

[1] Chinese Acad Sci, ThinkIT Speech Lab, Inst Acoust, Beijing 100190, Peoples R China

Source:

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008

Keywords:

Auditory scene analysis; cochannel speech; multi-pitch estimation; sequential grouping;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a new cochannel speech separation algorithm that uses multi-pitch extraction and speaker-model-based sequential grouping. After auditory segmentation based on onset and offset analysis, a robust multi-pitch estimation algorithm is applied to each segment and the corresponding voiced portions are segregated. A speaker-pair model based on a support vector machine (SVM) is then employed to determine the optimal sequential grouping alignment and to group the speaker-homogeneous segments into pure speaker streams. Systematic evaluation on the Speech Separation Challenge database shows significant improvement over the baseline performance.

Pages: 151-154

Page count: 4

50 items in total

[11] Joint DOA and multi-pitch estimation based on subspace techniques
Zhang, Johan Xi
Christensen, Mads Graesboll
Jensen, Soren Holdt
Moonen, Marc
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012
[12] Multi-pitch estimation based on partial event and support transfer
Duan, Zhiyao
Zhang, Dan
Zhang, Changshui
Shi, Zhenwei
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 216 - 219
[13] Joint DOA and multi-pitch estimation based on subspace techniques
Johan Xi Zhang
Mads Græsbøll Christensen
Søren Holdt Jensen
Marc Moonen
[J]. EURASIP Journal on Advances in Signal Processing, 2012
[14] Using multi-scale product spectrum for single and multi-pitch estimation
Messaoud, M. A. B.
Bouzid, A.
Ellouze, N.
[J]. IET SIGNAL PROCESSING, 2011, 5 (03) : 344 - 355
[15] LOCALIZATION BASED SEQUENTIAL GROUPING FOR CONTINUOUS SPEECH SEPARATION
Wang, Zhong-Qiu
Wang, DeLiang
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 281 - 285
[16] Multi-pitch and periodicity analysis model for sound separation and auditory scene analysis
Karjalainen, M
Tolonen, T
[J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 929 - 932
[17] MULTI-PITCH ESTIMATION AND TRACKING USING BAYESIAN INFERENCE IN BLOCK SPARSITY
Karimian-Azari, Sam
Jakobsson, Andreas
Jensen, Jesper R.
Christensen, Mads G.
[J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 16 - 20
[18] EXPECTATION-MAXIMIZATION ALGORITHM FOR MULTI-PITCH ESTIMATION AND SEPARATION OF OVERLAPPING HARMONIC SPECTRA
Badeau, Roland
Emiya, Valentin
David, Bertrand
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3073 - 3076
[19] Co-channel speaker identification using usable speech extraction based on multi-pitch tracking
Shao, Y
Wang, DL
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 205 - 208
[20] An iterative model-based approach to cochannel speech separation
Hu, Ke
Wang, DeLiang
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013