Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping

Cited by: 0
Authors
Li, Ming [1]
Cao, Chuan [1]
Wang, Di [1]
Lu, Ping [1]
Fu, Qiang [1]
Yan, Yonghong [1]
Affiliation
[1] Chinese Academy of Sciences, Institute of Acoustics, ThinkIT Speech Lab, Beijing 100190, People's Republic of China
Keywords
Auditory scene analysis; cochannel speech; multi-pitch estimation; sequential grouping
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a new cochannel speech separation algorithm that uses multi-pitch extraction and speaker-model-based sequential grouping. After auditory segmentation based on onset and offset analysis, a robust multi-pitch estimation algorithm is applied to each segment and the corresponding voiced portions are segregated. A speaker-pair model based on a support vector machine (SVM) is then employed to determine the optimal sequential grouping alignment and to group the speaker-homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows a significant improvement over the baseline performance.
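The sequential grouping step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the optimal alignment is searched, so the sketch assumes an exhaustive search over two-stream labelings, and it swaps the SVM speaker-pair model for a toy score based on pitch compactness. The names `best_grouping` and `pitch_compactness`, and the representation of each segment by its mean pitch in Hz, are hypothetical choices made only for illustration.

```python
from itertools import product
from typing import Callable, Sequence, Tuple

Labels = Tuple[int, ...]


def best_grouping(
    segments: Sequence[float],
    score: Callable[[Sequence[float], Labels], float],
) -> Tuple[Labels, float]:
    """Exhaustively search two-stream labelings and return the best one.

    `score` stands in for a speaker-pair model: higher means the labeling
    is more plausible. This search is an assumption, not the paper's method.
    """
    if not segments:
        raise ValueError("at least one segment is required")
    best_labels: Labels = ()
    best_score = float("-inf")
    # The first segment is fixed to stream 0, because swapping the two
    # stream names describes the same partition of the segments.
    for rest in product((0, 1), repeat=len(segments) - 1):
        labels = (0,) + rest
        current = score(segments, labels)
        if current > best_score:
            best_labels, best_score = labels, current
    return best_labels, best_score


def pitch_compactness(segments: Sequence[float], labels: Labels) -> float:
    """Toy score (assumption): negative within-stream squared pitch spread."""
    total = 0.0
    for stream in (0, 1):
        values = [p for p, lab in zip(segments, labels) if lab == stream]
        if values:
            mean = sum(values) / len(values)
            total += sum((v - mean) ** 2 for v in values)
    return -total


# Four voiced segments, each summarized by a hypothetical mean pitch in Hz.
labels, value = best_grouping([110.0, 210.0, 115.0, 205.0], pitch_compactness)
print(labels)  # (0, 1, 0, 1)
```

The exhaustive search grows as 2^(n-1) with the number of segments n, so it is only practical for short utterances; a real system would need a trained speaker model in place of the toy score and likely a pruned search.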
Pages: 151-154
Number of pages: 4