Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping

Authors
Li, Ming [1]
Cao, Chuan [1]
Wang, Di [1]
Lu, Ping [1]
Fu, Qiang [1]
Yan, Yonghong [1]
Affiliations
[1] Chinese Acad Sci, ThinkIT Speech Lab, Inst Acoust, Beijing 100190, Peoples R China
Keywords
Auditory scene analysis; cochannel speech; multi-pitch estimation; sequential grouping
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a new cochannel speech separation algorithm that uses multi-pitch extraction and speaker-model-based sequential grouping. After auditory segmentation based on onset and offset analysis, a robust multi-pitch estimation algorithm is applied to each segment, and the corresponding voiced portions are segregated. A speaker-pair model based on a support vector machine (SVM) is then used to determine the optimal sequential grouping alignment and to group the speaker-homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows significant improvement over the baseline performance.
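The sequential grouping step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no algorithmic detail, so everything below is an assumption made for illustration. It assumes each voiced segment carries a signed score (as an SVM decision value might provide, positive favoring speaker 0 and negative favoring speaker 1), assumes that segments overlapping in time must go to different speakers, and searches all labelings exhaustively. The names `best_grouping`, `scores`, and `overlaps` are hypothetical.

```python
from itertools import product


def best_grouping(scores, overlaps):
    """Assign each segment to speaker 0 or 1 by exhaustive search.

    scores:   hypothetical signed per-segment scores; > 0 favors speaker 0,
              < 0 favors speaker 1 (an assumption, not taken from the paper).
    overlaps: pairs (i, j) of segments assumed to overlap in time, which
              therefore may not share a speaker label.
    Returns (labels, total) for the highest-scoring valid labeling,
    or (None, -inf) if no labeling satisfies the constraints.
    """
    best_labels, best_total = None, float("-inf")
    for labels in product((0, 1), repeat=len(scores)):
        # Skip labelings that put two overlapping segments in one stream.
        if any(labels[i] == labels[j] for i, j in overlaps):
            continue
        # A segment contributes +score if labeled 0, -score if labeled 1.
        total = sum(s if lab == 0 else -s for s, lab in zip(scores, labels))
        if total > best_total:
            best_labels, best_total = labels, total
    return best_labels, best_total


if __name__ == "__main__":
    labels, total = best_grouping([2.0, 0.5, -1.0], overlaps=[(0, 1)])
    print(labels, total)
```

Exhaustive search costs 2^n evaluations for n segments, so it is only practical for short utterances; it is used here because it makes the notion of an optimal grouping alignment explicit.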
Pages: 151-154
Page count: 4