Audio data indexing: use of second-order statistics for speaker-based segmentation

Cited by: 0
Authors
Delacourt, P [1]
Wellekens, C [1]
Affiliation
[1] Inst EURECOM, F-06904 Sophia Antipolis, France
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The content-based indexing task considered in this paper consists of recognizing, from their voices, the speakers involved in a conversation. A new approach to speaker-based segmentation, which is the first necessary step for this indexing task, is described. Our study assumes that no prior information on the speakers is available, that the number of speakers is unknown, and that people do not speak simultaneously. Audio data indexing is commonly divided into two parts: the audio data is first segmented with respect to speaker utterances, and the resulting segments associated with a given speaker are then merged together. In this work, we focus on the first part and propose a new segmentation method based on second-order statistics. The practical significance of this study is illustrated by applying the new technique to real data to show its efficiency.
Pages: 959-963
Number of pages: 5
Related papers
50 in total
  • [1] DISTBIC: A speaker-based segmentation for audio data indexing
    Delacourt, P
    Wellekens, CJ
    [J]. SPEECH COMMUNICATION, 2000, 32 (1-2) : 111 - 126
  • [2] SEGMENTATION BASED ON SECOND-ORDER STATISTICS
    ROUNDS, EM
    SUTTY, G
    [J]. OPTICAL ENGINEERING, 1980, 19 (06) : 936 - 940
  • [3] A two-level method for unsupervised speaker-based audio segmentation
    Zhang, Shilei
    Zhang, Shuwu
    Xu, Bo
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 298 - +
  • [4] BINSEG: An Efficient Speaker-based Segmentation Technique
    Zdansky, Jindrich
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2182 - 2185
  • [5] Automatic segmentation and clustering for speaker indexing of audio databases
    Chen, YX
    Gao, J
    Wang, Q
    [J]. PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005, : 399 - 403
  • [6] A Fast Speaker Indexing Using Vector Quantization and Second Order Statistics with Adaptive Threshold Computation
    Biatov, Konstantin
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1453 - 1456
  • [7] Blind channel order estimation based on second-order statistics
    Gerstacker, WH
    Taylor, DP
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (02) : 39 - 42
  • [8] Beyond second-order statistics
    Coles, P
    [J]. NEW ERA IN COSMOLOGY, 2002, 283 : 56 - 59
  • [9] A blind equalization algorithm based on second-order statistics
    Jie, S
    Hu, B
    [J]. PROCEEDINGS OF THE IEEE 6TH CIRCUITS AND SYSTEMS SYMPOSIUM ON EMERGING TECHNOLOGIES: FRONTIERS OF MOBILE AND WIRELESS COMMUNICATION, VOLS 1 AND 2, 2004, : 373 - 376
  • [10] Digital Image Authentication Based on Second-Order Statistics
    Shena, Bias Sekar Avi
    Ciptasari, Rimba Whidiana
    Sthevanie, Febryanti
    [J]. 2016 4TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2016,