Audio data indexing: use of second-order statistics for speaker-based segmentation

Cited by: 0

Authors:

Delacourt, P [1]

Wellekens, C [1]

Affiliations:

[1] Inst EURECOM, F-06904 Sophia Antipolis, France

Source:

IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2 | 1999

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The content-based indexing task considered in this paper consists of recognizing, from their voices, the speakers involved in a conversation. A new approach to speaker-based segmentation, which is the first necessary step for this indexing task, is described. Our study assumes that no prior information on the speakers is available, that the number of speakers is unknown, and that people do not speak simultaneously. Audio data indexing is commonly divided into two parts: the audio data is first segmented with respect to speakers' utterances, and then the resulting segments associated with a given speaker are merged together. In this work, we focus on the first part and propose a new segmentation method based on second-order statistics. The practical significance of this study is illustrated by applying the new technique to real data to show its efficiency.


Pages: 959-963

Page count: 5

50 records in total

[1] DISTBIC: A speaker-based segmentation for audio data indexing
Delacourt, P
Wellekens, CJ
[J]. SPEECH COMMUNICATION, 2000, 32 (1-2) : 111 - 126
[2] SEGMENTATION BASED ON SECOND-ORDER STATISTICS
ROUNDS, EM
SUTTY, G
[J]. OPTICAL ENGINEERING, 1980, 19 (06) : 936 - 940
[3] A two-level method for unsupervised speaker-based audio segmentation
Zhang, Shilei
Zhang, Shuwu
Xu, Bo
[J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 298 - +
[4] BINSEG: An Efficient Speaker-based Segmentation Technique
Zdansky, Jindrich
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2182 - 2185
[5] Automatic segmentation and clustering for speaker indexing of audio databases
Chen, YX
Gao, J
Wang, Q
[J]. PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005, : 399 - 403
[6] A Fast Speaker Indexing Using Vector Quantization and Second Order Statistics with Adaptive Threshold Computation
Biatov, Konstantin
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1453 - 1456
[7] Blind channel order estimation based on second-order statistics
Gerstacker, WH
Taylor, DP
[J]. IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (02) : 39 - 42
[8] Beyond second-order statistics
Coles, P
[J]. NEW ERA IN COSMOLOGY, 2002, 283 : 56 - 59
[9] A blind equalization algorithm based on second-order statistics
Jie, S
Hu, B
[J]. PROCEEDINGS OF THE IEEE 6TH CIRCUITS AND SYSTEMS SYMPOSIUM ON EMERGING TECHNOLOGIES: FRONTIERS OF MOBILE AND WIRELESS COMMUNICATION, VOLS 1 AND 2, 2004, : 373 - 376
[10] Digital Image Authentication Based on Second-Order Statistics
Shena, Bias Sekar Avi
Ciptasari, Rimba Whidiana
Sthevanie, Febryanti
[J]. 2016 4TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2016,
