Using confidence measures to evaluate the speaker turns in speaker segmentation

Cited by: 0

Authors:

Chu, Wei [1]

Liu, Jia [1]

Affiliation:

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

Source:

2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3 | 2007

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

In this paper, we propose a speaker segmentation algorithm using confidence measures, named CM-DISTBIC, which inserts a confidence score computation and fusion procedure into the two-step DISTBIC and MDISTBIC algorithms. In the first step, the symmetric Kullback-Leibler (KL2) distance is replaced by the Bayesian Information Criterion (BIC) distance to obtain a lower misdetection rate. In the second step, three different confidence measures are attached to the speaker change candidates according to the distance curve derived from the first step. False alarm peaks with relatively low fused confidence scores are eliminated from the set of potential speaker turns. In the third step, speaker turn candidates are validated through the BIC. Compared with DISTBIC and MDISTBIC, CM-DISTBIC evaluated on broadcast news corpora achieves an increase in F-score of more than 11.5% and 8.9%, respectively.
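The BIC distance that the abstract mentions for the first step is not spelled out in this record. As a rough illustration only, the sketch below computes the commonly used ΔBIC score between two adjacent windows of feature frames, where a positive value favours the hypothesis of a speaker change at the boundary. It assumes diagonal-covariance Gaussians for simplicity; the function names `diag_log_det` and `delta_bic` and the `penalty_weight` parameter are hypothetical and are not taken from the paper, whose exact formulation may differ.

```python
import math


def diag_log_det(frames):
    """Log-determinant of the maximum-likelihood diagonal covariance
    of a list of equal-length feature vectors."""
    n = len(frames)
    dim = len(frames[0])
    total = 0.0
    for k in range(dim):
        mean = sum(f[k] for f in frames) / n
        var = sum((f[k] - mean) ** 2 for f in frames) / n
        total += math.log(max(var, 1e-10))  # floor the variance to avoid log(0)
    return total


def delta_bic(left, right, penalty_weight=1.0):
    """Delta-BIC between two adjacent windows (assumed formulation).

    Compares one Gaussian over both windows against one Gaussian per window.
    A positive result suggests a speaker change at the boundary.
    """
    n1, n2 = len(left), len(right)
    n = n1 + n2
    dim = len(left[0])
    # Likelihood gain from modelling the two windows separately.
    gain = 0.5 * (
        n * diag_log_det(left + right)
        - n1 * diag_log_det(left)
        - n2 * diag_log_det(right)
    )
    # A diagonal Gaussian has 2 * dim free parameters (means and variances);
    # the two-model hypothesis adds that many extra parameters.
    penalty = 0.5 * penalty_weight * (2 * dim) * math.log(n)
    return gain - penalty
```

Sliding such a score along the signal yields a distance curve whose peaks are the speaker change candidates; how the three confidence measures are derived from that curve and fused is not described in the abstract, so it is not sketched here.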


Pages: 728-731

Page count: 4

