Using confidence measures to evaluate the speaker turns in speaker segmentation

Cited by: 0

Authors:

Chu, Wei [1]

Liu, Jia [1]

Affiliation:

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

Source:

2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3 | 2007

Keywords:

DOI:

Not available

Chinese Library Classification:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

In this paper, we propose a speaker segmentation algorithm using confidence measures, named CM-DISTBIC, which inserts a confidence score computation and fusion procedure into the two-step DISTBIC and MDISTBIC algorithms. In the first step, the symmetric Kullback-Leibler (KL2) distance is replaced by the Bayesian Information Criterion (BIC) distance to obtain a lower misdetection rate. In the second step, three different confidence measures are attached to the speaker change candidates according to the distance curve derived from the first step. False-alarm peaks with relatively low fused confidence scores are eliminated from the set of potential speaker turns. In the third step, the remaining speaker turn candidates are validated using the BIC. Compared with DISTBIC and MDISTBIC, CM-DISTBIC, evaluated on broadcast news corpora, improves the F-score by more than 11.5% and 8.9%, respectively.
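The BIC distance mentioned in the abstract is commonly computed as a ΔBIC score between two adjacent analysis windows. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes each window is modeled by a single Gaussian with a diagonal covariance (a simplification chosen so the example needs only the standard library), and the names `delta_bic`, `log_det_diag_cov`, `make_frames`, and `penalty_weight` are hypothetical. The confidence measures and their fusion are not specified in the abstract, so they are not sketched here.

```python
import math
import random


def log_det_diag_cov(frames):
    """Log-determinant of a diagonal covariance: the sum of log variances."""
    n = len(frames)
    dims = len(frames[0])
    total = 0.0
    for j in range(dims):
        column = [frame[j] for frame in frames]
        mean = sum(column) / n
        var = sum((v - mean) ** 2 for v in column) / n  # maximum-likelihood estimate
        total += math.log(var)
    return total


def delta_bic(left, right, penalty_weight=1.0):
    """Delta-BIC between two adjacent windows of feature frames.

    A positive value favors modeling the windows separately, which
    suggests a speaker change at the boundary between them.
    """
    n1, n2 = len(left), len(right)
    n = n1 + n2
    dims = len(left[0])
    # Log-likelihood gain of two Gaussians over one pooled Gaussian.
    gain = 0.5 * (
        n * log_det_diag_cov(left + right)
        - n1 * log_det_diag_cov(left)
        - n2 * log_det_diag_cov(right)
    )
    # Assumption: diagonal model, so one mean and one variance per dimension.
    n_params = 2 * dims
    penalty = 0.5 * penalty_weight * n_params * math.log(n)
    return gain - penalty


def make_frames(rng, count, dims, mean):
    """Synthetic feature frames drawn from a unit-variance Gaussian."""
    return [[rng.gauss(mean, 1.0) for _ in range(dims)] for _ in range(count)]


rng = random.Random(0)
# Two windows from the same synthetic "speaker".
same = delta_bic(make_frames(rng, 200, 4, 0.0), make_frames(rng, 200, 4, 0.0))
# Two windows whose means differ, standing in for a speaker change.
diff = delta_bic(make_frames(rng, 200, 4, 0.0), make_frames(rng, 200, 4, 3.0))
print(same < 0, diff > 0)
```

In a DISTBIC-style pipeline, a score of this kind would be evaluated along the signal to form a distance curve whose peaks serve as speaker change candidates. The window length, feature type, and `penalty_weight` are tuning choices that the abstract does not state.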


Pages: 728-731

Page count: 4

