Using confidence measures to evaluate the speaker turns in speaker segmentation

被引：0

作者：

Chu, Wei ^{[1
]}

Liu, Jia ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

来源：

2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3 | 2007年

关键词：

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In this paper, we propose a speaker segmentation algorithm using confidence measures, named CM-DISTBIC, which inserts a confidence score computation and fusion procedure into the two-step DISTBIC and MDISTBIC. In the first step, symmetric Kullback-Leibler distance (KL2) distance is replaced by Bayesian Information Criterion (BIC) distance to obtain a lower misdetection rate. In the second step, three different confidence measures are attached to the speaker change candidates according to the distance curve derived from the first step. False alarm peaks with relatively low fused confidence scores are eliminate from the set of potential speak turns. In. the third step, speaker turn candidates are validated through BIC criterion. Compared with DISTBIC and MDISTBIC, the CM-DISTBIC conducted on the broadcast news corpora receives an increase of more than 11.5% and 8.9% in F-score respectively.

引用

下载

页码：728 / 731

页数：4

共 50 条

[1] Confidence Measures for Speaker Segmentation and their Relation to Speaker Verification
Vaquero, Carlos
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2310 - 2313
[2] Confidence and reliability measures in speaker verification
Richiardi, Jonas
Drygajlo, Andrzej
Prodanov, Plamen
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2006, 343 (06): : 574 - 595
[3] Speaker verification with confidence and reliability measures
Richiardi, Jonas
Prodanov, Plamen
Drygajlo, Andrzej
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 641 - 644
[4] SPEAKER SEGMENTATION USING DEEP SPEAKER VECTORS FOR FAST SPEAKER CHANGE SCENARIOS
Wang, Renyu
Gu, Mingliang
Li, Lantian
Xu, Mingxing
Zheng, Thoms Fang
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5420 - 5424
[5] Confidence measures in multiple pronunciations modeling for speaker verification
BenZeghiba, MF
Bourlard, H
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 389 - 392
[6] Impostor detection in speaker recognition using confusion-based confidence measures
Kim, Kyuhong
Kim, Hoirin
Hahn, Minsoo
ETRI JOURNAL, 2006, 28 (06) : 811 - 814
[7] Stream-based speaker segmentation using speaker factors and eigenvoices
Castaldo, Fabio
Colibro, Daniele
Dalmasso, Emanuele
Laface, Pietro
Vair, Claudio
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4133 - +
[8] Improving Speaker Segmentation via Speaker Identification and Text Segmentation
Li, Runxin
Schultz, Tanja
Jin, Qin
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 928 - 931
[9] Speaker Verification on Summed-Channel Conditions with Confidence Measures
Aviles Casco, Carlos Vaquero
Villalba Lopez, Jesus
Ortega Gimenez, Alfonso
Lleida Solano, Eduardo
COMPUTACION Y SISTEMAS, 2011, 15 (01): : 27 - 37
[10] Speaker Segmentation Using Adapted GMMs
Bellagha, Mohamed Lazhar
Labidi, Mohamed
Maraoui, Mohsen
2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,

← 1 2 3 4 5 →