Using confidence measures to evaluate the speaker turns in speaker segmentation

被引:0
|
作者
Chu, Wei [1 ]
Liu, Jia [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a speaker segmentation algorithm using confidence measures, named CM-DISTBIC, which inserts a confidence score computation and fusion procedure into the two-step DISTBIC and MDISTBIC. In the first step, symmetric Kullback-Leibler distance (KL2) distance is replaced by Bayesian Information Criterion (BIC) distance to obtain a lower misdetection rate. In the second step, three different confidence measures are attached to the speaker change candidates according to the distance curve derived from the first step. False alarm peaks with relatively low fused confidence scores are eliminate from the set of potential speak turns. In. the third step, speaker turn candidates are validated through BIC criterion. Compared with DISTBIC and MDISTBIC, the CM-DISTBIC conducted on the broadcast news corpora receives an increase of more than 11.5% and 8.9% in F-score respectively.
引用
下载
收藏
页码:728 / 731
页数:4
相关论文
共 50 条
  • [1] Confidence Measures for Speaker Segmentation and their Relation to Speaker Verification
    Vaquero, Carlos
    Ortega, Alfonso
    Villalba, Jesus
    Miguel, Antonio
    Lleida, Eduardo
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2310 - 2313
  • [2] Confidence and reliability measures in speaker verification
    Richiardi, Jonas
    Drygajlo, Andrzej
    Prodanov, Plamen
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2006, 343 (06): : 574 - 595
  • [3] Speaker verification with confidence and reliability measures
    Richiardi, Jonas
    Prodanov, Plamen
    Drygajlo, Andrzej
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 641 - 644
  • [4] SPEAKER SEGMENTATION USING DEEP SPEAKER VECTORS FOR FAST SPEAKER CHANGE SCENARIOS
    Wang, Renyu
    Gu, Mingliang
    Li, Lantian
    Xu, Mingxing
    Zheng, Thoms Fang
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5420 - 5424
  • [5] Confidence measures in multiple pronunciations modeling for speaker verification
    BenZeghiba, MF
    Bourlard, H
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 389 - 392
  • [6] Impostor detection in speaker recognition using confusion-based confidence measures
    Kim, Kyuhong
    Kim, Hoirin
    Hahn, Minsoo
    ETRI JOURNAL, 2006, 28 (06) : 811 - 814
  • [7] Stream-based speaker segmentation using speaker factors and eigenvoices
    Castaldo, Fabio
    Colibro, Daniele
    Dalmasso, Emanuele
    Laface, Pietro
    Vair, Claudio
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4133 - +
  • [8] Improving Speaker Segmentation via Speaker Identification and Text Segmentation
    Li, Runxin
    Schultz, Tanja
    Jin, Qin
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 928 - 931
  • [9] Speaker Verification on Summed-Channel Conditions with Confidence Measures
    Aviles Casco, Carlos Vaquero
    Villalba Lopez, Jesus
    Ortega Gimenez, Alfonso
    Lleida Solano, Eduardo
    COMPUTACION Y SISTEMAS, 2011, 15 (01): : 27 - 37
  • [10] Speaker Segmentation Using Adapted GMMs
    Bellagha, Mohamed Lazhar
    Labidi, Mohamed
    Maraoui, Mohsen
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,