Using confidence measures to evaluate the speaker turns in speaker segmentation

被引:0
|
作者
Chu, Wei [1 ]
Liu, Jia [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a speaker segmentation algorithm using confidence measures, named CM-DISTBIC, which inserts a confidence score computation and fusion procedure into the two-step DISTBIC and MDISTBIC. In the first step, symmetric Kullback-Leibler distance (KL2) distance is replaced by Bayesian Information Criterion (BIC) distance to obtain a lower misdetection rate. In the second step, three different confidence measures are attached to the speaker change candidates according to the distance curve derived from the first step. False alarm peaks with relatively low fused confidence scores are eliminate from the set of potential speak turns. In. the third step, speaker turn candidates are validated through BIC criterion. Compared with DISTBIC and MDISTBIC, the CM-DISTBIC conducted on the broadcast news corpora receives an increase of more than 11.5% and 8.9% in F-score respectively.
引用
收藏
页码:728 / 731
页数:4
相关论文
共 50 条
  • [41] SPEAKER MODEL ADAPTATION BASED ON CONFIDENCE SCORE
    Mengusoglu, Erhan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2015, 22 (04): : 873 - 878
  • [42] Normative data for the personal report of confidence as a speaker
    Phillips, GC
    Jones, GE
    Rieger, EJ
    Snell, JB
    JOURNAL OF ANXIETY DISORDERS, 1997, 11 (02) : 215 - 220
  • [43] Speaker segmentation for intelligent responsive space
    Kwon, Soonil
    HUMAN-COMPUTER INTERACTION, PT 3, PROCEEDINGS, 2007, 4552 : 385 - 392
  • [44] Estimating and evaluating confidence for forensic speaker recognition
    Campbell, WM
    Reynolds, DA
    Campbell, JP
    Brady, KJ
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 717 - 720
  • [45] Unsupervised speaker segmentation in telephone conversations
    Cohen, A
    Lapidus, V
    NINETEENTH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, 1996, : 102 - 105
  • [46] Speaker trustworthiness: Shall confidence match evidence?
    Pozzi, Melinda
    Mazzarella, Diana
    PHILOSOPHICAL PSYCHOLOGY, 2024, 37 (01) : 102 - 125
  • [47] Speaker adaptation for telephony data using speaker clustering
    Wu, C
    Lubensky, D
    Wang, ZH
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 768 - 771
  • [48] Speaker independent acoustic modeling using speaker normalization
    Ishii, J
    Fukada, T
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 97 - 100
  • [49] Uncertainties of Measures in Speaker Recognition Evaluation
    Wu, Jin Chu
    Martin, Alvin F.
    Greenberg, Craig S.
    Kacker, Raghu N.
    ACTIVE AND PASSIVE SIGNATURES II, 2011, 8040
  • [50] Unsupervised speaker adaptation using reference speaker weighting
    Lai, Tsz-Chung
    Mak, Brian
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 380 - +