Using confidence measures to evaluate the speaker turns in speaker segmentation

被引:0
|
作者
Chu, Wei [1 ]
Liu, Jia [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a speaker segmentation algorithm using confidence measures, named CM-DISTBIC, which inserts a confidence score computation and fusion procedure into the two-step DISTBIC and MDISTBIC. In the first step, symmetric Kullback-Leibler distance (KL2) distance is replaced by Bayesian Information Criterion (BIC) distance to obtain a lower misdetection rate. In the second step, three different confidence measures are attached to the speaker change candidates according to the distance curve derived from the first step. False alarm peaks with relatively low fused confidence scores are eliminate from the set of potential speak turns. In. the third step, speaker turn candidates are validated through BIC criterion. Compared with DISTBIC and MDISTBIC, the CM-DISTBIC conducted on the broadcast news corpora receives an increase of more than 11.5% and 8.9% in F-score respectively.
引用
下载
收藏
页码:728 / 731
页数:4
相关论文
共 50 条
  • [31] UBM based speaker segmentation and clustering for 2-speaker detection
    Deng, Jing
    Zheng, Thomas Fang
    Wu, Wenhu
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 116 - +
  • [32] BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization
    Cheng, Shih-Sian
    Wang, Hsin-Min
    Fu, Hsin-Chia
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 141 - 157
  • [33] A new speaker change detection method for two-speaker segmentation
    Adami, AG
    Kajarekar, SS
    Hermansky, H
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 3908 - 3911
  • [34] Using Phoneme Recognition and Text-dependent Speaker Verification to Improve Speaker Segmentation for Chinese Speech
    Wang, Gang
    Wu, Xiaojun
    Zheng, Thomas Fang
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1457 - 1460
  • [35] Hierarchical speaker identification using speaker clustering
    Sun, B
    Liu, WJ
    Zhong, QH
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 299 - 304
  • [36] Investigating the contribution of speaker attributes to speaker separability using disentangled speaker representations
    Luu, Chau
    Renals, Steve
    Bell, Peter
    INTERSPEECH 2022, 2022, : 610 - 614
  • [37] SPEAKER SEGMENTATION USING I-VECTOR IN MEETINGS DOMAIN
    Neri, Leonardo V.
    Pinheiro, Hector N. B.
    Ren, Tsang Ing
    Cavalcanti, George D. da C.
    Adami, Andre G.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5455 - 5459
  • [38] Speaker Segmentation Using Parallel Fusion between three Classifiers
    Ouamour, S.
    Sayoud, H.
    Guerti, M.
    2009 3RD INTERNATIONAL CONFERENCE ON SIGNALS, CIRCUITS AND SYSTEMS (SCS 2009), 2009, : 603 - +
  • [39] ALGORITHM TURNS TO SPEAKER-INDEPENDENT WORD RECOGNITION
    OHR, S
    ELECTRONIC DESIGN, 1983, 31 (19) : 40 - 41
  • [40] Offline speaker segmentation using genetic algorithms and mutual information
    Salcedo-Sanz, S
    Gallardo-Antolín, A
    Leiva-Murillo, JM
    Bousoño-Calzón, C
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2006, 10 (02) : 175 - 186