Full-Posterior PLDA based Speaker Diarization of telephone conversations

Cited by: 0
Authors
Chen, Yanni [1]
Yan, Yonghong [1]
Hong, Wei [1]
Guan, Songzan [1]
Affiliation
[1] Chinese Academy of Sciences, Key Laboratory of Speech Acoustics and Content Understanding, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
speaker diarization; i-vector; i-vector extraction; probabilistic linear discriminant analysis
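DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic and Communication Technology]
Discipline classification codes
0808; 0809
Abstract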
Conventional speaker diarization systems based on factor analysis differ mainly in how they score i-vectors, for example with cosine scoring or the more recent probabilistic linear discriminant analysis (PLDA) scoring. During clustering, however, PLDA scoring becomes less accurate on short speech segments, and the problem worsens when segment durations vary arbitrarily. In this paper, we cluster with a modified PLDA model, full posterior distribution PLDA (FP-PLDA), instead of the standard PLDA model (Std-PLDA). The new model exploits the intrinsic uncertainty of i-vector extraction. Experiments show that FP-PLDA is especially effective on short and variable-duration speech segments: it reduces the diarization error rate by around 41% relative to the cosine scoring system and by 30.98% relative to the standard PLDA system.
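As a rough illustration of two quantities the abstract mentions, the sketch below computes a cosine score between two i-vectors and a relative reduction in diarization error rate (DER). It is not the authors' implementation: the function names, the toy vectors, and the DER values are hypothetical and chosen only to show the arithmetic, and it does not implement PLDA or FP-PLDA scoring.

```python
import math


def cosine_score(a, b):
    """Cosine similarity between two i-vectors (equal-length sequences of floats)."""
    if len(a) != len(b):
        raise ValueError("i-vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine score is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def relative_reduction(baseline_der, new_der):
    """Relative DER reduction: (baseline - new) / baseline."""
    if baseline_der <= 0.0:
        raise ValueError("baseline DER must be positive")
    return (baseline_der - new_der) / baseline_der


if __name__ == "__main__":
    # Hypothetical 3-dimensional i-vectors; real i-vectors have far more dimensions.
    segment_a = [0.2, -0.5, 0.1]
    segment_b = [0.3, -0.4, 0.0]
    print("cosine score:", cosine_score(segment_a, segment_b))

    # Hypothetical DER values (in percent), not taken from the paper.
    print("relative reduction:", relative_reduction(10.0, 5.9))
```

A relative reduction is a fraction of the baseline error, so a figure such as "around 41%" describes how much of the baseline system's DER is removed, not an absolute drop in percentage points.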
Pages: 840-844
Page count: 5