Investigation of Segmentation in i-Vector Based Speaker Diarization of Telephone Speech

被引:11
|
作者
Zajic, Zbynek [1 ]
Kunesova, Marie [1 ,2 ]
Radova, Vlasta [1 ,2 ]
机构
[1] Univ West Bohemia, Fac Sci Appl, NTIS, Univ 8, Plzen 30614, Czech Republic
[2] Univ West Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker diarization; Speaker change detection; i-vector; Segmentation;
D O I
10.1007/978-3-319-43958-7_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of this paper is to evaluate the contribution of speaker change detection (SCD) to the performance of a speaker diarization system in the telephone domain. We compare the overall performance of an i-vector based system using both SCD-based segmentation and a naive constant length segmentation with overlapping segments. The diarization system performs K-means clustering of i-vectors which represent the individual segments, followed by a resegmentation step. Experiments were done on the English part of the CallHome corpus. The final results indicate that the use of speaker change detection is beneficial, but the differences between the two segmentation approaches are diminished by the use of resegmentation.
引用
收藏
页码:411 / 418
页数:8
相关论文
共 50 条
  • [1] I-vector similarity based speech segmentation for interested speaker to speaker diarization system
    Bae, Ara
    Yoon, Ki-mu
    Jung, Jaehee
    Chung, Bokyung
    Kim, Wooil
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (05): : 461 - 467
  • [2] VARIATIONAL BAYES BASED I-VECTOR FOR SPEAKER DIARIZATION OF TELEPHONE CONVERSATIONS
    Zheng, Rong
    Zhang, Ce
    Zhang, Shanshan
    Xu, Bo
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [3] Improved i-Vector Representation for Speaker Diarization
    Xu, Yan
    McLoughlin, Ian
    Song, Yan
    Wu, Kui
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2016, 35 (09) : 3393 - 3404
  • [4] Improved i-Vector Representation for Speaker Diarization
    Yan Xu
    Ian McLoughlin
    Yan Song
    Kui Wu
    [J]. Circuits, Systems, and Signal Processing, 2016, 35 : 3393 - 3404
  • [5] An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech
    Senoussaoui, Mohammed
    Kenny, Patrick
    Dehak, Najim
    Dumouchel, Pierre
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 28 - 33
  • [6] ONLINE SPEAKER DIARIZATION USING ADAPTED I-VECTOR TRANSFORMS
    Zhu, Weizhong
    Pelecanos, Jason
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5045 - 5049
  • [7] SPEAKER DIARIZATION WITH PLDA I-VECTOR SCORING AND UNSUPERVISED CALIBRATION
    Sell, Gregory
    Garcia-Romero, Daniel
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 413 - 417
  • [8] Integrating Online I-vector extractor with Information Bottleneck based Speaker Diarization system
    Madikeri, Srikanth
    Himawan, Ivan
    Motlicek, Petr
    Ferras, Marc
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3105 - 3109
  • [9] SPEAKER SEGMENTATION USING I-VECTOR IN MEETINGS DOMAIN
    Neri, Leonardo V.
    Pinheiro, Hector N. B.
    Ren, Tsang Ing
    Cavalcanti, George D. da C.
    Adami, Andre G.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5455 - 5459
  • [10] Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech
    Zajic, Zbynek
    Zelinka, Jan
    Mueller, Ludek
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 555 - 563