Investigation of Segmentation in i-Vector Based Speaker Diarization of Telephone Speech

Cited by: 11

Authors:

Zajic, Zbynek [1]

Kunesova, Marie [1,2]

Radova, Vlasta [1,2]

Affiliations:

[1] Univ West Bohemia, Fac Sci Appl, NTIS, Univ 8, Plzen 30614, Czech Republic

[2] Univ West Bohemia, Fac Sci Appl, Dept Cybernet, Univ 8, Plzen 30614, Czech Republic

Source:

SPEECH AND COMPUTER | 2016 / Vol. 9811

Keywords:

Speaker diarization; Speaker change detection; i-vector; Segmentation

DOI:

10.1007/978-3-319-43958-7_49

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

The goal of this paper is to evaluate the contribution of speaker change detection (SCD) to the performance of a speaker diarization system in the telephone domain. We compare the overall performance of an i-vector based system using both SCD-based segmentation and a naive constant length segmentation with overlapping segments. The diarization system performs K-means clustering of i-vectors which represent the individual segments, followed by a resegmentation step. Experiments were done on the English part of the CallHome corpus. The final results indicate that the use of speaker change detection is beneficial, but the differences between the two segmentation approaches are diminished by the use of resegmentation.
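
The two stages named in the abstract, constant-length segmentation with overlapping segments and K-means clustering of per-segment vectors, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the 2-second window, the 1-second overlap, the two-cluster setting, the deterministic centroid initialization, and the toy 2-D vectors standing in for i-vectors are all assumptions made for illustration, and i-vector extraction and resegmentation are omitted.

```python
def constant_length_segments(total_duration, seg_len=2.0, overlap=1.0):
    """Cut [0, total_duration] into windows of seg_len seconds.

    Consecutive windows share `overlap` seconds. The default values are
    illustrative assumptions, not settings taken from the paper.
    """
    if seg_len <= 0 or not (0 <= overlap < seg_len):
        raise ValueError("need seg_len > 0 and 0 <= overlap < seg_len")
    step = seg_len - overlap
    segments = []
    start = 0.0
    while start < total_duration:
        end = min(start + seg_len, total_duration)
        segments.append((start, end))
        if end >= total_duration:
            break  # the last window already reaches the end of the recording
        start += step
    return segments


def kmeans(vectors, k=2, iters=20):
    """Plain K-means with squared Euclidean distance.

    Centroids are initialized from the first k vectors so the result is
    deterministic (an assumption made for this sketch).
    """
    centroids = [list(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iters):
        # Assignment step: each vector goes to its nearest centroid.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centroids[c])))
            for v in vectors
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Toy stand-ins for per-segment i-vectors: two well-separated groups.
toy_vectors = [(0.0, 0.1), (5.0, 5.1), (0.2, 0.0), (5.1, 4.9)]
segments = constant_length_segments(5.0)
speaker_labels = kmeans(toy_vectors, k=2)
```

In this sketch each segment would be mapped to one vector, and the cluster label of that vector is taken as the speaker label of the segment; overlapping windows mean that most time points are covered by more than one segment.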

Pages: 411-418

Page count: 8
