Iterative PLDA Adaptation for Speaker Diarization

被引：7

作者：

Le Lan, Gael ^{[1
,2
]}

Charlet, Delphine ^{[1
]}

Larcher, Anthony ^{[2
]}

Meignier, Sylvain ^{[2
]}

机构：

[1] Orange Labs, Paris, France

[2] Univ Le Mans, LIUM, Le Mans, France

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

关键词：

speaker diarization; PLDA; unsupervised training; domain adaptation; iterative training;

D O I：

10.21437/Interspeech.2016-572

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper investigates iterative PLDA adaptation for cross show speaker diarization applied to small collections of French TV archives based on an i-vector framework. Using the target collection itself for unsupervised adaptation, PLDA parameters are iteratively tuned while score normalization is applied for convergence. Performances are compared, using combinations of target and external data for training and adaptation. The experiments on two distinct target corpora show that the proposed framework can gradually improve an existing system trained on external annotated data. Such results indicate that performing speaker diarization on small collections of unlabeled audio archives should only rely on the availability of a sufficient bootstrap system, which can be incrementally adapted to every target collection. The proposed framework also widens the range of acceptable speaker clustering thresholds for a given performance objective.

引用

页码：2175 / 2179

页数：5

共 50 条

[1] Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
Vinals, Ignacio
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2829 - 2833
[2] VARIATIONAL BAYESIAN PLDA FOR SPEAKER DIARIZATION IN THE MGB CHALLENGE
Villalba, Jesus
Ortega, Alfonso
Miguel, Antonio
Lleida, Eduardo
[J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 667 - 674
[3] Unsupervised adaptation of PLDA models for broadcast diarization
Ignacio Viñals
Alfonso Ortega
Jesús Villalba
Antonio Miguel
Eduardo Lleida
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2019
[4] Unsupervised adaptation of PLDA models for broadcast diarization
Vinals, Ignacio
Ortega, Alfonso
Villalba, Jesus
Miguel, Antonio
Lleida, Eduardo
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (01)
[5] PLDA-based Clustering for Speaker Diarization of Broadcast Streams
Silovsky, Jan
Prazak, Jan
Cerva, Petr
Zdansky, Jindrich
Nouza, Jan
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2920 - +
[6] SPEAKER DIARIZATION WITH PLDA I-VECTOR SCORING AND UNSUPERVISED CALIBRATION
Sell, Gregory
Garcia-Romero, Daniel
[J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 413 - 417
[7] Full-Posterior PLDA based Speaker Diarization of telephone conversations
Chen, Yanni
Yan, Yonghong
Hong, Wei
Guan, Songzan
[J]. PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 840 - 844
[8] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
Borgstrorn, Bengt J.
[J]. INTERSPEECH 2021, 2021, : 1039 - 1043
[9] On the Use of Spectral and Iterative Methods for Speaker Diarization
Shum, Stephen
Dehak, Najim
Glass, Jim
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 482 - 485
[10] A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION
Wang, Qiongqiong
Okabe, Koji
Lee, Kong Aik
Koshinaka, Takafumi
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6619 - 6623

← 1 2 3 4 5 →