Iterative PLDA Adaptation for Speaker Diarization

Cited by: 7
Authors
Le Lan, Gael [1, 2]
Charlet, Delphine [1]
Larcher, Anthony [2]
Meignier, Sylvain [2]
Affiliations
[1] Orange Labs, Paris, France
[2] Univ Le Mans, LIUM, Le Mans, France
Keywords
speaker diarization; PLDA; unsupervised training; domain adaptation; iterative training
DOI
10.21437/Interspeech.2016-572
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
This paper investigates iterative PLDA adaptation for cross-show speaker diarization applied to small collections of French TV archives based on an i-vector framework. Using the target collection itself for unsupervised adaptation, PLDA parameters are iteratively tuned while score normalization is applied for convergence. Performance is compared across combinations of target and external data for training and adaptation. The experiments on two distinct target corpora show that the proposed framework can gradually improve an existing system trained on external annotated data. Such results indicate that performing speaker diarization on small collections of unlabeled audio archives should only rely on the availability of a sufficient bootstrap system, which can be incrementally adapted to every target collection. The proposed framework also widens the range of acceptable speaker clustering thresholds for a given performance objective.
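The adaptation loop summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: full PLDA is replaced here by a simpler stand-in (within-class covariance whitening followed by cosine scoring), score normalization is reduced to a global z-normalization of the score matrix, and the data are synthetic. All function names, the interpolation weight `alpha`, and the threshold value are assumptions made for illustration only.

```python
import numpy as np

def whitening_from_within_cov(W, eps=1e-6):
    # Build P such that P @ W @ P.T is (approximately) the identity.
    vals, vecs = np.linalg.eigh(W + eps * np.eye(W.shape[0]))
    return (vecs / np.sqrt(vals)).T

def score_matrix(X, W):
    # Stand-in for PLDA scoring: cosine similarity after whitening,
    # followed by a global z-normalization of the off-diagonal scores.
    Y = X @ whitening_from_within_cov(W).T
    Y /= np.linalg.norm(Y, axis=1, keepdims=True)
    S = Y @ Y.T
    off = S[~np.eye(len(S), dtype=bool)]
    return (S - off.mean()) / off.std()

def cluster(S, threshold):
    # Average-linkage agglomerative clustering, stopped at `threshold`.
    clusters = [[i] for i in range(len(S))]
    while len(clusters) > 1:
        best, pair = -np.inf, None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                s = S[np.ix_(clusters[a], clusters[b])].mean()
                if s > best:
                    best, pair = s, (a, b)
        if best < threshold:
            break
        a, b = pair
        merged = clusters.pop(b)  # b > a, so index a stays valid
        clusters[a].extend(merged)
    labels = np.empty(len(S), dtype=int)
    for k, members in enumerate(clusters):
        labels[members] = k
    return labels

def within_cov(X, labels):
    # Within-cluster covariance computed from the pseudo-labels.
    centered = X.copy()
    for k in np.unique(labels):
        centered[labels == k] -= X[labels == k].mean(axis=0)
    return centered.T @ centered / len(X)

def iterative_adaptation(X, W_boot, threshold=1.0, n_iter=3, alpha=0.5):
    # Cluster with the current model, re-estimate it from the resulting
    # pseudo-labels, interpolate with the bootstrap model, and repeat.
    W = W_boot
    for _ in range(n_iter):
        labels = cluster(score_matrix(X, W), threshold)
        W = alpha * within_cov(X, labels) + (1 - alpha) * W_boot
    return labels, W

# Synthetic "i-vectors": 4 hypothetical speakers, 15 segments each.
rng = np.random.default_rng(0)
dim, n_spk, per_spk = 10, 4, 15
means = rng.normal(size=(n_spk, dim)) * 3
X = np.vstack([m + rng.normal(size=(per_spk, dim)) for m in means])

# The bootstrap model is an identity covariance, standing in for a
# system trained on external annotated data.
labels, W = iterative_adaptation(X, np.eye(dim))
print(len(np.unique(labels)), "clusters found")
```

In this sketch the bootstrap covariance stays in the interpolation at every iteration, so the adapted model cannot drift arbitrarily far from the initial system; whether the paper uses a comparable safeguard is not stated in the abstract.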
Pages: 2175-2179
Number of pages: 5
Related papers
50 in total
  • [1] Domain Adaptation of PLDA models in Broadcast Diarization by means of Unsupervised Speaker Clustering
    Vinals, Ignacio
    Ortega, Alfonso
    Villalba, Jesus
    Miguel, Antonio
    Lleida, Eduardo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2829 - 2833
  • [2] VARIATIONAL BAYESIAN PLDA FOR SPEAKER DIARIZATION IN THE MGB CHALLENGE
    Villalba, Jesus
    Ortega, Alfonso
    Miguel, Antonio
    Lleida, Eduardo
    [J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 667 - 674
  • [3] Unsupervised adaptation of PLDA models for broadcast diarization
    Ignacio Viñals
    Alfonso Ortega
    Jesús Villalba
    Antonio Miguel
    Eduardo Lleida
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [4] Unsupervised adaptation of PLDA models for broadcast diarization
    Vinals, Ignacio
    Ortega, Alfonso
    Villalba, Jesus
    Miguel, Antonio
    Lleida, Eduardo
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2019, 2019 (01)
  • [5] PLDA-based Clustering for Speaker Diarization of Broadcast Streams
    Silovsky, Jan
    Prazak, Jan
    Cerva, Petr
    Zdansky, Jindrich
    Nouza, Jan
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2920 - +
  • [6] SPEAKER DIARIZATION WITH PLDA I-VECTOR SCORING AND UNSUPERVISED CALIBRATION
    Sell, Gregory
    Garcia-Romero, Daniel
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 413 - 417
  • [7] Full-Posterior PLDA based Speaker Diarization of telephone conversations
    Chen, Yanni
    Yan, Yonghong
    Hong, Wei
    Guan, Songzan
    [J]. PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 840 - 844
  • [8] Unsupervised Bayesian Adaptation of PLDA for Speaker Verification
    Borgstrom, Bengt J.
    [J]. INTERSPEECH 2021, 2021, : 1039 - 1043
  • [9] On the Use of Spectral and Iterative Methods for Speaker Diarization
    Shum, Stephen
    Dehak, Najim
    Glass, Jim
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 482 - 485
  • [10] A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION
    Wang, Qiongqiong
    Okabe, Koji
    Lee, Kong Aik
    Koshinaka, Takafumi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6619 - 6623