Phone Adaptive Training for Speaker Diarization

被引:0
|
作者
Bozonnet, Simon [1 ]
Vipperla, Ravichander [1 ]
Evans, Nicholas [1 ]
机构
[1] EURECOM, F-06904 Sophia Antipolis, France
关键词
Speaker Diarization; Phone Adaptive Training; Speaker Discrimination;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The linguistic content of a speech signal is a source of unwanted variation which can degrade speaker diarization performance. This paper presents our latest work to reduce its impact. The new approach, referred to as Phone Adaptive Training (PAT), is analogous to speaker adaptive training used in automatic speech recognition. We report an oracle experiment which shows that PAT has the potential to deliver a 33% relative improvement in the diarization error rate over our baseline system. Practical experiments show significant improvements across two standard, independent evaluation datasets.
引用
收藏
页码:494 / 497
页数:4
相关论文
共 50 条
  • [1] SPEAKER DIARIZATION WITH UNSUPERVISED TRAINING FRAMEWORKL
    Le Lan, Gael
    Meignier, Sylvain
    Charlet, Delphine
    Deleglise, Paul
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5560 - 5564
  • [2] PHONE ADAPTIVE TRAINING FOR SHORT-DURATION SPEAKER VERIFICATION
    Soldi, Giovanni
    Bozonnet, Simon
    Beaugeant, Christophe
    Evans, Nicholas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2107 - 2111
  • [3] Discriminative Training for Hierarchical Clustering in Speaker Diarization
    Vinyals, Oriol
    Friedland, Gerald
    Morgan, Nelson
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2326 - +
  • [4] ADAPTIVE AND ONLINE SPEAKER DIARIZATION FOR MEETING DATA
    Soldi, Giovanni
    Beaugeant, Christophe
    Evans, Nicholas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2112 - 2116
  • [5] CHANNEL ADVERSARIAL TRAINING FOR SPEAKER VERIFICATION AND DIARIZATION
    Luu, Chau
    Bell, Peter
    Renals, Steve
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7094 - 7098
  • [6] Interrelate Training and Clustering for Online Speaker Diarization
    Chen, Yifan
    Cheng, Gaofeng
    Yang, Runyan
    Zhang, Pengyuan
    Yan, Yonghong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1352 - 1364
  • [7] An Adaptive Method for Cross-Recording Speaker Diarization
    Le Lan, Gael
    Charlet, Delphine
    Larcher, Anthony
    Meignier, Sylvain
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1821 - 1832
  • [8] Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription
    Silovsky, Jan
    Cerva, Petr
    Zdansky, Jindrich
    Nouza, Jan
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 478 - 481
  • [9] AN ADAPTIVE INITIALIZATION METHOD FOR SPEAKER DIARIZATION BASED ON PROSODIC FEATURES
    Imseng, David
    Friedland, Gerald
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4946 - 4949
  • [10] Adaptive speaker diarization of broadcast news based on factor analysis
    Desplanques, Brecht
    Demuynck, Kris
    Martens, Jean-Pierre
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 72 - 93