pnf Improvements in speaker diarization system

被引:0
|
作者
Fu, Rong [1 ]
Benest, Ian D. [1 ]
机构
[1] Univ York, Dept Comp Sci, York YO10 5DD, N Yorkshire, England
关键词
speaker diarization; model complexity selection; universal background model;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes an automatic speaker diarization system for natural, multi-speaker meeting conversations using one central microphone. It is based on the ICSI-SRI Fall 2004 diarization system (Wooters et al., 2004), but it has a number of significant modifications. The new system is robust to different acoustic environments - it requires neither pre-training models nor development sets to initialize the parameters. It determines the model complexity automatically. It adapts the segment model from a Universal Background Model (UBM), and uses the cross-likelihood ratio (CLR) instead of the Bayesian Information Criterion (BIC) for merging. Finally it uses an intra-cluster/inter-cluster ratio as the stopping criterion. Altogether this reduces the speaker diarization error rate from 25.36% to 21.37% compared to the baseline system (Wooters et al., 2004).
引用
收藏
页码:317 / +
页数:2
相关论文
共 50 条
  • [31] New Advances in Speaker Diarization
    Aronowitz, Hagai
    Zhu, Weizhong
    Suzuki, Masayuki
    Kurata, Gakuto
    Hoory, Ron
    [J]. INTERSPEECH 2020, 2020, : 279 - 283
  • [32] WHERE ARE THE CHALLENGES IN SPEAKER DIARIZATION?
    Sinclair, Mark
    King, Simon
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7741 - 7745
  • [33] Speaker diarization system using HXLPS and deep neural network
    Ramaiah, V. Subba
    Rao, R. Rajeswara
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2018, 57 (01) : 255 - 266
  • [34] The Blame Game: Performance Analysis of Speaker Diarization System Components
    Huijbregts, Marijn
    Wooters, Chuck
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2237 - +
  • [35] KL-HMM BASED SPEAKER DIARIZATION SYSTEM FOR MEETINGS
    Madikeri, Srikanth
    Bourlard, Herve
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4435 - 4439
  • [36] Towards a complete Binary Key System for the Speaker Diarization Task
    Delgado, Hector
    Fredouille, Corinne
    Serrano, Javier
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 572 - 576
  • [37] SPEAKER DIARIZATION IN MEETING AUDIO
    Nwe, Tin Lay
    Sun, Hanwu
    Li, Haizhou
    Rahardja, Susanto
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4073 - 4076
  • [38] The ICSI RT07s speaker diarization system
    Wooters, Chuck
    Huijbregts, Marijn
    [J]. MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 509 - 519
  • [39] SYSTEM FUSION AND SPEAKER LINKING FOR LONGITUDINAL DIARIZATION OF TV SHOWS
    Ferras, Marc
    Madikeri, Srikanth
    Motlicek, Petr
    Bourlard, Herve
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5495 - 5499
  • [40] A NOVEL METHOD FOR SELECTING THE NUMBER OF CLUSTERS IN A SPEAKER DIARIZATION SYSTEM
    Lopez-Otero, Paula
    Docio-Fernandez, Laura
    Garcia-Mateo, Carmen
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 656 - 660