A Comparative Study of Bottom-Up and Top-Down Approaches to Speaker Diarization

被引:25
|
作者
Evans, Nicholas [1 ]
Bozonnet, Simon [1 ]
Wang, Dong [1 ]
Fredouille, Corinne [2 ]
Troncy, Raphael [1 ]
机构
[1] EURECOM, Dept Multimedia Commun, F-06560 Sophia Antipolis, France
[2] Univ Avignon, CERI LIA, F-84911 Avignon, France
关键词
Clustering; rich transcription; segmentation; speaker diarization; FEATURES; IMPROVE;
D O I
10.1109/TASL.2011.2159710
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a theoretical framework to analyze the relative merits of the two most general, dominant approaches to speaker diarization involving bottom-up and top-down hierarchical clustering. We present an original qualitative comparison which argues how the two approaches are likely to exhibit different behavior in speaker inventory optimization and model training: bottom-up approaches will capture comparatively purer models and will thus be more sensitive to nuisance variation such as that related to the speech content; top-down approaches, in contrast, will produce less discriminative speaker models but, importantly, models which are potentially better normalized against nuisance variation. We report experiments conducted on two standard, single-channel NIST RT evaluation datasets which validate our hypotheses. Results show that competitive performance can be achieved with both bottom-up and top-down approaches (average DERs of 21% and 22%), and that neither approach is superior. Speaker purification, which aims to improve speaker discrimination, gives more consistent improvements with the top-down system than with the bottom-up system (average DERs of 19% and 25%), thereby confirming that the top-down system is less discriminative and that the bottom-up system is less stable. Finally, we report a new combination strategy that exploits the merits of the two approaches. Combination delivers an average DER of 17% and confirms the intrinsic complementary of the two approaches.
引用
收藏
页码:382 / 392
页数:11
相关论文
共 50 条
  • [1] An Integrated Top-Down/Bottom-Up Approach To Speaker Diarization
    Bozonnet, Simon
    Evans, Nicholas
    Fredouille, Corinne
    Wang, Dong
    Troncy, Raphael
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2654 - +
  • [2] LINGUISTIC INFLUENCES ON BOTTOM-UP AND TOP-DOWN CLUSTERING FOR SPEAKER DIARIZATION
    Bozonnet, Simon
    Wang, Dong
    Evans, Nicholas
    Troncy, Raphael
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4424 - 4427
  • [3] A comparative study of top-down and bottom-up approaches for the preparation of micro/nanosuspensions
    Verma, Sudhir
    Gokhale, Rajeev
    Burgess, Diane J.
    [J]. INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2009, 380 (1-2) : 216 - 222
  • [4] A comparative study of top-down and bottom-up approaches for the preparation of nanosuspensions of glipizide
    Mahesh, Koneti Venkata
    Singh, Sachin Kumar
    Gulati, Monica
    [J]. POWDER TECHNOLOGY, 2014, 256 : 436 - 449
  • [5] COMPARATIVE AND EXPERIMENTAL APPROACHES TO TOP-DOWN AND BOTTOM-UP REGULATION OF BACTERIA
    PACE, ML
    COLE, JJ
    [J]. MICROBIAL ECOLOGY, 1994, 28 (02) : 181 - 193
  • [6] Top-down versus bottom-up approaches in proteomics
    Wehr, Tim
    [J]. LC GC NORTH AMERICA, 2006, 24 (09) : 1004 - +
  • [7] Sustainability and uncertainty: Bottom-up and top-down approaches
    Jensen, K. Klint
    [J]. ITALIAN JOURNAL OF ANIMAL SCIENCE, 2007, 6 : 853 - 855
  • [8] Top-Down and Bottom-Up Approaches to Company Competitiveness
    Kotik, Vitezslav
    Voracek, Jan
    [J]. 15TH INTERNATIONAL FORUM ON KNOWLEDGE ASSET DYNAMICS (IFKAD 2020): KNOWLEDGE IN DIGITAL AGE, 2020, : 100 - 119
  • [9] General modelling approaches: Top-down or bottom-up?
    Worsley, TE
    Harris, RCE
    [J]. NATIONAL TRANSPORT MODELS: RECENT DEVELOPMENTS AND PROSPECTS, 2002, : 57 - 60
  • [10] TOP-DOWN AND BOTTOM-UP APPROACHES TO PROJECT SELECTION
    不详
    [J]. RESEARCH MANAGEMENT, 1978, 21 (02): : 22 - 24