The AMI speaker diarization system for NIST RT06s meeting data

Authors
van Leeuwen, David A. [1]
Huijbregts, Marijn [2]
Affiliations
[1] TNO, Human Factors, POB 23, NL-3769 DE Soesterberg, Netherlands
[2] Univ Twente, Dept EEMCS, Human Media Interact, Enschede, Netherlands
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We describe the systems submitted to the NIST RT06s evaluation for the Speech Activity Detection (SAD) and Speaker Diarization (SPKR) tasks. For speech activity detection, a new analysis methodology is presented that generalizes the Detection Error Tradeoff analysis commonly used in speaker detection tasks. The speaker diarization systems are based on the TNO and ICSI systems submitted for RT05s. For the conference room evaluation Single Distant Microphone condition, the SAD system performs well, with an error rate of 4.23%, and the 'HMM-BIC' SPKR system performs competitively, with an error rate of 37.2% including overlapping speech.
Pages: 371 / +
Number of pages: 3
Related papers
39 in total
  • [1] Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system
    Anguera, Xavier
    Wooters, Chuck
    Pardo, Jose M.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1674 - 1677
  • [2] Robust speaker diarization for meetings: ICSI RT06S meetings evaluation system
    Anguera, Xavier
    Wooters, Chuck
    Pardo, Jose M.
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 346 - +
  • [3] The TNO speaker diarization system for NIST RT05s meeting data
    van Leeuwen, DA
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3869 : 440 - 449
  • [4] The LIMSI RT06s lecture transcription system
    Lamel, L.
    Bilinski, E.
    Adda, G.
    Gauvain, J. L.
    Schwenk, H.
    [J]. MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2006, 4299 : 457 - +
  • [5] Speaker diarization system on the 2007 NIST rich transcription meeting recognition evaluation
    Sun, Hanwu
    Nwe, Tin Lay
    Chin, Eugene
    Koh, Wei
    Bin, Ma
    Li, Haizhou
    [J]. MULTIMEDIA SYSTEMS AND APPLICATIONS X, 2007, 6777
  • [6] Progress in the AMIDA speaker diarization system for meeting data
    van Leeuwen, David A.
    Konecny, Matej
    [J]. MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 475 - 483
  • [7] SPHEREDIAR: AN EFFECTIVE SPEAKER DIARIZATION SYSTEM FOR MEETING DATA
    Kaseva, Tuomas
    Rouhe, Aku
    Kurimo, Mikko
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 373 - 380
  • [8] Speaker Diarization and Linking of Meeting Data
    Ferras, Marc
    Madikeri, Srikanth
    Bourlard, Herve
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 1935 - 1945
  • [9] SPEAKER DIARIZATION SYSTEM FOR RT07 AND RT09 MEETING ROOM AUDIO
    Sun, Hanwu
    Ma, Bin
    Khine, Swe Zin Kalayar
    Li, Haizhou
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4982 - 4985
  • [10] The ICSI RT07s speaker diarization system
    Wooters, Chuck
    Huijbregts, Marijn
    [J]. MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 509 - 519