ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge

Cited by: 2
Authors
Vinals, Ignacio [1]
Gimeno, Pablo [1]
Ortega, Alfonso [1]
Miguel, Antonio [1]
Lleida, Eduardo [1]
Affiliations
[1] Univ Zaragoza, Aragon Inst Engn Res I3A, ViVoLab, Zaragoza, Spain
Source
INTERSPEECH 2019
Keywords
diarization; DIHARD Challenge; PLDA; Variational Bayes; Tree search; M-algorithm
DOI
10.21437/Interspeech.2019-2462
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
This paper presents the latest improvements in speaker diarization obtained by the ViVoLAB research group for the 2019 DIHARD Diarization Challenge. This evaluation seeks to improve diarization in adverse conditions. To this end, the audio recordings cover multiple scenarios with no restrictions in terms of speakers, overlapped speech, or audio quality. Our submission follows the traditional segmentation-clustering-resegmentation pipeline: speaker embeddings are extracted from acoustic segments containing a single speaker and then clustered by means of a PLDA model. Our contribution in this work focuses on the clustering step. We present results with our Variational Bayes PLDA clustering and with our tree-based clustering strategy, which sequentially assigns each embedding to its corresponding speaker according to a PLDA model. Both strategies compare multiple diarization hypotheses and select the final one according to a generative criterion. We also analyze the impact of the different state-of-the-art embeddings available with both clustering approaches.
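The tree-based strategy named in the abstract and keywords (sequential assignment with an M-algorithm search, ranked by a generative criterion) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simplified two-covariance Gaussian model with isotropic variances as a stand-in for a trained PLDA, and the names `cluster_loglik`, `partition_loglik`, `m_algorithm`, `within`, `between`, and `m`, as well as the toy embeddings, are hypothetical.

```python
import math


def cluster_loglik(points, within=0.1, between=25.0):
    """Marginal log-likelihood of one cluster under an assumed model:
    speaker mean y ~ N(0, between * I), embedding x | y ~ N(y, within * I).
    The speaker mean is integrated out in closed form, per dimension."""
    n = len(points)
    total = 0.0
    for d in range(len(points[0])):
        s = sum(p[d] for p in points)        # sum of the coordinate
        q = sum(p[d] ** 2 for p in points)   # sum of squares
        denom = within + n * between
        quad = (q - between / denom * s * s) / within
        logdet = (n - 1) * math.log(within) + math.log(denom)
        total -= 0.5 * (n * math.log(2.0 * math.pi) + logdet + quad)
    return total


def partition_loglik(embeddings, labels, within=0.1, between=25.0):
    """Generative score of a hypothesis: sum of its cluster log-likelihoods."""
    clusters = {}
    for x, k in zip(embeddings, labels):
        clusters.setdefault(k, []).append(x)
    return sum(cluster_loglik(pts, within, between) for pts in clusters.values())


def m_algorithm(embeddings, m=3, within=0.1, between=25.0):
    """Sequentially assign embeddings, keeping only the m best partial
    hypotheses (label tuples) after each step."""
    beam = [((), 0.0)]
    for t in range(len(embeddings)):
        candidates = []
        for labels, _ in beam:
            n_spk = max(labels) + 1 if labels else 0
            # Assign embedding t to each existing speaker or to a new one.
            for k in range(n_spk + 1):
                new_labels = labels + (k,)
                score = partition_loglik(embeddings[: t + 1], new_labels,
                                         within, between)
                candidates.append((new_labels, score))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beam = candidates[:m]
    return beam[0]


# Toy two-speaker example with made-up 2-D "embeddings".
embeddings = [(5.0, 5.0), (-5.0, -5.0), (5.2, 4.8), (-4.9, -5.1), (5.1, 5.1)]
best_labels, best_score = m_algorithm(embeddings, m=3)
print(best_labels, best_score)
```

Setting `m=1` reduces the search to a greedy sequential assignment, while larger values keep more competing diarization hypotheses alive at each step at a proportionally higher cost.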
Pages: 988-992
Number of pages: 5
Related papers
  • [1] UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge
    Zajic, Zbynek
    Kunesova, Marie
    Hruz, Marek
    Vanek, Jan
    [J]. INTERSPEECH 2019, 2019: 993-997
  • [2] Speaker Diarization with Deep Speaker Embeddings for DIHARD Challenge II
    Novoselov, Sergey
    Gusev, Aleksei
    Ivanov, Artem
    Pekhovsky, Timur
    Shulipa, Andrey
    Avdeeva, Anastasia
    Gorlanov, Artem
    Kozlov, Alexandr
    [J]. INTERSPEECH 2019, 2019: 1003-1007
  • [3] ZCU-NTIS Speaker Diarization System for the DIHARD 2018 Challenge
    Zajic, Zbynek
    Kunesova, Marie
    Zelinka, Jan
    Hruz, Marek
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2788-2792
  • [4] Speaker Diarization with Enhancing Speech for the First DIHARD Challenge
    Sun, Lei
    Du, Jun
    Jiang, Chao
    Zhang, Xueyang
    He, Shan
    Yin, Bing
    Lee, Chin-Hui
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2793-2797
  • [5] Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge
    Himawan, Ivan
    Rahman, Md Hafizur
    Sridharan, Sridha
    Fookes, Clinton
    Kanagasundaram, Ahilan
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018: 1029-1035
  • [6] LEAP Diarization System for the Second DIHARD Challenge
    Singh, Prachi
    Vardhan, Harsha M. A.
    Ganapathy, Sriram
    Kanagasundaram, Ahilan
    [J]. INTERSPEECH 2019, 2019: 983-987
  • [7] An Analysis of Speaker Diarization Fusion Methods For The First DIHARD Challenge
    Yin, Bing
    Du, Jun
    Sun, Lei
    Zhang, Xueyang
    He, Shan
    Ling, Zhenhua
    Hu, Guoping
    Guo, Wu
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018: 1473-1477
  • [8] BUT System for the Second DIHARD Speech Diarization Challenge
    Landini, Federico
    Wang, Shuai
    Diez, Mireia
    Burget, Lukas
    Matejka, Pavel
    Zmolikova, Katerina
    Mosner, Ladislav
    Silnova, Anna
    Plchot, Oldrich
    Novotny, Ondrej
    Zeinali, Hossein
    Rohdin, Johan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020: 6529-6533
  • [9] BUT system for DIHARD Speech Diarization Challenge 2018
    Diez, Mireia
    Landini, Federico
    Burget, Lukas
    Rohdin, Johan
    Silnova, Anna
    Zmolikova, Katerina
    Novotny, Ondrej
    Vesely, Karel
    Glembek, Ondrej
    Plchot, Oldrich
    Mosner, Ladislav
    Matejka, Pavel
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2798-2802
  • [10] Scenario-Dependent Speaker Diarization for DIHARD-III Challenge
    Wang, Yu-Xuan
    Du, Jun
    He, Mao-Kui
    Niu, Shu-Tong
    Sun, Lei
    Lee, Chin-Hui
    [J]. INTERSPEECH 2021, 2021: 3106-3110