Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge

Cited by: 10
Authors
Novotny, Ondrej [1]
Matejka, Pavel
Plchot, Oldrich
Glembek, Ondrej
Burget, Lukas
Cernocky, Jan Honza
Affiliations
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
DOI
10.21437/Interspeech.2016-981
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we summarize our efforts for the Speakers In The Wild (SITW) challenge, and we present our findings with this new dataset for speaker recognition. Apart from the standard comparison of different SRE systems, we analyze the use of diarization for dealing with audio segments containing multiple speakers, as diarization is a necessary system component in part of the newly introduced enrollment and test protocols. Our state-of-the-art systems used in this work utilize both cepstral and DNN-based bottleneck features and are based on i-vectors followed by a Probabilistic Linear Discriminant Analysis (PLDA) classifier and logistic regression calibration/fusion. We present both narrow-band (8 kHz) and wide-band (16 kHz) systems together with their fusions.
Pages: 828-832
Number of pages: 5
Related papers
50 in total
  • [1] A Speaker Recognition System for the SITW Challenge
    Kudashev, Oleg
    Novoselov, Sergey
    Simonchik, Konstantin
    Kozlov, Alexandr
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 833 - 837
  • [2] AUT System for SITW Speaker Recognition Challenge
    Khosravani, Abbas
    Homayounpour, Mohammad Mehdi
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 843 - 847
  • [3] LIA system for the SITW Speaker Recognition Challenge
    Ben Kheder, Waad
    Ajili, Moez
    Bousquet, Pierre-Michel
    Matrouf, Driss
    Bonastre, Jean-François
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 848 - 852
  • [4] Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge
    Liu, Yi
    Tian, Yao
    He, Liang
    Liu, Jia
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 853 - 857
  • [5] AMRITATCS-IITGUWAHATI Combined System for the Speakers in the Wild (SITW) Speaker Recognition Challenge
    George, Kuruvachan K.
    Das, Rohan Kumar
    Jelil, Sarfaraz
    Das, K. Arun
    Kumar, C. Santhosh
    Prasanna, S. R. Mahadeva
    Panda, Ashish
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2842 - 2846
  • [6] The Speakers in the Wild (SITW) Speaker Recognition Database
    McLaren, Mitchell
    Ferrer, Luciana
    Castan, Diego
    Lawson, Aaron
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 818 - 822
  • [7] Speakers In The Wild (SITW): The QUT Speaker Recognition System
    Ghaemmaghami, H.
    Rahman, M. H.
    Himawan, I.
    Dean, D.
    Kanagasundaram, A.
    Sridharan, S.
    Fookes, C.
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 838 - 842
  • [8] STC Speaker Recognition Systems for the VOiCES From a Distance Challenge
    Novoselov, Sergey
    Gusev, Aleksei
    Ivanov, Artem
    Pekhovsky, Timur
    Shulipa, Andrey
    Lavrentyeva, Galina
    Volokhov, Vladimir
    Kozlov, Alexandr
    [J]. INTERSPEECH 2019, 2019, : 2443 - 2447
  • [9] UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation
    Zhang, Chunlei
    Bahmaninezhad, Fahimeh
    Ranjan, Shivesh
    Yu, Chengzhu
    Shokouhi, Navid
    Hansen, John H. L.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1343 - 1347
  • [10] The VoxCeleb Speaker Recognition Challenge: A Retrospective
    Huh, Jaesung
    Chung, Joon Son
    Nagrani, Arsha
    Brown, Andrew
    Jung, Jee-weon
    Garcia-Romero, Daniel
    Zisserman, Andrew
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3850 - 3866