The Speakers in the Wild (SITW) Speaker Recognition Database

被引:168
|
作者
McLaren, Mitchell [1 ]
Ferrer, Luciana [2 ,3 ]
Castan, Diego [1 ]
Lawson, Aaron [1 ]
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Univ Buenos Aires, FCEN, Dept Comp, Buenos Aires, DF, Argentina
[3] Consejo Nacl Invest Cient & Tecn, Buenos Aires, DF, Argentina
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
speaker recognition; database; real-world data;
D O I
10.21437/Interspeech.2016-1129
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The Speakers in the Wild (SITW) speaker recognition database contains hand-annotated speech samples from open-source media for the purpose of benchmarking text-independent speaker recognition technology on single and multi-speaker audio acquired across unconstrained or "wild" conditions. The database consists of recordings of 299 speakers, with an average of eight different sessions per person. Unlike existing databases for speaker recognition, this data was not collected under controlled conditions and thus contains real noise, reverberation, intraspeaker variability and compression artifacts. These factors are often convolved in the real world, as the SITW data shows, and they make SITW a challenging database for single- and multi speaker recognition
引用
收藏
页码:818 / 822
页数:5
相关论文
共 50 条
  • [1] Speakers In The Wild (SITW): The QUT Speaker Recognition System
    Ghaemmaghami, H.
    Rahman, M. H.
    Himawan, I.
    Dean, D.
    Kanagasundaram, A.
    Sridharan, S.
    Fookes, C.
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 838 - 842
  • [2] AMRITATCS-IITGUWAHATI Combined System for the Speakers in the Wild (SITW) Speaker Recognition Challenge
    George, Kuruvachan K.
    Das, Rohan Kumar
    Jelil, Sarfaraz
    Das, K. Arun
    Kumar, C. Santhosh
    Prasanna, S. R. Mahadeva
    Panda, Ashish
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2842 - 2846
  • [3] Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge
    Liu, Yi
    Tian, Yao
    He, Liang
    Liu, Jia
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 853 - 857
  • [4] The 2016 Speakers in the Wild Speaker Recognition Evaluation
    McLaren, Mitchell
    Ferrer, Luciana
    Castan, Diego
    Lawson, Aaron
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 823 - 827
  • [5] A Speaker Recognition System for the SITW Challenge
    Kudashev, Oleg
    Novoselov, Sergey
    Simonchik, Konstantin
    Kozlov, Alexandr
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 833 - 837
  • [6] AUT System for SITW Speaker Recognition Challenge
    Khosravani, Abbas
    Homayounpour, Mohammad Mehdi
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 843 - 847
  • [7] LIA system for the SITW Speaker Recognition Challenge
    Ben Kheder, Waad
    Ajili, Moez
    Bousquet, Pierre-Michel
    Matrouf, Driss
    Bonastre, Tean-Frangois
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 848 - 852
  • [8] Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge
    Novotny, Ondrej
    Matejka, Pavel
    Plchot, Oldrich
    Glembek, Ondrej
    Burget, Lukas
    Cernocky, Jan Honza
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 828 - 832
  • [9] Polyphone-IPSC: A shared speakers database for evaluation of forensic automatic speaker recognition systems
    Meuwly, D
    Alexander, A
    Drygajlo, A
    Botti, F
    FORENSIC SCIENCE INTERNATIONAL, 2003, 136 : 367 - 367
  • [10] Speaker recognition by location in the space of reference speakers
    Mami, Y
    Charlet, D
    SPEECH COMMUNICATION, 2006, 48 (02) : 127 - 141