Text-dependent speaker verification: Classifiers, databases and RSR2015

被引:182
|
作者
Larcher, Anthony [1 ]
Lee, Kong Aik [1 ]
Ma, Bin [1 ]
Li, Haizhou [1 ]
机构
[1] Human Language Technol Dept 1, Inst Infocomm Res I2R, Singapore 138632, Singapore
关键词
Speaker recognition; Text-dependent; Database; RECOGNITION; SPEECH; HMM; IDENTIFICATION; NORMALIZATION; FEATURES; CORPUS;
D O I
10.1016/j.specom.2014.03.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The RSR2015 database, designed to evaluate text-dependent speaker verification systems under different durations and lexical constraints has been collected and released by the Human Language Technology (HLT) department at Institute for Infocomm Research ((IR)-R-2) in Singapore. English speakers were recorded with a balanced diversity of accents commonly found in Singapore. More than 151 h of speech data were recorded using mobile devices. The pool of speakers consists of 300 participants (143 female and 157 male speakers) between 17 and 42 years old making the RSR2015 database one of the largest publicly available database targeted for text-dependent speaker verification. We provide evaluation protocol for each of the three parts of the database, together with the results of two speaker verification system: the HiLAM system, based on a three layer acoustic architecture, and an i-vector/PLDA system. We thus provide a reference evaluation scheme and a reference performance on RSR2015 database to the research community. The HiLAM outperforms the state-of-the-art i-vector system in most of the scenarios. (C) 2014 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/3.0/).
引用
收藏
页码:56 / 77
页数:22
相关论文
共 50 条
  • [21] On Residual CNN in Text-Dependent Speaker Verification Task
    Malykh, Egor
    Novoselov, Sergey
    Kudashev, Oleg
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 593 - 601
  • [22] Constrained temporal structure for text-dependent speaker verification
    Larcher, Anthony
    Bonastre, Jean-Francois
    Mason, John S. D.
    [J]. DIGITAL SIGNAL PROCESSING, 2013, 23 (06) : 1910 - 1917
  • [23] ATTENTION-BASED MODELS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Chowdhury, F. A. Rezaur Rahman
    Wang, Quan
    Moreno, Ignacio Lopez
    Wan, Li
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5359 - 5363
  • [24] Covariance Based Deep Feature for Text-Dependent Speaker Verification
    Wang, Shuai
    Dinkel, Heinrich
    Qian, Yanmin
    Yu, Kai
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 231 - 242
  • [25] Cohort Selection for Text-dependent Speaker Verification Score Normalization
    Khemiri, Houssemeddine
    Petrovska-Delacretaz, Dijana
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 689 - 692
  • [26] BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020
    Lozano-Diez, Alicia
    Silnova, Anna
    Pulugundla, Bhargav
    Rohdin, Johan
    Vesely, Karel
    Burget, Lukas
    Plchot, Oldrich
    Glembek, Ondrej
    Novotny, Ondvrej
    Matejka, Pavel
    [J]. INTERSPEECH 2020, 2020, : 761 - 765
  • [27] Sub-band based text-dependent speaker verification
    Sivakumaran, P
    Ariyaeeinia, AM
    Loomes, MJ
    [J]. SPEECH COMMUNICATION, 2003, 41 (2-3) : 485 - 509
  • [28] Unsupervised Learning of HMM Topology for Text-dependent Speaker Verification
    Liu, Ming
    Huang, Thomas
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 921 - 924
  • [29] Tandem Features for Text-dependent Speaker Verification on the RedDots Corpus
    Alam, Md Jahangir
    Kenny, Patrick
    Gupta, Vishwa
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 420 - 424
  • [30] Multi-Task Learning for Text-dependent Speaker Verification
    Chen, Nanxin
    Qian, Yanmin
    Yu, Kai
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 185 - 189