Text-dependent speaker verification: Classifiers, databases and RSR2015

被引：182

作者：

Larcher, Anthony ^{[1
]}

Lee, Kong Aik ^{[1
]}

Ma, Bin ^{[1
]}

Li, Haizhou ^{[1
]}

机构：

[1] Human Language Technol Dept 1, Inst Infocomm Res I2R, Singapore 138632, Singapore

来源：

SPEECH COMMUNICATION | 2014年 / 60卷

关键词：

Speaker recognition; Text-dependent; Database; RECOGNITION; SPEECH; HMM; IDENTIFICATION; NORMALIZATION; FEATURES; CORPUS;

D O I：

10.1016/j.specom.2014.03.001

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The RSR2015 database, designed to evaluate text-dependent speaker verification systems under different durations and lexical constraints has been collected and released by the Human Language Technology (HLT) department at Institute for Infocomm Research ((IR)-R-2) in Singapore. English speakers were recorded with a balanced diversity of accents commonly found in Singapore. More than 151 h of speech data were recorded using mobile devices. The pool of speakers consists of 300 participants (143 female and 157 male speakers) between 17 and 42 years old making the RSR2015 database one of the largest publicly available database targeted for text-dependent speaker verification. We provide evaluation protocol for each of the three parts of the database, together with the results of two speaker verification system: the HiLAM system, based on a three layer acoustic architecture, and an i-vector/PLDA system. We thus provide a reference evaluation scheme and a reference performance on RSR2015 database to the research community. The HiLAM outperforms the state-of-the-art i-vector system in most of the scenarios. (C) 2014 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/3.0/).

引用

页码：56 / 77

页数：22

共 50 条

[1] Extended RSR2015 for text-dependent speaker verification over VHF channel
Larcher, Anthony
Lee, Kong Aik
Martinez, Pablo L. Sordo
Trung Hieu Nguyen
Ma, Bin
Li, Haizhou
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1322 - 1326
[2] The RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases
Larcher, Anthony
Lee, Kong Aik
Ma, Bin
Li, Haizhou
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1578 - 1581
[3] Text-Dependent Speaker Verification System: A Review
Debnath, Saswati
Soni, B.
Baruah, U.
Sah, D. K.
[J]. PROCEEDINGS OF 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO), 2015,
[4] Deep feature for text-dependent speaker verification
Liu, Yuan
Qian, Yanmin
Chen, Nanxin
Fu, Tianfan
Zhang, Ya
Yu, Kai
[J]. SPEECH COMMUNICATION, 2015, 73 : 1 - 13
[5] Bidirectional Attention for Text-Dependent Speaker Verification
Fang, Xin
Gao, Tian
Zou, Liang
Ling, Zhenhua
[J]. SENSORS, 2020, 20 (23) : 1 - 17
[6] Robust Methods for Text-Dependent Speaker Verification
Bhukya, Ramesh K.
Prasanna, S. R. Mahadeva
Sarma, Biswajit Dev
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (11) : 5253 - 5288
[7] Content Normalization for Text-dependent Speaker Verification
Dey, Subhadeep
Madikeri, Srikanth
Motlicek, Petr
Ferras, Marc
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1482 - 1486
[8] IMPOSTURE CLASSIFICATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Larcher, Anthony
Lee, Kong Aik
Ma, Bin
Li, Haizhou
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[9] Robust Methods for Text-Dependent Speaker Verification
Ramesh K. Bhukya
S. R. Mahadeva Prasanna
Biswajit Dev Sarma
[J]. Circuits, Systems, and Signal Processing, 2019, 38 : 5253 - 5288
[10] Parallel Speaker and Content Modelling for Text-dependent Speaker Verification
Ma, Jianbo
Irtza, Saad
Sriskandaraja, Kaavya
Sethu, Vidhyasaharan
Ambikairajah, Eliathamby
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 435 - 439

← 1 2 3 4 5 →