EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION

被引：0

作者：

Chen, Liping ^{[1
]}

Zhao, Yong ^{[2
]}

Zhang, Shi-Xiong ^{[2
]}

Li, Jie ^{[1
]}

Ye, Guoli ^{[2
]}

Soong, Frank ^{[3
]}

机构：

[1] Microsoft Search Technol Ctr Asia, Beijing, Peoples R China

[2] Microsoft Corp, One Microsoft Way, Redmond, WA 98052 USA

[3] Microsoft Res Asia, Beijing, Peoples R China

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

Text-dependent speaker verification; sequential speaker characteristics; speaker supervector; dynamic time warping; VARIABILITY;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, given the speaker bottleneck feature vectors extracted with speaker discriminant neural networks, we focus on using the sequential speaker characteristics for text-dependent speaker verification. In each evaluation trial, speaker supervectors are used as the representations of the sequential speaker characteristics rendered in the compared speech utterances. To this end, dynamic time warping is used to warp the variable-length speaker feature vector sequences of the utterances to the same length. Thereafter for every utterance, a speaker supervector can be obtained as the concatenation of its speaker feature vectors. We use Euclidean distance and support vector machine (SVM) to compute the decision score on the speaker supervectors. Our experiments on a Microsoft internal keyword-spotting database showed the effectiveness of the proposed speaker supervector for text-dependent speaker verification. Moreover, when SVM backend was used in scoring, the speaker supervector achieved the best EER performance 1.627%, better than the combination of i-vector and probabilistic linear discriminant analysis.

引用

页码：5364 / 5368

页数：5

共 50 条

[31] ATTENTION-BASED MODELS FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Chowdhury, F. A. Rezaur Rahman
Wang, Quan
Moreno, Ignacio Lopez
Wan, Li
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5359 - 5363
[32] Cohort Selection for Text-dependent Speaker Verification Score Normalization
Khemiri, Houssemeddine
Petrovska-Delacretaz, Dijana
[J]. 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 689 - 692
[33] BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020
Lozano-Diez, Alicia
Silnova, Anna
Pulugundla, Bhargav
Rohdin, Johan
Vesely, Karel
Burget, Lukas
Plchot, Oldrich
Glembek, Ondrej
Novotny, Ondvrej
Matejka, Pavel
[J]. INTERSPEECH 2020, 2020, : 761 - 765
[34] Multi-Task Learning for Text-dependent Speaker Verification
Chen, Nanxin
Qian, Yanmin
Yu, Kai
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 185 - 189
[35] Tandem Features for Text-dependent Speaker Verification on the RedDots Corpus
Alam, Md Jahangir
Kenny, Patrick
Gupta, Vishwa
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 420 - 424
[36] Sub-band based text-dependent speaker verification
Sivakumaran, P
Ariyaeeinia, AM
Loomes, MJ
[J]. SPEECH COMMUNICATION, 2003, 41 (2-3) : 485 - 509
[37] Unsupervised Learning of HMM Topology for Text-dependent Speaker Verification
Liu, Ming
Huang, Thomas
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 921 - 924
[38] A study of the relative importance of temporal characteristics in text-dependent and text-constrained speaker verification
Nealand, JH
Pelecanos, JW
Zilca, RD
Ramaswamy, GN
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 653 - 656
[39] ON INSTANTANEOUS AND TRANSITIONAL SPECTRAL INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
BERNASCONI, C
[J]. SPEECH COMMUNICATION, 1990, 9 (02) : 129 - 139
[40] Addressing Text-Dependent Speaker Verification Using Singing Speech
Shi, Yan
Zhou, Juanjuan
Long, Yanhua
Li, Yijie
Mao, Hongwei
[J]. APPLIED SCIENCES-BASEL, 2019, 9 (13):

← 1 2 3 4 5 →