ATTENTION-BASED MODELS FOR TEXT-DEPENDENT SPEAKER VERIFICATION

被引：0

作者：

Chowdhury, F. A. Rezaur Rahman ^{[1
]}

Wang, Quan ^{[2
]}

Moreno, Ignacio Lopez ^{[2
]}

Wan, Li ^{[2
]}

机构：

[1] Washington State Univ, Pullman, WA 99164 USA

[2] Google Inc, Mountain View, CA USA

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

Attention-based model; sequence summarization; speaker recognition; pooling; LSTM;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Attention-based models have recently shown great performance on a range of tasks, such as speech recognition, machine translation, and image captioning due to their ability to summarize relevant information that expands through the entire length of an input sequence. In this paper, we analyze the usage of attention mechanisms to the problem of sequence summarization in our end-to-end text-dependent speaker recognition system. We explore different topologies and their variants of the attention layer. and compare different pooling methods on the attention weights. Ultimately, we show that attention-based models can improves the Equal Error Rate (EER) of our speaker verification system by relatively 14% compared to our non-attention LSTM baseline model.

引用

页码：5359 / 5363

页数：5

共 50 条

[1] Bidirectional Attention for Text-Dependent Speaker Verification
Fang, Xin
Gao, Tian
Zou, Liang
Ling, Zhenhua
[J]. SENSORS, 2020, 20 (23) : 1 - 17
[2] END-TO-END ATTENTION BASED TEXT-DEPENDENT SPEAKER VERIFICATION
Zhang, Shi-Xiong
Chen, Zhuo
Zhao, Yong
Li, Jinyu
Gong, Yifan
[J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 171 - 178
[3] Text-Dependent Speaker Verification System: A Review
Debnath, Saswati
Soni, B.
Baruah, U.
Sah, D. K.
[J]. PROCEEDINGS OF 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO), 2015,
[4] Deep feature for text-dependent speaker verification
Liu, Yuan
Qian, Yanmin
Chen, Nanxin
Fu, Tianfan
Zhang, Ya
Yu, Kai
[J]. SPEECH COMMUNICATION, 2015, 73 : 1 - 13
[5] Covariance Based Deep Feature for Text-Dependent Speaker Verification
Wang, Shuai
Dinkel, Heinrich
Qian, Yanmin
Yu, Kai
[J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 231 - 242
[6] Robust Methods for Text-Dependent Speaker Verification
Bhukya, Ramesh K.
Prasanna, S. R. Mahadeva
Sarma, Biswajit Dev
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (11) : 5253 - 5288
[7] Content Normalization for Text-dependent Speaker Verification
Dey, Subhadeep
Madikeri, Srikanth
Motlicek, Petr
Ferras, Marc
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1482 - 1486
[8] IMPOSTURE CLASSIFICATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Larcher, Anthony
Lee, Kong Aik
Ma, Bin
Li, Haizhou
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[9] Robust Methods for Text-Dependent Speaker Verification
Ramesh K. Bhukya
S. R. Mahadeva Prasanna
Biswajit Dev Sarma
[J]. Circuits, Systems, and Signal Processing, 2019, 38 : 5253 - 5288
[10] Sub-band based text-dependent speaker verification
Sivakumaran, P
Ariyaeeinia, AM
Loomes, MJ
[J]. SPEECH COMMUNICATION, 2003, 41 (2-3) : 485 - 509

← 1 2 3 4 5 →