Text-Independent Speaker Verification Based on Deep Neural Networks and Segmental Dynamic TimeWarping

被引：0

作者：

Adel, Mohamed ^{[1
]}

Afify, Mohamed ^{[1
]}

Gaballah, Akram ^{[2
]}

Fayek, Magda ^{[3
]}

机构：

[1] Microsoft Adv Technol Lab, Cairo, Egypt

[2] Microsoft Corp, Redmond, WA 98052 USA

[3] Cairo Univ, Giza, Egypt

来源：

2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018) | 2018年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present a new method for text-independent speaker verification that combines segmental dynamic time warping (SDTW) and the d-vector approach. The d-vectors, generated from a feed forward deep neural network trained to distinguish between speakers, are used as features to perform alignment and hence calculate the overall distance between the enrolment and test utterances. We present results on the NIST 2008 data set for speaker verification where the proposed method outperforms the conventional i-vector baseline with PLDA scores and outperforms d-vector approach with local distances based on cosine and PLDA scores. Also score combination with the i-vector/ PLDA baseline leads to significant gains over both methods.

引用

页码：1001 / 1006

页数：6

共 50 条

[31] Text-Independent Speaker Verification Using Lightweight 3D Convolutional Neural Networks
Chen, Jyun-Yan
Jeng, Jin-Tsong
[J]. 2024 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING, ICSSE 2024, 2024,
[32] DEEP BOTTLENECK FEATURES FOR I-VECTOR BASED TEXT-INDEPENDENT SPEAKER VERIFICATION
Ghalehjegh, Sina Hamidi
Rose, Richard C.
[J]. 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 555 - 560
[33] ORTHOGONAL TRAINING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Zhu, Yingke
Mak, Brian
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6584 - 6588
[34] Text-Independent Speaker Verification Based on Information Theoretic Learning
Memon, Sheeraz
Khanzada, Tariq Jameel Saifullah
Bhatti, Sania
[J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2011, 30 (03) : 457 - 468
[35] Text-independent speaker verification in embedded environments
Tydlitat, Borivoj
Navratil, Jiri
Pelecanos, Jason W.
Ramaswamy, Ganesh N.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 293 - +
[36] Adaptive method for text-independent speaker verification
[J]. Zhang, Yiying, 2000, (11):
[37] A Text-Independent Speaker Verification System Based on Cross Entropy
Lu, Xiaochun
Yin, Junxun
[J]. COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 : 419 - 426
[38] A novel text-independent speaker verification method based on the global speaker model
Zhang, YY
Zhang, D
Zhu, XY
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (05): : 598 - 602
[39] Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification
Shum, Stephen
Dehak, Najim
Dehak, Reda
Glass, James R.
[J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 76 - 82
[40] Text-independent speaker verification based on relation of MFCC components
Ou, GW
Ke, DF
[J]. 2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 57 - 60

← 1 2 3 4 5 →