DNN BASED SPEAKER EMBEDDING USING CONTENT INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION

被引：0

作者：

Dey, Subhadeep ^{[1
,2
]}

Koshinaka, Takafumi ^{[3
]}

Motlicek, Petr ^{[1
]}

Madikeri, Srikanth ^{[1
]}

机构：

[1] Idiap Res Inst, Martigny, Switzerland

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[3] NEC Corp Ltd, Tokyo, Japan

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

speaker verification; speaker embedding; i-vectors; content mismatch;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we are interested in exploring Deep Neural Network (DNN) based speaker embedding for Random-digit task using content information. To this end, a technique is applied to automatically select common phonetic units between the enrollment and test data to produce speaker verification scores. Furthermore, a novel approach is proposed to incorporate content information in the DNN directly. It is hypothesized that features extracted using this DNN will be helpful for the task. Experiments on the RSR dataset show that the proposed method outperforms the baseline i-vector system by 43% relative equal error rate.

引用

页码：5344 / 5348

页数：5

共 50 条

[21] Addressing Text-Dependent Speaker Verification Using Singing Speech
Shi, Yan
Zhou, Juanjuan
Long, Yanhua
Li, Yijie
Mao, Hongwei
APPLIED SCIENCES-BASEL, 2019, 9 (13):
[22] Sub-band based text-dependent speaker verification
Sivakumaran, P
Ariyaeeinia, AM
Loomes, MJ
SPEECH COMMUNICATION, 2003, 41 (2-3) : 485 - 509
[23] Analysis of the Hilbert Spectrum for Text-Dependent Speaker Verification
Sharma, Rajib
Bhukya, Ramesh K.
Prasanna, S. R. M.
SPEECH COMMUNICATION, 2018, 96 : 207 - 224
[24] Towards Goat Detection in Text-Dependent Speaker Verification
Toledo-Ronen, Orith
Aronowitz, Hagai
Hoory, Ron
Pelecanos, Jason
Nahamoo, David
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 16 - +
[25] Text-dependent speaker recognition using speaker specific compensation
Laxman, S
Sastry, PS
IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 384 - 387
[26] A Survey on Text-Dependent and Text-Independent Speaker Verification
Tu, Youzhi
Lin, Weiwei
Mak, Man-Wai
IEEE ACCESS, 2022, 10 : 99038 - 99049
[27] Tandem Deep Features for Text-Dependent Speaker Verification
Fu, Tianfan
Qian, Yanmin
Liu, Yuan
Yu, Kai
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1327 - 1331
[28] EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Chen, Liping
Zhao, Yong
Zhang, Shi-Xiong
Li, Jie
Ye, Guoli
Soong, Frank
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5364 - 5368
[29] Using Phoneme Recognition and Text-dependent Speaker Verification to Improve Speaker Segmentation for Chinese Speech
Wang, Gang
Wu, Xiaojun
Zheng, Thomas Fang
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1457 - 1460
[30] End-to-End Text-Dependent Speaker Verification
Heigold, Georg
Moreno, Ignacio
Bengio, Samy
Shazeer, Noam
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5115 - 5119

← 1 2 3 4 5 →