Articulatory movement features for short-duration text-dependent speaker verification

Cited by: 5

Authors:

Zhang Y. [2]

Long Y. [2]

Shen X. [1]

Wei H. [2]

Yang M. [2]

Ye H. [2]

Mao H. [2]

Affiliations:

[1] College of Humanities and Communications, Shanghai Normal University, Shanghai

[2] Department of Electronical and Information Engineering, Shanghai Normal University, Shanghai

Source:

International Journal of Speech Technology | 2017 / Vol. 20 / Issue 4

Keywords:

Articulatory movement features; Dynamic time warping; Speaker verification; Text-dependent;

DOI:

10.1007/s10772-017-9447-8

Abstract:

Articulatory movement features (AMFs) capture the position and movement properties of articulators such as the tongue, jaw, and lips during speech production. This paper investigates the use of AMFs for short-duration text-dependent speaker verification. AMFs directly characterize the relative motion trajectories of an individual speaker's articulators and are rarely affected by the external environment. We therefore expect AMFs to be superior to traditional acoustic features, such as mel-frequency cepstral coefficients (MFCCs), in characterizing identity differences between speakers. Speaker similarity scores measured by the dynamic time warping (DTW) algorithm are used to make the verification decisions. Experimental results show that AMFs bring significant performance gains over traditional MFCC features on the short-duration text-dependent speaker verification task. © 2017, Springer Science+Business Media, LLC.
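
The DTW-based scoring mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the local distance, the path constraints, the score normalisation, or the decision threshold, so the Euclidean frame distance, the division by `n + m`, and the names `dtw_distance`, `verify`, and `threshold` are all assumptions chosen for illustration. Each utterance is assumed to be a list of per-frame feature vectors (AMF or MFCC).

```python
import math


def dtw_distance(a, b):
    """Length-normalised DTW distance between two feature sequences.

    a, b: non-empty lists of feature vectors (lists of floats, same dimension).
    The Euclidean local distance and the (n + m) normalisation are assumptions,
    not details taken from the paper.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # frame-to-frame distance
            cost[i][j] = d + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m] / (n + m)


def verify(enrol, test, threshold):
    """Accept the claimed identity if the DTW distance is within the threshold.

    The threshold is a hypothetical tuning parameter; in practice it would be
    set on held-out data.
    """
    return dtw_distance(enrol, test) <= threshold
```

For a text-dependent task, `enrol` and `test` would be feature sequences of the same passphrase; a smaller distance means the test utterance follows the enrolled trajectory more closely, whatever its speaking rate.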

Pages: 753-759

Page count: 6
