A NOVEL I-VECTOR FRAMEWORK USING MULTIPLE FEATURES AND PCA FOR SPEAKER RECOGNITION IN SHORT SPEECH CONDITION

Cited by: 0

Authors:

Zhang, Chi ^{[1]}

Li, Xiaoqiang ^{[1]}

Li, Wei ^{[2,3]}

Lu, Peizhong ^{[2]}

Zhang, Wenqiang ^{[2,3]}

Affiliations:

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China

[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China

[3] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China

Source:

PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP) | 2016

Keywords:

speaker recognition; short speech condition; PCA; i-vector; JOINT FACTOR-ANALYSIS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202;

Abstract:

Speaker recognition in the short speech condition is difficult because both the training and test utterances are very short. One of the main disadvantages of existing speaker recognition methods is that they require a large amount of data, which is usually unavailable in real-world applications. In our experiments, conventional methods using a single feature do not perform well on short speech. To overcome this difficulty, we propose a novel i-vector framework that uses multiple features and Principal Component Analysis (PCA) in the short speech condition, since a combination of multiple features can represent more aspects of a speaker. PCA is used to map the multiple features onto an uncorrelated, orthogonal basis set, meeting the requirements of a Gaussian Mixture Model (GMM) with diagonal covariance matrices and of the i-vector model. The proposed approach yields a relative improvement of roughly 50% in equal error rate over a state-of-the-art system when evaluated on the telephone conditions of the 2010 NIST Speaker Recognition Evaluation (SRE).
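The PCA step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say which features are combined or what dimensions are used, so the two synthetic streams below (`stream_a`, `stream_b`), their sizes, and the helper name `pca_decorrelate` are all hypothetical. The sketch only shows that stacking correlated per-frame feature streams and projecting them onto the PCA basis gives coordinates with a diagonal covariance, which is the property a diagonal-covariance GMM relies on.

```python
import numpy as np


def pca_decorrelate(features, n_components=None):
    """Project frame-level features onto their principal axes.

    features: array of shape (n_frames, n_dims).
    Returns the projected frames, the orthonormal basis, and the mean.
    """
    mean = features.mean(axis=0)
    centered = features - mean
    cov = np.cov(centered, rowvar=False)
    # eigh returns eigenvalues in ascending order for symmetric matrices
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]          # largest variance first
    basis = eigvecs[:, order[:n_components]]   # None keeps all components
    return centered @ basis, basis, mean


# Synthetic stand-ins for two feature streams (assumption: the real system
# would use acoustic features extracted from speech, not random data).
rng = np.random.default_rng(0)
n_frames = 500
stream_a = rng.normal(size=(n_frames, 13))
mixing = rng.normal(size=(13, 12))
# stream_b is deliberately correlated with stream_a
stream_b = stream_a @ mixing + 0.1 * rng.normal(size=(n_frames, 12))

stacked = np.hstack([stream_a, stream_b])      # frame-wise concatenation
projected, basis, mean = pca_decorrelate(stacked)

# Off-diagonal covariance of the projected features should be ~0
proj_cov = np.cov(projected, rowvar=False)
off_diag = proj_cov - np.diag(np.diag(proj_cov))
max_off_diag = np.max(np.abs(off_diag))
```

In a full pipeline the projected frames would then feed GMM training and i-vector extraction, which this sketch leaves out. Passing a smaller `n_components` would also reduce dimensionality, but the abstract does not state whether the paper does so.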


Pages: 499-503

Page count: 5
