Dimension reduction for speaker identification based on mutual information

被引：0

作者：

Lu, Xugang ^{[1
]}

Dang, Jianwu ^{[1
]}

机构：

[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa 9231292, Japan

来源：

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年

关键词：

speaker identification; mutual information; feature selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dimension reduction is a necessary step for speech feature extraction in a speaker identification system. Discrete Cosine Transform (DCT) or Principal Component Analysis (PCA) is widely used for dimension reduction. By choosing basis vectors from basis vector pool of DCT or PCA which contribute more to data distribution variance or reconstruction accuracy of speech data set, we can transform the data set by projecting them on to the selected basis vectors. However, keeping the maximum distribution variance or high reconstruction accuracy does not guarantee the optimal keeping of high speaker discriminative information. In this paper, we proposed a basis vector selection method based on mutual information concept which guarantees the keeping of high speaker discriminative information. The mutual information is used to measure the dependency between the features extracted using basis vectors and speaker class labels. The high mutual information related basis vectors are chosen for feature extraction. Considering one speaker feature may be encoded in more than one basis vectors, we proposed to use joint mutual information concept which takes the dependency between feature variables into consideration. Based on the selected basis vectors from DCT or PCA basis vector pool, we extracted features for speaker identification experiments. Experimental results showed that the speaker identification error rate using proposed feature was reduced 11% and 8% on average for DCT and PCA based features respectively.

引用

页码：1157 / 1160

页数：4

共 50 条

[31] Attention based gender and nationality information exploration for speaker identification
Tang, Yong
Liu, Chuang
Leng, Yan
Zhao, Weiwei
Sun, Jiande
Sun, Chengli
Wang, Rongyan
Yuan, Qi
Li, Dengwang
Xu, Huaqiang
DIGITAL SIGNAL PROCESSING, 2022, 123
[32] Dimensionality reduction based on non-parametric mutual information
Faivishevsky, Lev
Goldberger, Jacob
NEUROCOMPUTING, 2012, 80 : 31 - 37
[33] Chatter identification with mutual information
Berger, B
Belai, C
Anand, D
JOURNAL OF SOUND AND VIBRATION, 2003, 267 (01) : 178 - 186
[34] Research on fuzzy rough parallel reduction based on mutual information
Xu, F. (xufeifei@shiep.edu.cn), 1600, Binary Information Press (10):
[35] Supervector Dimension Reduction for Efficient Speaker Age Estimation Based on the Acoustic Speech Signal
Dobry, Gil
Hecht, Ron M.
Avigal, Mireille
Zigel, Yaniv
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1975 - 1985
[36] Individual dimension Gaussian mixture model for speaker identification
Wang, C
Hou, LM
Fang, Y
ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3781 : 172 - 179
[37] Application of the mutual information minimization to speaker recognition/verification improvement
Solé-Casals, J
Faúndez-Zanuy, M
INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 865 - 872
[38] Textile defects Identification Based on Neural Networks and Mutual Information
Abdel-Azim, Gamil
Nasri, Salem
2013 INTERNATIONAL CONFERENCE ON COMPUTER APPLICATIONS TECHNOLOGY (ICCAT), 2013,
[39] A novel Mutual Information based PCA approach for face identification
Krishnakumar, K.
Vasandkumar, K.
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 22503 - 22519
[40] A novel Mutual Information based PCA approach for face identification
Krishnakumar K
Vasandkumar K
Multimedia Tools and Applications, 2024, 83 : 22503 - 22519

← 1 2 3 4 5 →