Dimension reduction for speaker identification based on mutual information

被引：0

作者：

Lu, Xugang ^{[1
]}

Dang, Jianwu ^{[1
]}

机构：

[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa 9231292, Japan

来源：

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年

关键词：

speaker identification; mutual information; feature selection;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Dimension reduction is a necessary step for speech feature extraction in a speaker identification system. Discrete Cosine Transform (DCT) or Principal Component Analysis (PCA) is widely used for dimension reduction. By choosing basis vectors from basis vector pool of DCT or PCA which contribute more to data distribution variance or reconstruction accuracy of speech data set, we can transform the data set by projecting them on to the selected basis vectors. However, keeping the maximum distribution variance or high reconstruction accuracy does not guarantee the optimal keeping of high speaker discriminative information. In this paper, we proposed a basis vector selection method based on mutual information concept which guarantees the keeping of high speaker discriminative information. The mutual information is used to measure the dependency between the features extracted using basis vectors and speaker class labels. The high mutual information related basis vectors are chosen for feature extraction. Considering one speaker feature may be encoded in more than one basis vectors, we proposed to use joint mutual information concept which takes the dependency between feature variables into consideration. Based on the selected basis vectors from DCT or PCA basis vector pool, we extracted features for speaker identification experiments. Experimental results showed that the speaker identification error rate using proposed feature was reduced 11% and 8% on average for DCT and PCA based features respectively.

引用

页码：1157 / 1160

页数：4

共 50 条

[11] Mutual Information Based Output Dimensionality Reduction
Pandey, Shishir
Vaze, Rahul
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 935 - 940
[12] Mutual Information-based Embedding Decoupling for Generalizable Speaker Verification
Li, Jianchen
Han, Jiqing
Deng, Shiwen
Zheng, Tieran
He, Yongjun
Zheng, Guibin
INTERSPEECH 2023, 2023, : 3147 - 3151
[13] Artifact Reduction Based on Mutual Information Measures
Wu, Ming-Te
2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1868 - 1871
[14] Fractal dimension applied to speaker identification
Petry, A
Barone, DAC
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 405 - 408
[15] Manifold learning based speaker dependent dimension reduction for robust text independent speaker verification
Zabihzadeh D.
Moattar M.H.
Zabihzadeh, D. (d.zabihzadeh@gmail.com), 1600, Kluwer Academic Publishers (17): : 271 - 280
[16] Direct estimation of the derivative of quadratic mutual information with application in supervised dimension reduction
2017, MIT Press Journals (29)
[17] Sufficient Dimension Reduction via Squared-Loss Mutual Information Estimation
Suzuki, Taiji
Sugiyama, Masashi
NEURAL COMPUTATION, 2013, 25 (03) : 725 - 758
[18] Direct Estimation of the Derivative of Quadratic Mutual Information with Application in Supervised Dimension Reduction
Tangkaratt, Voot
Sasaki, Hiroaki
Sugiyama, Masashi
NEURAL COMPUTATION, 2017, 29 (08) : 2076 - 2122
[19] Unsupervised Dimension Reduction via Least-Squares Quadratic Mutual Information
Sainui, Janya
Sugiyama, Masashi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (10) : 2806 - 2809
[20] Constrained Maximum Mutual Information Dimensionality Reduction for Language Identification
Huang, Shuai
Coppersmith, Glen A.
Karakos, Damianos
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2035 - 2038

← 1 2 3 4 5 →