Dimension reduction for speaker identification based on mutual information

Authors
Lu, Xugang [1]
Dang, Jianwu [1]
Affiliations
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa 9231292, Japan
Keywords
speaker identification; mutual information; feature selection
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dimension reduction is a necessary step in speech feature extraction for a speaker identification system. The Discrete Cosine Transform (DCT) and Principal Component Analysis (PCA) are widely used for this purpose. Basis vectors that contribute most to the distribution variance or the reconstruction accuracy of the speech data set are chosen from the DCT or PCA basis vector pool, and the data set is transformed by projecting it onto the selected basis vectors. However, keeping maximum distribution variance or high reconstruction accuracy does not guarantee that speaker-discriminative information is optimally preserved. In this paper, we propose a basis vector selection method based on mutual information which guarantees that highly speaker-discriminative information is kept. Mutual information is used to measure the dependency between the features extracted with the basis vectors and the speaker class labels, and the basis vectors with high mutual information are chosen for feature extraction. Because one speaker feature may be encoded in more than one basis vector, we propose to use joint mutual information, which takes the dependency between feature variables into consideration. Using the basis vectors selected from the DCT or PCA basis vector pool, we extracted features for speaker identification experiments. Experimental results showed that the speaker identification error rate with the proposed features was reduced by 11% and 8% on average for DCT-based and PCA-based features, respectively.
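The selection idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the histogram (plug-in) estimator, the bin count, the toy two-speaker data, and all function names (`dct_basis`, `rank_basis_by_mi`, `joint_mutual_information`, and so on) are assumptions made for illustration. The sketch projects frames onto each DCT basis vector, quantizes the result, and ranks the basis vectors by their estimated mutual information with the speaker labels.

```python
import math
import random
from collections import Counter


def dct_basis(n):
    """Return the n orthonormal DCT-II basis vectors of length n."""
    basis = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        basis.append([scale * math.cos(math.pi * (i + 0.5) * k / n)
                      for i in range(n)])
    return basis


def project(frames, vec):
    """Project every frame onto one basis vector (one scalar per frame)."""
    return [sum(x * b for x, b in zip(frame, vec)) for frame in frames]


def quantize(values, n_bins=8):
    """Map real values to equal-width bin indices (assumed discretization)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits for two discrete sequences."""
    n = len(xs)
    pxy, px, py = Counter(zip(xs, ys)), Counter(xs), Counter(ys)
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


def joint_mutual_information(x1, x2, ys):
    """I((X1,X2);Y): treat the pair of quantized features as one variable."""
    return mutual_information(list(zip(x1, x2)), ys)


def rank_basis_by_mi(frames, labels, basis, n_bins=8):
    """Return basis indices sorted by decreasing MI with the labels."""
    scores = [mutual_information(quantize(project(frames, b), n_bins), labels)
              for b in basis]
    order = sorted(range(len(basis)), key=lambda k: scores[k], reverse=True)
    return order, scores


# Toy data (hypothetical): two "speakers" that differ only along basis vector 3.
rng = random.Random(0)
basis = dct_basis(8)
frames, labels = [], []
for speaker, sign in ((0, -1.0), (1, 1.0)):
    for _ in range(200):
        noise = [rng.gauss(0.0, 1.0) for _ in range(8)]
        frames.append([v + sign * 3.0 * b for v, b in zip(noise, basis[3])])
        labels.append(speaker)

order, scores = rank_basis_by_mi(frames, labels, basis)
q3 = quantize(project(frames, basis[3]))
q0 = quantize(project(frames, basis[0]))
joint = joint_mutual_information(q3, q0, labels)
```

On this toy data the discriminative direction is ranked first even though every direction carries the same noise variance, which is the gap between variance-based and label-aware selection that the abstract points out. The pairwise score in `joint_mutual_information` is only one simple way to account for dependency between features; with many features or bins it needs far more data than the single-feature score.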
Pages: 1157-1160
Page count: 4