Dimension reduction for speaker identification based on mutual information

Authors
Lu, Xugang [1]
Dang, Jianwu [1]
Affiliations
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa 9231292, Japan
Keywords
speaker identification; mutual information; feature selection
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Dimension reduction is a necessary step in speech feature extraction for a speaker identification system. The Discrete Cosine Transform (DCT) and Principal Component Analysis (PCA) are widely used for this purpose. The data set is projected onto basis vectors chosen from the DCT or PCA basis vector pool, namely those that contribute most to the distribution variance or to the reconstruction accuracy of the speech data. However, preserving maximum variance or high reconstruction accuracy does not guarantee that speaker-discriminative information is optimally preserved. In this paper, we propose a basis vector selection method based on mutual information that guarantees highly speaker-discriminative information is retained. Mutual information measures the dependency between the features extracted with the basis vectors and the speaker class labels, and the basis vectors with high mutual information are chosen for feature extraction. Because one speaker feature may be encoded in more than one basis vector, we further propose using joint mutual information, which takes the dependency between feature variables into account. Using basis vectors selected from the DCT or PCA basis vector pool, we extracted features for speaker identification experiments. Experimental results show that the proposed features reduced the speaker identification error rate by 11% and 8% on average for DCT-based and PCA-based features, respectively.
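The selection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the histogram (plug-in) mutual information estimator, the bin counts, the greedy joint search, the synthetic two-speaker data, and every function name below are assumptions made for illustration only.

```python
import math
import random
from collections import Counter

def dct_basis(n):
    """Return the n orthonormal DCT-II basis vectors of length n."""
    basis = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        basis.append([scale * math.cos(math.pi * (i + 0.5) * k / n)
                      for i in range(n)])
    return basis

def project(frames, vec):
    """Project every frame onto one basis vector (one scalar per frame)."""
    return [sum(x * b for x, b in zip(frame, vec)) for frame in frames]

def quantize(values, bins):
    """Map real values to equal-width bin indices (assumed discretization)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0
    return [min(int((v - lo) / width), bins - 1) for v in values]

def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits for two discrete sequences."""
    n = len(xs)
    pxy, px, py = Counter(zip(xs, ys)), Counter(xs), Counter(ys)
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())

def rank_basis_by_mi(frames, labels, basis, bins=8):
    """Score each basis vector by I(feature; speaker label), best first."""
    scores = [(mutual_information(quantize(project(frames, v), bins), labels), k)
              for k, v in enumerate(basis)]
    return sorted(scores, reverse=True)

def greedy_joint_mi(frames, labels, basis, n_select, bins=4):
    """Greedily add the basis vector that maximizes the joint MI
    I((selected features, candidate); label), with tuples as joint symbols."""
    feats = [quantize(project(frames, v), bins) for v in basis]
    selected, joint = [], [()] * len(frames)
    for _ in range(n_select):
        best = max((k for k in range(len(basis)) if k not in selected),
                   key=lambda k: mutual_information(
                       [j + (f,) for j, f in zip(joint, feats[k])], labels))
        selected.append(best)
        joint = [j + (f,) for j, f in zip(joint, feats[best])]
    return selected

def variance(values):
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)

# Synthetic data (hypothetical): coefficient 0 carries large variance shared by
# both speakers, coefficient 3 carries the speaker difference.
n = 8
basis = dct_basis(n)
rng = random.Random(0)
frames, labels = [], []
for speaker in (0, 1):
    for _ in range(200):
        coeffs = [rng.gauss(0.0, 0.1) for _ in range(n)]
        coeffs[0] = rng.gauss(0.0, 5.0)
        coeffs[3] = rng.gauss(2.0 if speaker else -2.0, 0.5)
        frames.append([sum(coeffs[k] * basis[k][i] for k in range(n))
                       for i in range(n)])
        labels.append(speaker)

ranked = rank_basis_by_mi(frames, labels, basis)
top_variance = max(range(n), key=lambda k: variance(project(frames, basis[k])))
selected = greedy_joint_mi(frames, labels, basis, n_select=2)
print("highest-variance basis:", top_variance)
print("highest-MI basis:", ranked[0][1])
print("greedy joint-MI selection:", selected)
```

In this toy setup the variance criterion favors the basis vector that carries no speaker information, while the mutual information criterion favors the one that separates the two speakers. The plug-in estimator is biased upward when there are many bins and few frames, so a real system would likely need a more careful estimator.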
Pages: 1157-1160
Number of pages: 4
Related papers
50 in total
  • [1] Speaker identification based on complete feature corpus and evaluation of mutual information
    YU Yibiao WANG Shuozhong (School of Electronic Information Engineering
    Chinese Journal of Acoustics, 2005, (03) : 280 - 288
  • [2] Local fuzzy PCA based GMM with dimension reduction on speaker identification
    Lee, KY
    PATTERN RECOGNITION LETTERS, 2004, 25 (16) : 1811 - 1817
  • [3] Selection of the Best Wavelet Packet Nodes Based on Mutual Information for Speaker Identification
    Fernandez, Rafael
    Montalvo, Ana
    Calvo, Jose R.
    Hernandez, Gabriel
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2008, 5197 : 78 - 85
  • [4] Application of the mutual information minimization to speaker recognition/identification improvement
    Sole-Casals, Jordi
    Faundez-Zanuy, Marcos
    NEUROCOMPUTING, 2006, 69 (13-15) : 1467 - 1474
  • [5] Dimension Compactness in Speaker Identification
    Kanrar, Soumen
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [6] Learning Speaker Representations with Mutual Information
    Ravanelli, Mirco
    Bengio, Yoshua
    INTERSPEECH 2019, 2019, : 1153 - 1157
  • [7] Dimension Reduction Approaches for SVM based Speaker Age Estimation
    Dobry, Gil
    Hecht, Ron M.
    Avigal, Mireille
    Zigel, Yaniv
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1999 - +
  • [8] Influencer identification of dynamical networks based on an information entropy dimension reduction method
    Duan, Dong-Li
    Ji, Si-Yuan
    Yuan, Zi-Wei
    CHINESE PHYSICS B, 2024, 33 (04)
  • [9] Influencer identification of dynamical networks based on an information entropy dimension reduction method
    段东立
    纪思源
    袁紫薇
    Chinese Physics B, 2024, 33 (04) : 170 - 179
  • [10] MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA
    Vijayasenan, Deepu
    Valente, Fabio
    Bourlard, Herve
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4065 - 4068