Dimension reduction for speaker identification based on mutual information

Cited by: 0
Authors
Lu, Xugang [1]
Dang, Jianwu [1]
Affiliations
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Nomi, Ishikawa 9231292, Japan
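Keywords
speaker identification; mutual information; feature selection
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract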
Dimension reduction is a necessary step in speech feature extraction for speaker identification systems. The Discrete Cosine Transform (DCT) and Principal Component Analysis (PCA) are widely used for this purpose. Basis vectors that contribute most to the distribution variance or the reconstruction accuracy of the speech data set are chosen from the DCT or PCA basis vector pool, and the data are transformed by projecting them onto the selected basis vectors. However, preserving maximum distribution variance or high reconstruction accuracy does not guarantee that speaker-discriminative information is optimally preserved. In this paper, we propose a basis vector selection method based on mutual information that guarantees highly speaker-discriminative information is retained. Mutual information is used to measure the dependency between the features extracted with the basis vectors and the speaker class labels, and the basis vectors with high mutual information are chosen for feature extraction. Because a single speaker feature may be encoded in more than one basis vector, we further propose using joint mutual information, which takes the dependency between feature variables into account. Using the basis vectors selected from the DCT or PCA basis vector pool, we extracted features for speaker identification experiments. Experimental results show that the proposed features reduced the speaker identification error rate by 11% and 8% on average for DCT-based and PCA-based features, respectively.
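The selection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a simple histogram estimate of mutual information, scores each basis vector independently (the joint mutual information variant is not covered), and uses hypothetical toy data in which two speakers differ only along DCT basis vector 3 while most of the variance lies along basis vector 1. All function names and parameters are illustrative assumptions.

```python
import math
import random
from collections import Counter


def dct_basis(n):
    """Return the n unnormalized DCT-II basis vectors (rows) for length-n frames."""
    return [[math.cos(math.pi * (t + 0.5) * k / n) for t in range(n)]
            for k in range(n)]


def project(frames, basis):
    """Project each frame onto every basis vector (plain dot products)."""
    return [[sum(b * x for b, x in zip(vec, frame)) for vec in basis]
            for frame in frames]


def mutual_information(values, labels, bins=8):
    """Histogram estimate of I(feature; label) in bits (an assumed estimator)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0  # avoid a zero width for constant features
    cells = [min(int((v - lo) / width), bins - 1) for v in values]
    n = len(values)
    joint = Counter(zip(cells, labels))
    p_cell, p_label = Counter(cells), Counter(labels)
    return sum((c / n) * math.log2(c * n / (p_cell[a] * p_label[b]))
               for (a, b), c in joint.items())


def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def select_basis(frames, labels, basis, keep):
    """Rank basis vectors by MI with the labels; return the top `keep` indices."""
    feats = project(frames, basis)
    scores = [mutual_information([f[k] for f in feats], labels)
              for k in range(len(basis))]
    return sorted(range(len(basis)), key=lambda k: scores[k], reverse=True)[:keep]


def select_by_variance(frames, basis, keep):
    """Baseline: rank basis vectors by the variance of the projected feature."""
    feats = project(frames, basis)
    scores = [variance([f[k] for f in feats]) for k in range(len(basis))]
    return sorted(range(len(basis)), key=lambda k: scores[k], reverse=True)[:keep]


def make_toy_frames(n_per_speaker=200, n=8, seed=0):
    """Hypothetical data: two speakers that differ only along DCT vector 3."""
    rng = random.Random(seed)
    basis = dct_basis(n)
    frames, labels = [], []
    for speaker, sign in ((0, 1.0), (1, -1.0)):
        for _ in range(n_per_speaker):
            a = rng.gauss(0.0, 5.0)         # large variance shared by both speakers
            s = sign + rng.gauss(0.0, 0.2)  # small speaker-specific component
            frames.append([a * basis[1][t] + s * basis[3][t] + rng.gauss(0.0, 0.05)
                           for t in range(n)])
            labels.append(speaker)
    return frames, labels


if __name__ == "__main__":
    frames, labels = make_toy_frames()
    basis = dct_basis(8)
    print("top by variance:", select_by_variance(frames, basis, keep=1))
    print("top by mutual information:", select_basis(frames, labels, basis, keep=1))
```

On this toy data the variance criterion should favor the high-energy direction that carries no speaker information, while the mutual information criterion should favor the direction that separates the two speakers. That is the gap between variance preservation and discriminative information the abstract describes.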
Pages: 1157-1160
Number of pages: 4