Dimension reduction for speaker identification based on mutual information

Cited by: 0
Authors
Lu, Xugang [1]
Dang, Jianwu [1]
Affiliations
[1] Japan Advanced Institute of Science and Technology, School of Information Science, Nomi, Ishikawa 9231292, Japan
Keywords
speaker identification; mutual information; feature selection
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dimension reduction is a necessary step in speech feature extraction for a speaker identification system. The Discrete Cosine Transform (DCT) and Principal Component Analysis (PCA) are widely used for this purpose: basis vectors that contribute most to the distribution variance or the reconstruction accuracy of the speech data set are chosen from the DCT or PCA basis vector pool, and the data are projected onto the selected basis vectors. However, preserving maximum distribution variance or high reconstruction accuracy does not guarantee that speaker-discriminative information is optimally preserved. In this paper, we propose a basis vector selection method based on mutual information that guarantees that highly speaker-discriminative information is kept. Mutual information measures the dependency between the features extracted with the basis vectors and the speaker class labels, and the basis vectors with high mutual information are chosen for feature extraction. Because a single speaker feature may be encoded in more than one basis vector, we further propose using joint mutual information, which takes the dependency between feature variables into account. Using basis vectors selected from the DCT or PCA basis vector pool, we extracted features for speaker identification experiments. The results show that the proposed features reduced the speaker identification error rate by 11% and 8% on average for DCT-based and PCA-based features, respectively.
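The contrast the abstract draws between variance-based and mutual-information-based basis selection can be illustrated with a small sketch. This is not the authors' implementation: the synthetic two-speaker data, the histogram estimator, the bin count, and every function name below are assumptions made for illustration. It also ranks each basis vector by its individual mutual information with the label, and does not attempt the joint mutual information criterion the abstract describes.

```python
import math
import random
from collections import Counter

def dct_basis(n):
    """Orthonormal DCT-II basis: row k is the k-th basis vector of length n."""
    basis = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        basis.append([scale * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)])
    return basis

def project(frame, basis):
    """Coefficients of one frame on every basis vector (dot products)."""
    return [sum(x * b for x, b in zip(frame, vec)) for vec in basis]

def mutual_information(values, labels, n_bins=8):
    """Histogram estimate of I(feature; label) in bits (assumed estimator)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins or 1.0
    bins = [min(int((v - lo) / width), n_bins - 1) for v in values]
    n = len(values)
    joint, n_bin, n_lab = Counter(zip(bins, labels)), Counter(bins), Counter(labels)
    # p(b,l) * log2( p(b,l) / (p(b) p(l)) ), written with raw counts.
    return sum((c / n) * math.log2(c * n / (n_bin[b] * n_lab[l]))
               for (b, l), c in joint.items())

# Hypothetical data: two "speakers" that differ only in DCT coefficient 5,
# which carries little variance compared with coefficients 0 to 4.
random.seed(0)
N, DISCRIMINATIVE = 8, 5
basis = dct_basis(N)
sigmas = [10.0, 6.0, 4.0, 3.0, 3.0, 1.0, 1.0, 1.0]
frames, labels = [], []
for speaker in (0, 1):
    for _ in range(1000):
        coef = [random.gauss(0.0, s) for s in sigmas]
        coef[DISCRIMINATIVE] += 2.0 if speaker else -2.0
        # Synthesize the frame from its coefficients (inverse of the orthonormal DCT).
        frames.append([sum(c * vec[i] for c, vec in zip(coef, basis)) for i in range(N)])
        labels.append(speaker)

columns = list(zip(*[project(f, basis) for f in frames]))
variance = [sum(v * v for v in col) / len(col) - (sum(col) / len(col)) ** 2
            for col in columns]
mi = [mutual_information(col, labels) for col in columns]
by_variance = sorted(range(N), key=lambda k: variance[k], reverse=True)
by_mi = sorted(range(N), key=lambda k: mi[k], reverse=True)
print("top 3 basis vectors by variance:", by_variance[:3])
print("top 3 basis vectors by mutual information:", by_mi[:3])
```

In this toy setup the variance ranking favors the high-energy coefficients and leaves out the one that separates the two classes, while the mutual information ranking places it first. Real speech features would need a more careful estimator and, as the abstract notes, a joint criterion to account for dependency between features.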
Pages: 1157-1160
Number of pages: 4
Related papers
50 in total
  • [21] STUDY OF MUTUAL INFORMATION FOR SPEAKER RECOGNITION FEATURES
    Garcia, Guillermo
    Eriksson, Thomas
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 601 - 605
  • [22] Mutual Information Adaptive Estimation for Speaker Verification
    Chen C.
    Ji C.
    Li W.
    Chen D.
    Wang L.
    Yang H.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2023, 52 (01): : 125 - 131
  • [23] Mutual Information Enhanced Training for Speaker Embedding
    Tu, Youzhi
    Mak, Man-Wai
    INTERSPEECH 2021, 2021, : 91 - 95
  • [24] Regional mutual information-based identification and reduction of flicker artifacts during video encoding
    Kumar, Vinay
    SIGNAL IMAGE AND VIDEO PROCESSING, 2017, 11 (04) : 621 - 628
  • [25] Regional mutual information-based identification and reduction of flicker artifacts during video encoding
    Vinay Kumar
    Signal, Image and Video Processing, 2017, 11 : 621 - 628
  • [26] Cost Minimization Attribute Reduction Based on Mutual Information
    Xu, Feifei
    Bi, Zhongqin
    Lei, Jingsheng
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 215 - 219
  • [27] Mutual Information Based Feature Selection for Fingerprint Identification
    Adjimi, Ahlem
    Hacine-Gharbi, Abdenour
    Ravier, Philippe
    Mostefai, Messaoud
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2019, 43 (02): : 187 - 198
  • [28] On maximum mutual information speaker-adapted training
    McDonough, John
    Woelfel, Matthias
    Stoimenov, Emilian
    COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 130 - 147
  • [29] Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data
    Fang, Liying
    Zhao, Han
    Wang, Pu
    Yu, Mingwei
    Yan, Jianzhuo
    Cheng, Wenshuai
    Chen, Peiyu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 21 : 82 - 89
  • [30] On maximum mutual information speaker-adapted training
    McDonough, J
    Schaaf, T
    Waibel, A
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 601 - 604