Low bias histogram-based estimation of mutual information for feature selection

被引:39
|
作者
Hacine-Gharbi, Abdenour [1 ,2 ]
Ravier, Philippe [1 ]
Harba, Rachid [1 ]
Mohamadi, Tayeb [3 ]
机构
[1] Univ Orleans, PRISME Lab, Orleans 2, France
[2] Univ Ctr Bordj Bou Arreridj, Dept Elect, LMSE Lab, Elanasser Bordj Bou Arre 34265, Algeria
[3] Ferhat Abbas Univ, Fac Technol, Dept Elect, Setif 19000, Algeria
关键词
Mutual information; Feature selection; Bias; Dimensionality reduction; Shannon entropy; Speech recognition; RELEVANCE;
D O I
10.1016/j.patrec.2012.02.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a low bias histogram-based estimation of mutual information and its application to feature selection problems. By canceling the first order bias, the estimation avoids the bias accumulation problem that affects classical methods. As a consequence, on a synthetic feature selection problem, only the proposed method results in the exact number of features to be chosen in the Gaussian case when compared to four other approaches. In a speech recognition application, the proposed method and the Sturges method are the only ones that lead to a correct number of selected features in the noise free case. In the reduced data case, only the proposed method points out the optimal number of features to select. Finally, in the noisy case, only the proposed method leads to results of high quality; other methods show severely underestimated numbers of selected features. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:1302 / 1308
页数:7
相关论文
共 50 条
  • [1] On Data-Driven Histogram-Based Estimation for Mutual Information
    Silva, Jorge
    Narayanan, Shrikanth S.
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2010, : 1423 - 1427
  • [2] Soft Feature Selection by Using a Histogram-Based Classifier
    Tenmoto, Hiroshi
    Kudo, Mineichi
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2008, 5342 : 572 - +
  • [3] FEATURE SELECTION BASED ON STATISTICAL ESTIMATION OF MUTUAL INFORMATION
    Kozhevin, A. A.
    [J]. SIBERIAN ELECTRONIC MATHEMATICAL REPORTS-SIBIRSKIE ELEKTRONNYE MATEMATICHESKIE IZVESTIYA, 2021, 18 : 720 - 728
  • [4] A new histogram-based estimation technique of entropy and mutual information using mean squared error minimization
    Hacine-Gharbi, A.
    Deriche, M.
    Ravier, P.
    Harba, R.
    Mohamadi, T.
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2013, 39 (03) : 918 - 933
  • [5] Mutual information for feature selection: estimation or counting?
    Nguyen H.B.
    Xue B.
    Andreae P.
    [J]. Evolutionary Intelligence, 2016, 9 (3) : 95 - 110
  • [6] Histogram-Based Estimation for the Divergence Revisited
    Silva, Jorge
    Narayanan, Shrikanth S.
    [J]. 2009 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, VOLS 1- 4, 2009, : 468 - +
  • [7] Histogram-Based Flash Channel Estimation
    Wang, Haobo
    Chen, Tsung-Yi
    Wesel, Richard D.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2015, : 283 - 288
  • [8] Using a Genetic Algorithm with Histogram-Based Feature Selection in Hyperspectral Image Classification
    Walton, Neil S.
    Sheppard, John W.
    Shaw, Joseph A.
    [J]. PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 1364 - 1372
  • [9] Nearest Neighbor For Histogram-based Feature Extraction
    Mohamad, F. S.
    Manaf, A. A.
    Chuprat, S.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS), 2011, 4 : 1296 - 1305
  • [10] A statistic to estimate the variance of the histogram-based mutual information estimator based on dependent pairs of observations
    Moddemeijer, R
    [J]. SIGNAL PROCESSING, 1999, 75 (01) : 51 - 63