Informative variable identifier: Expanding interpretability in feature selection

被引:22
|
作者
Munoz-Romero, Sergio [1 ,3 ]
Gorostiaga, Arantza [2 ]
Soguero-Ruiz, Cristina [1 ]
Mora-Jimenez, Inmaculada [1 ]
Rojo-Alvarez, Jose Luis [1 ,3 ]
机构
[1] Univ Rey Juan Carlos, Dept Signal Theory & Commun, Madrid 28933, Spain
[2] Univ Basque Country, UPV EHU, Dept Fdn Econ Anal 2, Bilbao, Spain
[3] Univ Politecn Madrid, Ctr Computat Simulat, Boadilla Del Monte, Spain
关键词
Feature selection; Interpretability; Explainable machine learning; Resampling; Classification; FRAMEWORK;
D O I
10.1016/j.patcog.2019.107077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is nowadays an increasing interest in discovering relationships among input variables (also called features) from data to provide better interpretability, which yield more confidence in the solution and provide novel insights about the nature of the problem at hand. We propose a novel feature selection method, called Informative Variable Identifier (IVI), capable of identifying the informative variables and their relationships. It transforms the input-variable space distribution into a coefficient-feature space using existing linear classifiers or a more efficient weight generator that we also propose, Covariance Multiplication Estimator (CME). Informative features and their relationships are determined analyzing the joint distribution of these coefficients with resampling techniques. IVI and CME select the informative variables and then pass them on to any linear or nonlinear classifier. Experiments show that the proposed approach can outperform state-of-art algorithms in terms of feature identification capabilities, and even in classification performance when subsequent classifiers are used. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Sorting variables by using informative vectors as a strategy for feature selection in multivariate regression
    Teofilo, Reinaldo F.
    Martins, Joao Paulo A.
    Ferreira, Marcia M. C.
    [J]. JOURNAL OF CHEMOMETRICS, 2009, 23 (1-2) : 32 - 48
  • [42] A variable informative criterion based on weighted voting strategy combined with LASSO for variable selection in multivariate calibration
    Zhang, Ruoqiu
    Zhang, Feiyu
    Chen, Wanchao
    Xiong, Qin
    Chen, Zengkai
    Yao, Heming
    Ge, Jiong
    Hu, Yun
    Du, Yiping
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 184 : 132 - 141
  • [43] An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
    Peng, Bin
    Zhu, Dianwen
    Ander, Bradley P.
    Zhang, Xiaoshuai
    Xue, Fuzhong
    Sharp, Frank R.
    Yang, Xiaowei
    [J]. PLOS ONE, 2013, 8 (07):
  • [44] Variable selection in semiparametric regression models for longitudinal data with informative observation times
    Jazi, Omidali Aghababaei
    Pullenayegum, Eleanor
    [J]. STATISTICS IN MEDICINE, 2022, 41 (17) : 3281 - 3298
  • [45] Input variable selection for feature extraction in classification problems
    Choi, Sang-Il
    Oh, Jiyong
    Choi, Chong-Ho
    Kim, Chunghoon
    [J]. SIGNAL PROCESSING, 2012, 92 (03) : 636 - 648
  • [46] Feature–granularity selection with variable costs for hybrid data
    Shujiao Liao
    Qingxin Zhu
    Yuhua Qian
    [J]. Soft Computing, 2019, 23 : 13105 - 13126
  • [47] On the use of variable complementarity for feature selection in cancer classification
    Meyer, Patrick E.
    Bontempi, Gianluca
    [J]. APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2006, 3907 : 91 - 102
  • [48] Feature extraction of weighted data for implicit variable selection
    Sanchez, Luis
    Martinez, Fernando
    Castellanos, German
    Salazar, Augusto
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 840 - 847
  • [49] Variable Length Black Hole for Optimization and Feature Selection
    Qadir, Tara Othman
    Fuad, Norfaiza
    Taujuddin, Nik Shahidah Afifi Md
    [J]. IEEE ACCESS, 2022, 10 : 63855 - 63866
  • [50] CircRNA identification and feature interpretability analysis
    Mengting Niu
    Chunyu Wang
    Yaojia Chen
    Quan Zou
    Ren Qi
    Lei Xu
    [J]. BMC Biology, 22