Efficient Feature Selection in the Presence of Multiple Feature Classes

Cited by: 3
Authors
Dhillon, Paramveer S. [1 ]
Foster, Dean [2 ]
Ungar, Lyle H. [1 ]
Affiliations
[1] Univ Penn, CIS, Philadelphia, PA 19104 USA
[2] Univ Penn, Stat, Philadelphia, PA 19104 USA
DOI
10.1109/ICDM.2008.56
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present an information-theoretic approach to feature selection for data whose features fall into feature classes. Feature classes are pervasive in real data. For example, in gene expression data, the genes that serve as features may be divided into classes based on their membership in gene families or pathways. In word sense disambiguation or named entity extraction, features fall into classes including adjacent words, their parts of speech, and the topic and venue of the document in which the word appears. When predictive features occur predominantly in a small number of feature classes, our information-theoretic approach significantly improves feature selection. Experiments on real and synthetic data demonstrate substantial improvement in predictive accuracy over the standard L0 penalty-based stepwise and streamwise feature selection methods, as well as over Lasso and Elastic Nets, all of which are oblivious to the existence of feature classes.
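The abstract does not spell out the coding scheme, so the following is only an illustrative sketch of why class awareness can help, assuming a description-length-style penalty in which naming a new feature costs bits. The function names, the 1-bit "reuse a class" flag, and the toy class sizes are all hypothetical and should not be read as the authors' method.

```python
import math


def oblivious_cost_bits(class_sizes):
    """Bits to name one feature when classes are ignored (assumed uniform code)."""
    total_features = sum(class_sizes.values())
    return math.log2(total_features)


def class_aware_cost_bits(feature_class, class_sizes, used_classes):
    """Hypothetical class-aware cost: name the class, then the feature inside it."""
    if not used_classes:
        # No class is in the model yet: choose among all classes, no flag needed.
        class_bits = math.log2(len(class_sizes))
    elif feature_class in used_classes:
        # 1 flag bit ("reuse a class") plus a choice among the classes already used.
        class_bits = 1 + math.log2(len(used_classes))
    else:
        # 1 flag bit ("new class") plus a choice among the classes not yet used.
        class_bits = 1 + math.log2(len(class_sizes) - len(used_classes))
    within_bits = math.log2(class_sizes[feature_class])
    return class_bits + within_bits


# Toy setting (assumed): 64 classes with 16 features each, 1024 features in total.
class_sizes = {f"class_{k}": 16 for k in range(64)}

flat = oblivious_cost_bits(class_sizes)                                   # 10.0 bits
first = class_aware_cost_bits("class_0", class_sizes, set())              # 10.0 bits
reuse = class_aware_cost_bits("class_0", class_sizes, {"class_0"})        # 5.0 bits
new_class = class_aware_cost_bits("class_1", class_sizes, {"class_0"})    # about 10.98 bits
```

Under these assumptions, a second feature drawn from an already-selected class costs about half as many bits as it would under a class-oblivious code, so a selection rule that trades coding cost against fit would admit it more readily. This is the kind of situation the abstract describes, where predictive features concentrate in a few classes.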
Pages: 779 / +
Page count: 2