A conservative feature subset selection algorithm with missing data

被引:17
|
作者
Aussem, Alex [1 ]
de Morais, Sergio Rodrigues [2 ]
机构
[1] Univ Lyon, LIESP, UCBL, F-69622 Villeurbanne, France
[2] Univ Lyon, LIESP, INSA Lyon, F-69622 Villeurbanne, France
关键词
Missing data; Feature selection; Bayesian networks; Markov boundary;
D O I
10.1016/j.neucom.2009.05.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel conservative feature subset selection method with incomplete data sets. The method is conservative in the sense that it selects the minimal subset of features that renders the rest of the features independent of the target (the class variable) without making any assumption about the missing data mechanism. This is achieved in the context of determining the Markov blanket of the target that reflects the worst-case assumption about the missing data mechanism, including the case when data are not missing at random. An application of the method on synthetic and real-world incomplete data is carried Out to illustrate its practical relevance. The method is compared against state-of-the-art approaches Such as the expectation-maximization (EM) algorithm and the available case technique. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:585 / 590
页数:6
相关论文
共 50 条
  • [1] A Conservative Feature Subset Selection Algorithm with Missing Data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 725 - 730
  • [2] Random feature subset selection for analysis of data with missing features
    DePasquale, Joseph
    Polikar, Robi
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 2378 - 2383
  • [3] THE FEATURE SUBSET SELECTION ALGORITHM
    Liu Yongguo Li Xueming Wu Zhongfu (Department of Computer Science and Engineering
    [J]. Journal of Electronics(China), 2003, (01) : 57 - 61
  • [4] Overview Of Feature Subset Selection Algorithm For High Dimensional Data
    Gandhi, Swati S.
    Prabhune, S. S.
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2017), 2017, : 618 - 623
  • [5] A Novel Scalable and Data Efficient Feature Subset Selection Algorithm
    de Morais, Sergio Rodrigues
    Aussem, Alex
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 298 - +
  • [6] Random feature subset selection for ensemble based classification of data with missing features
    DePasquale, Joseph
    Polikar, Robi
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2007, 4472 : 251 - +
  • [7] Feature Subset Selection within a Simulated Annealing Data Mining Algorithm
    Debuse J.C.W.
    Rayward-Smith V.J.
    [J]. Journal of Intelligent Information Systems, 1997, 9 (1) : 57 - 81
  • [8] Feature subset selection for data and feature streams: a review
    Carlos Villa-Blanco
    Concha Bielza
    Pedro Larrañaga
    [J]. Artificial Intelligence Review, 2023, 56 : 1011 - 1062
  • [9] A neuro fuzzy algorithm for feature subset selection
    Chakraborty, B
    Chakraborty, G
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2001, E84A (09): : 2182 - 2188
  • [10] Feature subset selection using a genetic algorithm
    Yang, JH
    Honavar, V
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (02): : 44 - 49