A framework for cost-based feature selection

Cited by: 54
Authors
Bolon-Canedo, V. [1 ]
Porto-Diaz, I. [1 ]
Sanchez-Marono, N. [1 ]
Alonso-Betanzos, A. [1 ]
Affiliations
[1] Univ A Coruna, Lab Res & Dev Artificial Intelligence LIDIA, Dept Comp Sci, La Coruna 15071, Spain
Keywords
Cost-based feature selection; Machine learning; Filter methods; Neural networks
DOI
10.1016/j.patcog.2014.01.008
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Over the last few years, the dimensionality of datasets involved in data mining applications has increased dramatically. In this situation, feature selection becomes indispensable as it allows for dimensionality reduction and relevance detection. The research proposed in this paper broadens the scope of feature selection by taking into consideration not only the relevance of the features but also their associated costs. A new general framework is proposed, which consists of adding a new term to the evaluation function of a filter feature selection method so that the cost is taken into account. Although the proposed methodology could be applied to any feature selection filter, in this paper the approach is applied to two representative filter methods: Correlation-based Feature Selection (CFS) and Minimal-Redundancy-Maximal-Relevance (mRMR), as an example of use. The behavior of the proposed framework is tested on 17 heterogeneous classification datasets, employing a Support Vector Machine (SVM) as a classifier. The results of the experimental study show that the approach is sound and that it allows the user to reduce the cost without compromising the classification error. (C) 2014 Elsevier Ltd. All rights reserved.
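The abstract does not state the form of the added cost term. The following is a minimal sketch only, assuming the term is a penalty `lam * costs[j]` subtracted from an mRMR-style relevance-minus-redundancy score. The function name, the weight `lam`, and the use of absolute Pearson correlation in place of mutual information are all illustrative assumptions, not the authors' formulation.

```python
import numpy as np


def cost_aware_forward_selection(X, y, costs, lam=1.0, k=3):
    """Greedy forward selection with a cost-penalized, mRMR-style score.

    Assumed score for candidate j (hypothetical, for illustration):
        relevance(j) - mean redundancy(j, selected) - lam * costs[j]
    Constant columns are not handled (their correlation is undefined).
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    costs = np.asarray(costs, dtype=float)
    n_features = X.shape[1]

    # Relevance: absolute Pearson correlation of each feature with the target.
    relevance = np.array(
        [abs(np.corrcoef(X[:, j], y)[0, 1]) for j in range(n_features)]
    )
    # Redundancy: absolute pairwise correlation between features.
    redundancy = np.abs(np.corrcoef(X, rowvar=False))

    selected = []
    remaining = list(range(n_features))
    while remaining and len(selected) < k:
        def score(j):
            red = redundancy[j, selected].mean() if selected else 0.0
            return relevance[j] - red - lam * costs[j]

        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: feature 0 is the most relevant but expensive,
# feature 1 is slightly less relevant but cheap, feature 2 is noise.
rng = np.random.default_rng(0)
y = rng.normal(size=200)
X = np.column_stack([
    y + 0.1 * rng.normal(size=200),
    y + 0.5 * rng.normal(size=200),
    rng.normal(size=200),
])
costs = [1.0, 0.1, 0.1]

print(cost_aware_forward_selection(X, y, costs, lam=0.0, k=1))
print(cost_aware_forward_selection(X, y, costs, lam=1.0, k=1))
```

With `lam=0` the sketch reduces to a plain relevance/redundancy filter. Raising `lam` trades relevance for lower cost, which is the kind of trade-off the abstract describes.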
Pages: 2481-2489
Page count: 9