A framework for cost-based feature selection

被引:54
|
作者
Bolon-Canedo, V. [1 ]
Porto-Diaz, I. [1 ]
Sanchez-Marono, N. [1 ]
Alonso-Betanzos, A. [1 ]
机构
[1] Univ A Coruna, Lab Res & Dev Artificial Intelligence LIDIA, Dept Comp Sci, La Coruna 15071, Spain
关键词
Cost-based feature selection; Machine learning; Filter methods; NEURAL-NETWORKS;
D O I
10.1016/j.patcog.2014.01.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last few years, the dimensionality of datasets involved in data mining applications has increased dramatically. In this situation, feature selection becomes indispensable as it allows for dimensionality reduction and relevance detection. The research proposed in this paper broadens the scope of feature selection by taking into consideration not only the relevance of the features but also their associated costs. A new general framework is proposed, which consists of adding a new term to the evaluation function of a filter feature selection method so that the cost is taken into account. Although the proposed methodology could be applied to any feature selection filter, in this paper the approach is applied to two representative filter methods: Correlation-based Feature Selection (CFS) and Minimal-Redundancy-Maximal-Relevance (mRMR), as an example of use. The behavior of the proposed framework is tested on 17 heterogeneous classification datasets, employing a Support Vector Machine (SVM) as a classifier. The results of the experimental study show that the approach is sound and that it allows the user to reduce the cost without compromising the classification error. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2481 / 2489
页数:9
相关论文
共 50 条
  • [41] Cost-based QoS routing
    Chu, J
    Lea, CT
    Wong, A
    [J]. ICCCN 2003: 12TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, PROCEEDINGS, 2003, : 485 - 490
  • [42] Cost-Based California Effects
    Frankenreiter, Jens
    [J]. YALE JOURNAL ON REGULATION, 2022, 39 (03): : 1155 - 1217
  • [43] Regression with Cost-based Rejection
    Cheng, Xin
    Cao, Yuzhou
    Wang, Haobo
    Wei, Hongxin
    An, Bo
    Feng, Lei
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [44] CSAS: Cost-Based Storage Auto-Selection, a Fine Grained Storage Selection Mechanism for Spark
    Wang, Bo
    Tang, Jie
    Zhang, Rui
    Gu, Zhimin
    [J]. NETWORK AND PARALLEL COMPUTING (NPC 2017), 2017, 10578 : 150 - 154
  • [45] Cost-Based Predictive Spatiotemporal Join
    Han, Wook-Shin
    Kim, Jaehwa
    Lee, Byung Suk
    Tao, Yufei
    Rantzau, Ralf
    Markl, Volker
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (02) : 220 - 233
  • [46] Cost-based, integrated design optimization
    Azhar Iqbal
    Jorn S. Hansen
    [J]. Structural and Multidisciplinary Optimization, 2006, 32 : 447 - 461
  • [47] Cost-based Query Optimization for XPath
    Li, Dong
    Chen, Wenhao
    Liang, Xiaochong
    Guan, Jida
    Xu, Yang
    Lu, Xiuyu
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2014, 8 (04): : 1935 - 1948
  • [48] Cost-based tolerancing of optical systems
    Youngworth, Richard N.
    Stone, Bryan D.
    [J]. Optics and Photonics News, 2000, 11 (12):
  • [49] COST-BASED ABDUCTION AND MAP EXPLANATION
    CHARNIAK, E
    SHIMONY, SE
    [J]. ARTIFICIAL INTELLIGENCE, 1994, 66 (02) : 345 - 374
  • [50] COST-BASED ACTIVITIES A FOCUS COST BENEFIT FOR ORGANIZATIONS
    Manchay Reyes, Gina Judith
    Herrera Freire, Alex Humberto
    Ruiz Cueva, Mayra Beatriz
    [J]. REVISTA UNIVERSIDAD Y SOCIEDAD, 2019, 11 (05): : 243 - 248