Feature construction as a bi-level optimization problem

被引:9
|
作者
Hammami, Marwa [1 ]
Bechikh, Slim [1 ,3 ]
Louati, Ali [2 ]
Makhlouf, Mohamed [4 ]
Ben Said, Lamjed [1 ]
机构
[1] Univ Tunis, SMART Lab, ISG Campus, Tunis, Tunisia
[2] Prince Sattam bin Abdulaziz Univ, Informat Syst Dept, Alkharj 11942, Saudi Arabia
[3] Kennesaw State Univ, LMVSR, Kennesaw, GA 30144 USA
[4] Kedge Business Sch, Talence, France
来源
NEURAL COMPUTING & APPLICATIONS | 2020年 / 32卷 / 17期
关键词
Feature construction; Data classification; Bi-level optimization; Evolutionary algorithms; WRAPPER EVOLUTIONARY APPROACH; MULTIPLE-FEATURE CONSTRUCTION; FEATURE-SELECTION; CLASSIFICATION;
D O I
10.1007/s00521-020-04784-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection and construction are important preprocessing techniques in data mining. They allow not only dimensionality reduction but also classification accuracy and efficiency improvement. While feature selection consists in selecting a subset of relevant feature from the original feature set, feature construction corresponds to the generation of new high-level features, called constructed features, where each one of them is a combination of a subset of original features. Based on these definitions, feature construction could be seen as a bi-level optimization problem where the feature subset should be defined first and then the corresponding (near) optimal combination of the selected features should be found. Motivated by this observation, we propose, in this paper, a bi-level evolutionary approach for feature construction. The basic idea of our algorithm, named bi-level feature construction genetic algorithm (BFC-GA), is to evolve an upper-level population for the task of feature selection, while optimizing the feature combinations at the lower level by evolving a follower population. It is worth noting that for each upper-level individual (feature subset), a whole lower-level population is optimized to find the corresponding (near) optimal feature combination (constructed feature). In this way, BFC-GA would be able to output a set of optimized constructed features that could be very informative to the considered classifier. A detailed experimental study has been conducted on a set of commonly used datasets with varying dimensions. The statistical analysis of the obtained results shows the competitiveness and the outperformance of our bi-level feature construction approach with respect to many state-of-the-art algorithms.
引用
收藏
页码:13783 / 13804
页数:22
相关论文
共 50 条
  • [31] A bi-level emergency evacuation traffic optimization model for urban evacuation problem
    Liu, Yanyue
    Zhang, Zhao
    Mo, Lei
    Yu, Bin
    Li, Zhenhua
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2024,
  • [32] Approximation Algorithms for a Bi-level Knapsack Problem
    Chen, Lin
    Zhang, Guochuan
    [J]. COMBINATORIAL OPTIMIZATION AND APPLICATIONS, 2011, 6831 : 399 - 410
  • [33] A bi-level maximal covering location problem
    Martha-Selene Casas-Ramírez
    José-Fernando Camacho-Vallejo
    Juan A. Díaz
    Dolores E. Luna
    [J]. Operational Research, 2020, 20 : 827 - 855
  • [34] Android malware detection as a Bi-level problem
    Jerbi, Manel
    Dagdia, Zaineb Chelly
    Bechikh, Slim
    Ben Said, Lamjed
    [J]. COMPUTERS & SECURITY, 2022, 121
  • [35] Approximation algorithms for a bi-level knapsack problem
    Chen, Lin
    Zhang, Guochuan
    [J]. THEORETICAL COMPUTER SCIENCE, 2013, 497 : 1 - 12
  • [36] A bi-level maximal covering location problem
    Casas-Ramirez, Martha-Selene
    Camacho-Vallejo, Jose-Fernando
    Diaz, Juan A.
    Luna, Dolores E.
    [J]. OPERATIONAL RESEARCH, 2020, 20 (02) : 827 - 855
  • [37] Bi-level ensemble method for unsupervised feature selection
    Zhou, Peng
    Wang, Xia
    Du, Liang
    [J]. INFORMATION FUSION, 2023, 100
  • [38] A bi-level optimization model for technology selection
    Aviso, Kathleen B.
    Chiu, Anthony S. F.
    Ubando, Aristotle T.
    Tan, Raymond R.
    [J]. JOURNAL OF INDUSTRIAL AND PRODUCTION ENGINEERING, 2021, 38 (08) : 573 - 580
  • [39] Model building using bi-level optimization
    G. K. D. Saharidis
    I. P. Androulakis
    M. G. Ierapetritou
    [J]. Journal of Global Optimization, 2011, 49 : 49 - 67
  • [40] Bi-level optimization in papermaking process design
    Linnala, Mikko
    Hamalainen, Jari
    [J]. NORDIC PULP & PAPER RESEARCH JOURNAL, 2012, 27 (04) : 774 - 782