A Length-Adaptive Non-Dominated Sorting Genetic Algorithm for Bi-Objective High-Dimensional Feature Selection

Cited by: 17
Authors
Gong, Yanlu [1 ]
Zhou, Junhai [1 ]
Wu, Quanwang [1 ]
Zhou, MengChu [2 ]
Wen, Junhao [3 ]
Affiliations
[1] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[2] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
[3] Chongqing Univ, Coll Big Data & Software Engn, Chongqing 400044, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bi-objective optimization; feature selection (FS); genetic algorithm; high-dimensional data; length-adaptive; multiobjective feature selection; differential evolution; search
DOI
10.1109/JAS.2023.123648
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
As a crucial data preprocessing method in data mining, feature selection (FS) can be regarded as a bi-objective optimization problem that aims to maximize classification accuracy and minimize the number of selected features. Evolutionary computing (EC) is promising for FS owing to its powerful search capability. However, in traditional EC-based methods, feature subsets are represented via a length-fixed individual encoding. This encoding is ineffective for high-dimensional data, because it results in a huge search space and prohibitive training time. This work proposes a length-adaptive non-dominated sorting genetic algorithm (LA-NSGA) with a length-variable individual encoding and a length-adaptive evolution mechanism for bi-objective high-dimensional FS. In LA-NSGA, an initialization method based on correlation and redundancy is devised to initialize individuals of diverse lengths, and a Pareto dominance-based length change operator is introduced to guide individuals to adaptively explore promising regions of the search space. Moreover, a dominance-based local search method is employed for further improvement. Experimental results on 12 high-dimensional gene datasets show that the Pareto front of feature subsets produced by LA-NSGA is superior to those of existing algorithms.
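The bi-objective view in the abstract can be illustrated with a minimal Python sketch. It is not the authors' LA-NSGA implementation: the abstract does not specify the encoding or operators, so the list-of-indices representation, the names `dominates` and `pareto_front`, and the error values below are all assumptions made for illustration. Accuracy maximization is expressed as minimizing classification error, so both objectives are minimized.

```python
from typing import List, Sequence, Tuple

# Assumption: an individual is a variable-length list of selected feature
# indices. The abstract only says the encoding is "length-variable".
Individual = List[int]
# (classification error, number of selected features); both are minimized.
Objectives = Tuple[float, int]


def dominates(a: Objectives, b: Objectives) -> bool:
    """True if a is no worse than b on both objectives and strictly better on one."""
    return a[0] <= b[0] and a[1] <= b[1] and (a[0] < b[0] or a[1] < b[1])


def pareto_front(points: Sequence[Objectives]) -> List[int]:
    """Indices of the non-dominated points (the first non-dominated front)."""
    return [
        i
        for i, p in enumerate(points)
        if not any(dominates(q, p) for j, q in enumerate(points) if j != i)
    ]


# Hypothetical population of individuals with different lengths.
population: List[Individual] = [[3, 17], [3, 17, 42, 8], [5], [1, 2, 3, 4, 5, 6]]
# Hypothetical classification errors; in practice these would come from a classifier.
errors = [0.10, 0.08, 0.25, 0.08]

objectives = [(err, len(ind)) for err, ind in zip(errors, population)]
front = pareto_front(objectives)
```

Here the last individual is dominated by the second one, which reaches the same error with fewer features, while the other three are mutually non-dominated trade-offs between error and subset size.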
Pages: 1834-1844
Page count: 11
Related papers
50 in total
  • [1] A Length-Adaptive Non-Dominated Sorting Genetic Algorithm for Bi-Objective High-Dimensional Feature Selection
    Yanlu Gong
    Junhai Zhou
    Quanwang Wu
    MengChu Zhou
    Junhao Wen
    IEEE/CAA Journal of Automatica Sinica, 2023, 10 (09) : 1834 - 1844
  • [2] LAGAM: A Length-Adaptive Genetic Algorithm With Markov Blanket for High-Dimensional Feature Selection in Classification
    Zhou, Junhai
    Wu, Quanwang
    Zhou, MengChu
    Wen, Junhao
    Al-Turki, Yusuf
    Abusorrah, Abdullah
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (11) : 6858 - 6869
  • [3] A non-dominated sorting genetic algorithm for a bi-objective pick-up and delivery problem
    Velasco, N.
    Dejax, P.
    Gueret, C.
    Prins, C.
    ENGINEERING OPTIMIZATION, 2012, 44 (03) : 305 - 325
  • [4] A Bi-Search Evolutionary Algorithm for High-Dimensional Bi-Objective Feature Selection
    Xu, Hang
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (05) : 3489 - 3502
  • [5] Hybrid non-dominated sorting genetic algorithm with adaptive operators selection
    Mashwani, Wali Khan
    Salhi, Abdellah
    Yeniay, Ozgur
    Hussian, H.
    Jan, M. A.
    APPLIED SOFT COMPUTING, 2017, 56 : 1 - 18
  • [6] Application of a Non-Dominated Sorting Genetic Algorithm to Solve a Bi-Objective Scheduling Problem Regarding Printed Circuit Boards
    Chang, Yung-Chia
    Chang, Kuei-Hu
    Zheng, Ching-Ping
    MATHEMATICS, 2022, 10 (13)
  • [7] A non-dominated sorting based customized random-key genetic algorithm for the bi-objective traveling thief problem
    Jonatas B. C. Chagas
    Julian Blank
    Markus Wagner
    Marcone J. F. Souza
    Kalyanmoy Deb
    Journal of Heuristics, 2021, 27 : 267 - 301
  • [8] A non-dominated sorting based customized random-key genetic algorithm for the bi-objective traveling thief problem
    Chagas, Jonatas B. C.
    Blank, Julian
    Wagner, Markus
    Souza, Marcone J. F.
    Deb, Kalyanmoy
    JOURNAL OF HEURISTICS, 2021, 27 (03) : 267 - 301
  • [9] A problem-specific non-dominated sorting genetic algorithm for supervised feature selection
    Zhou, Yu
    Zha, Wenjun
    Kang, Junhao
    Zhang, Xiao
    Wang, Xu
    INFORMATION SCIENCES, 2021, 547 : 841 - 859
  • [10] Bi-objective feature selection in high-dimensional datasets using improved binary chimp optimization algorithm
    Al-qudah, Nour Elhuda A.
    Abed-alguni, Bilal H.
    Barhoush, Malek
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 6107 - 6148