A biobjective feature selection algorithm for large omics datasets

被引:1
|
作者
Cavique, Luis [1 ,2 ]
Mendes, Armando B. [3 ,4 ]
Martiniano, Hugo F. M. C. [1 ,5 ]
Correia, Luis [1 ]
机构
[1] FCUL, MAS BioISI, Lisbon, Portugal
[2] Univ Aberta, Lisbon, Portugal
[3] Univ Acores, Ponta Delgada, Portugal
[4] Univ Minho, Algoritmi, Braga, Portugal
[5] Inst Dr Ricardo Jorge, Lisbon, Portugal
关键词
biobjective optimization; feature selection; heuristic decomposition; logical analysis of data;
D O I
10.1111/exsy.12301
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is one of the most important concepts in data mining when dimensionality reduction is needed. The performance measures of feature selection encompass predictive accuracy and result comprehensibility. Consistency-based methods are a significant category of feature selection research that substantially improves the comprehensibility of the result using the parsimony principle. In this work, the biobjective version of the algorithm logical analysis of inconsistent data is applied to large volumes of data. In order to deal with hundreds of thousands of attributes, heuristic decomposition uses parallel processing to solve a set covering problem and a cross-validation technique. The biobjective solutions contain the number of reduced features and the accuracy. The algorithm is applied to omics datasets with genome-like characteristics of patients with rare diseases.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Duplication Analysis-Based Evolutionary Algorithm for Biobjective Feature Selection
    Xu, Hang
    Xue, Bing
    Zhang, Mengjie
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2021, 25 (02) : 205 - 218
  • [2] Ant Colony Algorithm for Feature Selection on Microarray Datasets
    Fahrudin, Tresna Maulana
    Syarif, Iwan
    Barakbah, Ali Ridho
    [J]. 2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 351 - 356
  • [3] Datasets Meta-Feature Description for Recommending Feature Selection Algorithm
    Filchenkov, Andrey
    Pendryak, Arseniy
    [J]. 2015 ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE AND INFORMATION EXTRACTION, SOCIAL MEDIA AND WEB SEARCH FRUCT CONFERENCE (AINL-ISMW FRUCT), 2015, : 11 - 18
  • [4] Applications of Feature Selection Techniques on Large Biomedical Datasets
    Ewen, Nicolas
    Abdou, Tamer
    Bener, Ayse
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11489 : 543 - 548
  • [5] Comparison of Mutual Information-based Feature Selection Method for Biological Omics Datasets
    Huang, Zhijun
    [J]. 2021 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2021), 2021, : 60 - 63
  • [6] FEATURE SELECTION FOR IMBALANCED DATASETS BASED ON IMPROVED GENETIC ALGORITHM
    Du, Limin
    Xu, Yang
    Jin, Liuqian
    [J]. DECISION MAKING AND SOFT COMPUTING, 2014, 9 : 119 - 124
  • [7] A Progressive Feature Selection Algorithm for Ultra Large Feature Spaces
    Zhang, Qi
    Weng, Fuliang
    Feng, Zhe
    [J]. COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 561 - 568
  • [8] A Novel Classifier-independent Feature Selection Algorithm for Imbalanced Datasets
    Zhu, Quanyin
    Cao, Suqun
    [J]. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS, 2009, : 77 - +
  • [9] AltWOA: Altruistic Whale Optimization Algorithm for feature selection on microarray datasets
    Kundu, Rohit
    Chattopadhyay, Soham
    Cuevas, Erik
    Sarkar, Ram
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
  • [10] Hybrid feature selection based on SLI and genetic algorithm for microarray datasets
    Abasabadi, Sedighe
    Nematzadeh, Hossein
    Motameni, Homayun
    Akbari, Ebrahim
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (18): : 19725 - 19753