Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets

Cited by: 15
Authors
Ooi, Chia Huey [1]
Chetty, Madhu [1]
Teng, Shyh Wei [1]
Affiliation
[1] Monash Univ, Gippsland Sch Informat Technol, Churchill, Vic, Australia
Keywords
tissue classification; microarray data analysis; multiclass classification; feature selection; classifier aggregation
DOI
10.1007/s10618-006-0055-5
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The high dimensionality of microarray datasets makes multiclass tissue classification difficult; the main challenge is selecting features that are both relevant and non-redundant to form the predictor set for classifier training. It is also important to vary the emphasis placed on relevance versus redundancy during the search for the predictor set, which is controlled by the degree of differential prioritization (DDP). Furthermore, the feature selection (FS) problem can be decomposed in several ways: all-classes-at-once, one-vs.-all (OVA), or pairwise (PW). In multiclass problems, the type of classifier aggregation must also be considered: non-aggregated (a single machine) or aggregated (OVA or PW). We first propose a systematic approach to combining the distinct problems of FS and classification. Then, using eight well-known multiclass microarray datasets, we empirically demonstrate the effectiveness of the DDP across various combinations of FS decomposition type and classifier aggregation method. Aided by the variable DDP, feature selection yields classification performance better than that of rank-based or equal-priorities scoring methods, and accuracies higher than previously reported for benchmark datasets with a large number of classes. Finally, based on several criteria, we make general recommendations on the optimal combination of FS decomposition type and classifier aggregation method for multiclass microarray datasets.
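The abstract does not state the DDP scoring formula, so the following is only a rough sketch of the idea of trading relevance against redundancy. It assumes a set score of the form relevance^alpha times antiredundancy^(1 - alpha), with alpha standing in for the DDP, a between-class to within-class sum-of-squares ratio as the relevance measure, and one minus the absolute Pearson correlation as the antiredundancy measure. All function names (`relevance`, `set_score`, `select_features`, `decompose`) and the toy data are hypothetical and are not taken from the paper.

```python
from itertools import combinations
from math import sqrt


def relevance(feature, labels):
    """Ratio of between-class to within-class sum of squares (assumed measure)."""
    overall = sum(feature) / len(feature)
    bss = wss = 0.0
    for c in set(labels):
        values = [x for x, y in zip(feature, labels) if y == c]
        mean_c = sum(values) / len(values)
        bss += len(values) * (mean_c - overall) ** 2
        wss += sum((x - mean_c) ** 2 for x in values)
    return bss / (wss + 1e-12)  # small constant avoids division by zero


def abs_correlation(x, y):
    """Absolute Pearson correlation, clamped to [0, 1]."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return min(1.0, abs(cov) / sqrt(vx * vy))


def set_score(subset, features, labels, alpha):
    """Assumed score: mean relevance**alpha * mean antiredundancy**(1 - alpha)."""
    v = sum(relevance(features[i], labels) for i in subset) / len(subset)
    pairs = list(combinations(subset, 2))
    u = 1.0  # a single feature has no redundancy with itself
    if pairs:
        u = sum(1.0 - abs_correlation(features[i], features[j])
                for i, j in pairs) / len(pairs)
    return v ** alpha * u ** (1.0 - alpha)


def select_features(features, labels, size, alpha):
    """Greedy forward search: start from the most relevant feature, then
    repeatedly add the feature that maximizes the assumed set score."""
    indices = range(len(features))
    selected = [max(indices, key=lambda i: relevance(features[i], labels))]
    while len(selected) < size:
        rest = [i for i in indices if i not in selected]
        selected.append(max(
            rest, key=lambda i: set_score(selected + [i], features, labels, alpha)))
    return selected


def decompose(classes, scheme):
    """Enumerate two-group subproblems for the OVA and PW decompositions."""
    if scheme == "OVA":
        return [((c,), tuple(o for o in classes if o != c)) for c in classes]
    if scheme == "PW":
        return [((a,), (b,)) for a, b in combinations(classes, 2)]
    raise ValueError("scheme must be 'OVA' or 'PW'")


# Toy data: three classes, two samples each; each inner list is one feature.
labels = [0, 0, 1, 1, 2, 2]
features = [
    [1.0, 1.2, 3.0, 3.2, 5.0, 5.2],  # highly relevant
    [1.0, 1.3, 3.0, 3.3, 5.0, 5.3],  # relevant, near-duplicate of feature 0
    [1.0, 1.5, 5.0, 5.5, 1.2, 1.7],  # less relevant, nearly uncorrelated with 0
]
rank_based = select_features(features, labels, size=2, alpha=1.0)
balanced = select_features(features, labels, size=2, alpha=0.5)
```

On this toy data, `alpha=1.0` (relevance only, comparable to rank-based selection) picks the near-duplicate feature as the second predictor, whereas `alpha=0.5` (equal priorities) picks the nearly uncorrelated one. Intermediate values of `alpha` shift the balance between the two, which is the kind of trade-off the variable DDP is described as controlling.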
Pages: 329-366
Page count: 38