Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets

被引:15
|
作者
Ooi, Chia Huey [1 ]
Chetty, Madhu [1 ]
Teng, Shyh Wei [1 ]
机构
[1] Monash Univ, Gippsland Sch Informat Technol, Churchill, Vic, Australia
关键词
tissue classification; microarray data analysis; multiclass classification; feature selection; classifier aggregation;
D O I
10.1007/s10618-006-0055-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The high dimensionality of microarray datasets endows the task of multiclass tissue classification with various difficulties-the main challenge being the selection of features deemed relevant and non-redundant to form the predictor set for classifier training. The necessity of varying the emphases on relevance and redundancy, through the use of the degree of differential prioritization (DDP) during the search for the predictor set is also of no small importance. Furthermore, there are several types of decomposition technique for the feature selection (FS) problem-all-classes-at-once, one-vs.-all (OVA) or pairwise (PW). Also, in multiclass problems, there is the need to consider the type of classifier aggregation used-whether non-aggregated (a single machine), or aggregated (OVA or PW). From here, first we propose a systematic approach to combining the distinct problems of FS and classification. Then, using eight well-known multiclass microarray datasets, we empirically demonstrate the effectiveness of the DDP in various combinations of FS decomposition types and classifier aggregation methods. Aided by the variable DDP, feature selection leads to classification performance which is better than that of rank-based or equal-priorities scoring methods and accuracies higher than previously reported for benchmark datasets with large number of classes. Finally, based on several criteria, we make general recommendations on the optimal choice of the combination of FS decomposition type and classifier aggregation method for multiclass microarray datasets.
引用
收藏
页码:329 / 366
页数:38
相关论文
共 50 条
  • [1] Differential prioritization in feature selection and classifier aggregation for multiclass microarray datasets
    Chia Huey Ooi
    Madhu Chetty
    Shyh Wei Teng
    Data Mining and Knowledge Discovery, 2007, 14 : 329 - 366
  • [2] A Study on the Importance of Differential Prioritization in Feature Selection Using Toy Datasets
    Ooi, Chia Huey
    Teng, Shyh Wei
    Chetty, Madhu
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 : 311 - 322
  • [3] Relevance, redundancy and differential prioritization in feature selection for multiclass gene expression data
    Ooi, CH
    Chetty, M
    Teng, SW
    BIOLOGICAL AND MEDICAL DATA ANALYSIS, PROCEEDINGS, 2005, 3745 : 367 - 378
  • [4] Stable feature selection and classification algorithms for multiclass microarray data
    Student, Sebastian
    Fujarewicz, Krzysztof
    BIOLOGY DIRECT, 2012, 7
  • [5] Stable feature selection and classification algorithms for multiclass microarray data
    Sebastian Student
    Krzysztof Fujarewicz
    Biology Direct, 7
  • [6] Ant Colony Algorithm for Feature Selection on Microarray Datasets
    Fahrudin, Tresna Maulana
    Syarif, Iwan
    Barakbah, Ali Ridho
    2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 351 - 356
  • [7] A review of microarray datasets and applied feature selection methods
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    Benitez, J. M.
    Herrera, F.
    INFORMATION SCIENCES, 2014, 282 : 111 - 135
  • [8] Feature Selection and Ensemble Meta Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Alias, Suraya
    Lammasha, Mohamed A. M.
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2018, 2018, : 134 - 139
  • [9] Iterative ensemble feature selection for multiclass classification of imbalanced microarray data
    Yang, Junshan
    Zhou, Jiarui
    Zhu, Zexuan
    Ma, Xiaoliang
    Ji, Zhen
    JOURNAL OF BIOLOGICAL RESEARCH-THESSALONIKI, 2016, 23
  • [10] Comparison and Evaluation of the Combinations of Feature Selection and Classifier on Microarray Data
    Yan, Chaokun
    Zhang, Jun
    Kang, Xi
    Gong, Zhengze
    Wang, Jianlin
    Zhang, Ge
    2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021, : 133 - 137