Comparison and Evaluation of the Combinations of Feature Selection and Classifier on Microarray Data

Authors
Yan, Chaokun [1]
Zhang, Jun [1]
Kang, Xi [1]
Gong, Zhengze [1]
Wang, Jianlin [1]
Zhang, Ge [1]
Affiliation
[1] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China;
Keywords
Cancer classification prediction; Microarray data; Data analysis; Feature selection; Classification prediction; ALGORITHM; PREDICTION;
DOI
10.1109/ICBDA51983.2021.9403151
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As gene chip technology has become widely used in cancer research, a large amount of valuable microarray data has accumulated rapidly. These data are "high-dimensional, small-sample": most genes are irrelevant or redundant. For high-dimensional, small-sample, high-noise binary classification datasets, we explore which combination of feature selection method and classifier achieves relatively high prediction accuracy while keeping the number of selected features relatively low. We adopt a standard data analysis procedure: preprocess the dataset, use different feature selection methods to generate feature subsets, and apply different classifiers to each feature subset. The results are compared to identify the combinations with relatively high prediction accuracy and a relatively small number of features.
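The procedure described in the abstract (preprocess, select features, classify, compare) can be sketched as a small grid over selector and classifier pairs. This is a minimal illustration, not the authors' implementation: the abstract does not name the methods used, so the two selectors (mean difference and a t-like score), the two classifiers (nearest centroid and 1-nearest-neighbour), the synthetic data, and every function name below are assumptions chosen only to keep the example dependency-free.

```python
import random
from itertools import product
from statistics import mean, pvariance

def make_data(n_samples=30, n_genes=100, n_informative=5, seed=0):
    """Synthetic high-dimensional, small-sample binary data (illustrative only)."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        for j in range(n_informative):  # the first genes carry the class signal
            row[j] += 2.0 * label
        X.append(row)
        y.append(label)
    return X, y

def standardize(X_train, X_test):
    """Preprocessing: z-score each gene using training-set statistics only."""
    cols = list(zip(*X_train))
    mu = [mean(c) for c in cols]
    sd = [pvariance(c) ** 0.5 or 1.0 for c in cols]
    scale = lambda X: [[(v - m) / s for v, m, s in zip(r, mu, sd)] for r in X]
    return scale(X_train), scale(X_test)

def split_by_class(col, y):
    return ([v for v, l in zip(col, y) if l == 0],
            [v for v, l in zip(col, y) if l == 1])

def mean_diff_scores(X, y):
    """Selector 1 (assumed): absolute difference of class means per gene."""
    return [abs(mean(a) - mean(b))
            for a, b in (split_by_class(c, y) for c in zip(*X))]

def t_like_scores(X, y):
    """Selector 2 (assumed): mean difference scaled by its standard error."""
    scores = []
    for c in zip(*X):
        a, b = split_by_class(c, y)
        se = (pvariance(a) / len(a) + pvariance(b) / len(b)) ** 0.5
        scores.append(abs(mean(a) - mean(b)) / (se + 1e-12))
    return scores

def top_k(scores, k):
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]

def sq_dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

def nearest_centroid(X_train, y_train, x):
    """Classifier 1 (assumed): predict the class with the closest mean vector."""
    centroids = {}
    for label in set(y_train):
        rows = [r for r, l in zip(X_train, y_train) if l == label]
        centroids[label] = [mean(c) for c in zip(*rows)]
    return min(centroids, key=lambda l: sq_dist(centroids[l], x))

def one_nn(X_train, y_train, x):
    """Classifier 2 (assumed): label of the single nearest training sample."""
    return min(zip(X_train, y_train), key=lambda p: sq_dist(p[0], x))[1]

def loocv_accuracy(X, y, selector, classifier, k):
    """Leave-one-out accuracy; selection is redone inside every fold."""
    hits = 0
    for i in range(len(X)):
        X_tr, y_tr = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        X_tr, X_te = standardize(X_tr, [X[i]])
        keep = top_k(selector(X_tr, y_tr), k)
        project = lambda r: [r[j] for j in keep]
        pred = classifier([project(r) for r in X_tr], y_tr, project(X_te[0]))
        hits += pred == y[i]
    return hits / len(X)

def compare_combinations(X, y, ks=(5, 20)):
    selectors = {"mean_diff": mean_diff_scores, "t_like": t_like_scores}
    classifiers = {"nearest_centroid": nearest_centroid, "1nn": one_nn}
    return [{"selector": s, "classifier": c, "k": k,
             "accuracy": loocv_accuracy(X, y, selectors[s], classifiers[c], k)}
            for s, c, k in product(selectors, classifiers, ks)]

X, y = make_data()
results = compare_combinations(X, y)
# Prefer higher accuracy; break ties in favour of fewer features.
best = max(results, key=lambda r: (r["accuracy"], -r["k"]))
print(best)
```

Feature scores are recomputed inside each cross-validation fold so that the held-out sample never influences which genes are kept; scoring on the full dataset first would inflate the reported accuracy, which matters most when samples are few. The tie-break on `k` mirrors the stated goal of high accuracy with a small feature subset.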
Pages: 133-137
Number of pages: 5