Comparison and Evaluation of the Combinations of Feature Selection and Classifier on Microarray Data

被引:0
|
作者
Yan, Chaokun [1 ]
Zhang, Jun [1 ]
Kang, Xi [1 ]
Gong, Zhengze [1 ]
Wang, Jianlin [1 ]
Zhang, Ge [1 ]
机构
[1] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Cancer classification prediction; Microarray data; Data analysis; Feature selection; Classification prediction; ALGORITHM; PREDICTION;
D O I
10.1109/ICBDA51983.2021.9403151
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As gene chip technology is widely used in cancer research, a large number of valuable microarray data has been rapidly accumulated. These data have the characteristics of "high-dimensional small samples", in which most genes are unrelated or redundant. For high-dimensional, small-sample, high-noise, and few-sample binary classification datasets, we explore which combination of feature selection method and classifier can achieve the relatively best prediction accuracy, while the number of features included is relatively low. We adopt the standard data analysis procedures: preprocessing the data set, using different feature selection methods to generate feature subsets, and applying different classifiers to predict each feature subset. The results are compared to find out which combination with the relatively high prediction accuracy and the relatively small number of features.
引用
收藏
页码:133 / 137
页数:5
相关论文
共 50 条
  • [41] Parallel classification and feature selection in microarray data using SPRINT
    Mitchell, Lawrence
    Sloan, Terence M.
    Mewissen, Muriel
    Ghazal, Peter
    Forster, Thorsten
    Piotrowski, Michal
    Trew, Arthur
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2014, 26 (04): : 854 - 865
  • [42] Modified PSO based Feature Selection for Microarray Data Classification
    Mohapatra, Puspanjali
    Chakravarty, S.
    2015 IEEE POWER, COMMUNICATION AND INFORMATION TECHNOLOGY CONFERENCE (PCITC-2015), 2015, : 703 - 709
  • [43] The application of feature selection methods to analyze the tissue microarray data
    Lin, Weipeng
    Liu, Kunhong
    Liu, Guoyan
    Proceedings of 4th International Workshop on Advanced Computational Intelligence, IWACI 2011, 2011, : 455 - 460
  • [44] Integrating Biological Information for Feature Selection in Microarray Data Classification
    Fang, Ong Huey
    Mustapha, Norwati
    Sulaiman, Md. Nasir
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2, 2010, : 330 - 334
  • [45] Stable feature selection and classification algorithms for multiclass microarray data
    Sebastian Student
    Krzysztof Fujarewicz
    Biology Direct, 7
  • [46] Quality of feature selection based on microarray gene expression data
    Maciejewski, Henryk
    COMPUTATIONAL SCIENCE - ICCS 2008, PT 3, 2008, 5103 : 140 - 147
  • [47] Stable feature selection and classification algorithms for multiclass microarray data
    Student, Sebastian
    Fujarewicz, Krzysztof
    BIOLOGY DIRECT, 2012, 7
  • [48] Comparison of Feature Selection Methods for Cross-Laboratory Microarray Analysis
    Liu, Hsi-Che
    Peng, Pei-Chen
    Hsieh, Tzung-Chien
    Yeh, Ting-Chi
    Lin, Chih-Jen
    Chen, Chien-Yu
    Hou, Jen-Yin
    Shih, Lee-Yung
    Liang, Der-Cherng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (03) : 593 - 604
  • [49] An Experimental Comparison of Feature-Selection and Classification Methods for Microarray Datasets
    Cilia, Nicole Dalia
    De Stefano, Claudio
    Fontanella, Francesco
    Raimondo, Stefano
    di Freca, Alessandra Scotto
    INFORMATION, 2019, 10 (03)
  • [50] Feature Selection and Ensemble Meta Classifier for Multiclass Imbalance Data Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    Alias, Suraya
    Lammasha, Mohamed A. M.
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2018, 2018, : 134 - 139