Improved Microarray Data Analysis using Feature Selection Methods with Machine Learning Methods

被引:0
|
作者
Sun, Jing [1 ]
Passi, Kalpdrum [1 ]
Jain, Chakresh Kumar [2 ]
机构
[1] Laurentian Univ, Dept Math & Comp Sci, Sudbury, ON, Canada
[2] Jaypee Inst lnformat Technol, Dept Biotechnol, Noida, India
关键词
10-folds Cross Validation; Support Vector Machine; Random Forest; Neural Network; K-Nearest-Neighbor; Feature selection; mRMR; MaxRel; QPFS; PLS; REGRESSION; DISCOVERY;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray data analysis directly relates with the state of disease through gene expression profile, and is based upon several feature extractions to classification methodologies. This paper focuses on the study of 8 different ways of feature selection preprocess methods from 4 different feature selection methods. They are Minimum Redundancy-Maximum Relevance (mRMR), Max Relevance (MaxRel), Quadratic Programming Feature Selection (QPFS) and Partial Least Squared (PLS) methods. In this study, microarray datasets of colon cancer and leukemia cancer were used for implementing and testing four different classifiers i.e. K-Nearest-Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM) and Neural Network (NN). The performance was measured by accuracy and AUe (area under the curve) value. The experimental results show that discretization can somehow improve performance of microarray data analysis, and mRMR gives the best performance of microarray data analysis on the colon and leukemia datasets. We also list some results on comparative performance of methods for the specific (data-ratio) number of features.
引用
收藏
页码:1527 / 1534
页数:8
相关论文
共 50 条
  • [21] Prediction of core cancer genes using a hybrid of feature selection and machine learning methods
    Liu, Y. X.
    Zhang, N. N.
    He, Y.
    Lun, L. J.
    [J]. GENETICS AND MOLECULAR RESEARCH, 2015, 14 (03): : 8871 - 8882
  • [22] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Taghizadeh, Eskandar
    Heydarheydari, Sahel
    Saberi, Alihossein
    JafarpoorNesheli, Shabnam
    Rezaeijo, Seyed Masoud
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [23] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Eskandar Taghizadeh
    Sahel Heydarheydari
    Alihossein Saberi
    Shabnam JafarpoorNesheli
    Seyed Masoud Rezaeijo
    [J]. BMC Bioinformatics, 23
  • [24] Comparison of Feature Selection Methods and Machine Learning Classifiers for Radiomics Analysis in Glioma Grading
    Sun, Pan
    Wang, Defeng
    Mok, Vincent C. T.
    Shi, Lin
    [J]. IEEE ACCESS, 2019, 7 : 102010 - 102020
  • [25] Data Oriented Financial Analysis using Machine Learning Methods
    Altan, Cisem
    Kalayci, Sacide
    Koroglu, Bilge
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 37 - 41
  • [26] Feature selection methods in microarray gene expression data: a systematic mapping study
    Mahnaz Vahmiyan
    Mohammadtaghi Kheirabadi
    Ebrahim Akbari
    [J]. Neural Computing and Applications, 2022, 34 : 19675 - 19702
  • [27] Combination of Feature Selection Methods for the Effective Classification of Microarray Gene Expression Data
    Sheela, T.
    Rangarajan, Lalitha
    [J]. RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 137 - 145
  • [28] Feature selection methods in microarray gene expression data: a systematic mapping study
    Vahmiyan, Mahnaz
    Kheirabadi, Mohammadtaghi
    Akbari, Ebrahim
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 19675 - 19702
  • [29] Novel sparse Bayesian feature selection methods for microarray analysis and QSAR variable selection
    Winkler, David
    Burden, Frank
    Halley, Julianne
    [J]. DRUGS OF THE FUTURE, 2007, 32 : 26 - 26
  • [30] Evaluating feature selection methods for learning in data mining applications
    Piramuthu, S
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 156 (02) : 483 - 494