Improved Microarray Data Analysis using Feature Selection Methods with Machine Learning Methods

被引:0
|
作者
Sun, Jing [1 ]
Passi, Kalpdrum [1 ]
Jain, Chakresh Kumar [2 ]
机构
[1] Laurentian Univ, Dept Math & Comp Sci, Sudbury, ON, Canada
[2] Jaypee Inst lnformat Technol, Dept Biotechnol, Noida, India
关键词
10-folds Cross Validation; Support Vector Machine; Random Forest; Neural Network; K-Nearest-Neighbor; Feature selection; mRMR; MaxRel; QPFS; PLS; REGRESSION; DISCOVERY;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray data analysis directly relates with the state of disease through gene expression profile, and is based upon several feature extractions to classification methodologies. This paper focuses on the study of 8 different ways of feature selection preprocess methods from 4 different feature selection methods. They are Minimum Redundancy-Maximum Relevance (mRMR), Max Relevance (MaxRel), Quadratic Programming Feature Selection (QPFS) and Partial Least Squared (PLS) methods. In this study, microarray datasets of colon cancer and leukemia cancer were used for implementing and testing four different classifiers i.e. K-Nearest-Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM) and Neural Network (NN). The performance was measured by accuracy and AUe (area under the curve) value. The experimental results show that discretization can somehow improve performance of microarray data analysis, and mRMR gives the best performance of microarray data analysis on the colon and leukemia datasets. We also list some results on comparative performance of methods for the specific (data-ratio) number of features.
引用
收藏
页码:1527 / 1534
页数:8
相关论文
共 50 条
  • [31] Evaluating feature selection methods for learning in data mining applications
    Piramuthu, S
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2004, 156 (02) : 483 - 494
  • [32] A review of microarray datasets and applied feature selection methods
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    Benitez, J. M.
    Herrera, F.
    [J]. INFORMATION SCIENCES, 2014, 282 : 111 - 135
  • [33] Evaluating feature selection methods for learning in data mining applications
    Piramuthu, S
    [J]. PROCEEDINGS OF THE THIRTY-FIRST HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, VOL V: MODELING TECHNOLOGIES AND INTELLIGENT SYSTEMS TRACK, 1998, : 294 - 301
  • [34] A comparative study of improvements Pre-filter methods bring on feature selection using microarray data
    Wang Y.
    Fan X.
    Cai Y.
    [J]. Health Information Science and Systems, 2 (1)
  • [35] A critical review of feature selection methods for machine learning in IoT security
    Li, Jing
    Othman, Mohd Shahizan
    Chen, Hewan
    Yusuf, Lizawati Mi
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2024, 30 (03) : 264 - 312
  • [36] Prediction of heart disease by classifying with feature selection and machine learning methods
    Gazeloglu, Cengiz
    [J]. PROGRESS IN NUTRITION, 2020, 22 (02): : 660 - 670
  • [37] A Study on Facial Expression Change Detection Using Machine Learning Methods with Feature Selection Technique
    Sung, Sang-Ha
    Kim, Sangjin
    Park, Byung-Kwon
    Kang, Do-Young
    Sul, Sunhae
    Jeong, Jaehyun
    Kim, Sung-Phil
    [J]. MATHEMATICS, 2021, 9 (17)
  • [38] Heart Diseases Prediction for Optimization based Feature Selection and Classification using Machine Learning Methods
    Rajinikanth, N.
    Pavithra, L.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 636 - 643
  • [39] Feature selection and predicting chemotherapy-induced ulcerative mucositis using machine learning methods
    Satheeshkumar, Poolakkad S.
    El-Dallal, Mohammed
    Mohan, Minu P.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 154
  • [40] Classification of lung cancer using ensemble-based feature selection and machine learning methods
    Cai, Zhihua
    Xu, Dong
    Zhang, Qing
    Zhang, Jiexia
    Ngai, Sai-Ming
    Shao, Jianlin
    [J]. MOLECULAR BIOSYSTEMS, 2015, 11 (03) : 791 - 800