Improved Microarray Data Analysis using Feature Selection Methods with Machine Learning Methods

被引:0
|
作者
Sun, Jing [1 ]
Passi, Kalpdrum [1 ]
Jain, Chakresh Kumar [2 ]
机构
[1] Laurentian Univ, Dept Math & Comp Sci, Sudbury, ON, Canada
[2] Jaypee Inst lnformat Technol, Dept Biotechnol, Noida, India
关键词
10-folds Cross Validation; Support Vector Machine; Random Forest; Neural Network; K-Nearest-Neighbor; Feature selection; mRMR; MaxRel; QPFS; PLS; REGRESSION; DISCOVERY;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray data analysis directly relates with the state of disease through gene expression profile, and is based upon several feature extractions to classification methodologies. This paper focuses on the study of 8 different ways of feature selection preprocess methods from 4 different feature selection methods. They are Minimum Redundancy-Maximum Relevance (mRMR), Max Relevance (MaxRel), Quadratic Programming Feature Selection (QPFS) and Partial Least Squared (PLS) methods. In this study, microarray datasets of colon cancer and leukemia cancer were used for implementing and testing four different classifiers i.e. K-Nearest-Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM) and Neural Network (NN). The performance was measured by accuracy and AUe (area under the curve) value. The experimental results show that discretization can somehow improve performance of microarray data analysis, and mRMR gives the best performance of microarray data analysis on the colon and leukemia datasets. We also list some results on comparative performance of methods for the specific (data-ratio) number of features.
引用
收藏
页码:1527 / 1534
页数:8
相关论文
共 50 条
  • [1] Brain Neural Data Analysis Using Machine Learning Feature Selection and Classification Methods
    Bozhkov, Lachezar
    Georgieva, Petia
    Trifonov, Roumen
    [J]. ENGINEERING APPLICATIONS OF NEURAL NETWORKS (EANN 2014), 2014, 459 : 123 - 132
  • [2] Microarray Data Classification Using Feature Selection and Regularized Methods with Sampling Methods
    Jyothi, Saddi
    Reddy, Y. Sowmya
    Lavanya, K.
    [J]. UBIQUITOUS INTELLIGENT SYSTEMS, 2022, 302 : 351 - 358
  • [3] Comparative study of feature selection methods on microarray data
    Miyamoto, T
    Uchimura, S
    Hamamoto, Y
    Iizuka, N
    Oka, M
    Yamada-Okabe, H
    [J]. IEEE EMBS APBME 2003, 2003, : 82 - 83
  • [4] Feature Selection in Pulmonary Function Test Data with Machine Learning Methods
    Karakis, Rukiye
    Guler, Inan
    Isik, Ali Hakan
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [5] Improved Permeability Prediction of Porous Media by Feature Selection and Machine Learning Methods Comparison
    Tian, J. W.
    Qi, Chongchong
    Peng, Kang
    Sun, Yingfeng
    Yaseen, Zaher Mundher
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2022, 36 (02)
  • [6] A Comparative Study of Improvements Filter Methods Bring on Feature Selection Using Microarray Data
    Wang, Yingying
    Fan, Xiaomao
    Cai, Yunpeng
    [J]. HEALTH INFORMATION SCIENCE, HIS 2014, 2014, 8423 : 55 - 62
  • [7] Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes
    Jirapech-Umpai, T
    Aitken, S
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [8] Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes
    Thanyaluk Jirapech-Umpai
    Stuart Aitken
    [J]. BMC Bioinformatics, 6
  • [9] ANALYSIS OF LArTPC DATA USING MACHINE LEARNING METHODS
    Falko, A.
    Gogota, O.
    Yermolenko, R.
    Kadenko, I.
    [J]. JOURNAL OF PHYSICAL STUDIES, 2024, 28 (01):
  • [10] Analysis of Fleet Data Using Machine Learning Methods
    Ebel, André
    Riemer, Thomas
    Reuss, Hans-Christian
    [J]. Tongji Daxue Xuebao/Journal of Tongji University, 2021, 49 : 186 - 193