Use of SVM-based ensemble feature selection method for gene expression data analysis

被引:0
|
作者
Zhang, Shizhi [1 ]
Zhang, Mingjin [1 ]
机构
[1] Qinghai Minzu Univ, Sch Chem & Chem Engn, Xining 810007, Peoples R China
关键词
ensemble feature selection; gene expression data; support vector machine; CANCER; IDENTIFICATION; CLASSIFICATION; DISCOVERY; PATTERNS;
D O I
10.1515/sagmb-2022-0002
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene selection is one of the key steps for gene expression data analysis. An SVM-based ensemble feature selection method is proposed in this paper. Firstly, the method builds many subsets by using Monte Carlo sampling. Secondly, ranking all the features on each of the subsets and integrating them to obtain a final ranking list. Finally, the optimum feature set is determined by a backward feature elimination strategy. This method is applied to the analysis of 4 public datasets: the Leukemia, Prostate, Colorectal, and SMK_CAN, resulting 7, 10, 13, and 32 features. The AUC obtained from independent test sets are 0.9867, 0.9796, 0.9571, and 0.9575, respectively. These results indicate that the features selected by the proposed method can improve sample classification accuracy, and thus be effective for gene selection from gene expression data.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Hybrid Feature Selection Method using Gene Expression Data
    Chuang, Li-Yeh
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08, 2009, : 199 - +
  • [42] A Novel Method of Feature Selection based on SVM
    Liu, Quanjin
    Zhao, Zhimin
    Li, Ying-Xin
    Yu, Xiaolei
    Wang, Yong
    JOURNAL OF COMPUTERS, 2013, 8 (08) : 2144 - 2149
  • [43] An ensemble feature selection method based on mRMR for paired microarray data
    He, Lihua
    Cao, Zhongbo
    Wang, Yan
    Du, Wei
    Liang, Yanchun
    Journal of Computational Information Systems, 2014, 10 (11): : 4875 - 4882
  • [44] A Comparative Analysis of Swarm Intelligence and Evolutionary Algorithms for Feature Selection in SVM-Based Hyperspectral Image Classification
    Shang, Yiqun
    Zheng, Xinqi
    Li, Jiayang
    Liu, Dongya
    Wang, Peipei
    REMOTE SENSING, 2022, 14 (13)
  • [45] SVM-based feature extraction for face recognition
    Kim, Sang-Ki
    Park, Youn Jung
    Toh, Kar-Ann
    Lee, Sangyoun
    PATTERN RECOGNITION, 2010, 43 (08) : 2871 - 2881
  • [46] A SVM-Based Method for Sentiment Analysis in Persian Language
    Hajmohammadi, Mohammad Sadegh
    Ibrahim, Roliana
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2012), 2013, 8768
  • [47] DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis
    Zhang, Chuheng
    Li, Yuanqi
    Chen, Xi
    Jin, Yifei
    Tang, Pingzhong
    Li, Jian
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 781 - 790
  • [48] A METHOD FOR FEATURE SELECTION BASED ON THE OPTIMAL HYPERPLANE OF SVM AND INDEPENDENT ANALYSIS
    Hu, Lin-Fang
    Gong, Wei
    Qi, Li-Xiao
    Wang, Ping
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 114 - 117
  • [49] An Efficient SVM-Based Feature Selection Model for Cancer Classification Using High-Dimensional Microarray Data
    El Kafrawy, Passent
    Fathi, Hanaa
    Qaraad, Mohammed
    Kelany, Ayda K.
    Chen, Xumin
    IEEE ACCESS, 2021, 9 : 155353 - 155369
  • [50] Mass Classification in Mammograms Using Selected Geometry and Texture Features, and a New SVM-Based Feature Selection Method
    Liu, Xiaoming
    Tang, Jinshan
    IEEE SYSTEMS JOURNAL, 2014, 8 (03): : 910 - 920