Use of SVM-based ensemble feature selection method for gene expression data analysis

Authors
Zhang, Shizhi [1]
Zhang, Mingjin [1]
Affiliation
[1] Qinghai Minzu Univ, Sch Chem & Chem Engn, Xining 810007, Peoples R China
Keywords
ensemble feature selection; gene expression data; support vector machine; CANCER; IDENTIFICATION; CLASSIFICATION; DISCOVERY; PATTERNS
DOI
10.1515/sagmb-2022-0002
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Gene selection is one of the key steps in gene expression data analysis. This paper proposes an SVM-based ensemble feature selection method. First, the method builds many subsets by Monte Carlo sampling. Second, it ranks all features on each subset and integrates the rankings into a final ranking list. Finally, the optimal feature set is determined by a backward feature elimination strategy. The method is applied to four public datasets (Leukemia, Prostate, Colorectal, and SMK_CAN), yielding 7, 10, 13, and 32 selected features, respectively. The AUC values obtained on the independent test sets are 0.9867, 0.9796, 0.9571, and 0.9575, respectively. These results indicate that the features selected by the proposed method can improve sample classification accuracy, and that the method is therefore effective for gene selection from gene expression data.
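The three steps named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the abstract does not specify the SVM solver, the rank-integration rule, or the elimination criterion, so the Pegasos-style linear SVM without a bias term, the summed-rank aggregation, the accuracy-based stopping rule, the synthetic data, and all function names and parameter values below are assumptions made for illustration only.

```python
import random

def train_linear_svm(X, y, epochs=20, lam=0.01, seed=0):
    """Pegasos-style subgradient descent for a linear SVM without a bias term.
    Labels must be +1 or -1. (Assumed solver; the paper's solver is not stated.)"""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    order = list(range(len(X)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]  # regularization shrink
            if margin < 1.0:  # hinge loss is active for this sample
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w

def ensemble_rank(X, y, n_subsets=20, frac=0.8, seed=0):
    """Steps 1 and 2: Monte Carlo subsets, per-subset ranking by |w|,
    then aggregation by summed rank (assumed integration rule)."""
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    rank_sum = [0] * d
    for s in range(n_subsets):
        rows = rng.sample(range(n), int(frac * n))  # sampling without replacement
        w = train_linear_svm([X[i] for i in rows], [y[i] for i in rows], seed=s)
        by_weight = sorted(range(d), key=lambda j: -abs(w[j]))
        for position, j in enumerate(by_weight):
            rank_sum[j] += position
    return sorted(range(d), key=lambda j: rank_sum[j])  # best feature first

def accuracy(w, cols, X, y):
    """Fraction of samples whose predicted sign matches the label."""
    correct = 0
    for xi, yi in zip(X, y):
        score = sum(wj * xi[c] for wj, c in zip(w, cols))
        correct += (1 if score >= 0 else -1) == yi
    return correct / len(X)

def backward_eliminate(ranking, X_tr, y_tr, X_va, y_va, start=10):
    """Step 3: start from the top `start` features and drop the lowest-ranked
    one at a time, keeping the smallest set with the best validation accuracy."""
    cols = list(ranking[:start])
    best_cols, best_acc = list(cols), -1.0
    while cols:
        w = train_linear_svm([[x[c] for c in cols] for x in X_tr], y_tr)
        acc = accuracy(w, cols, X_va, y_va)
        if acc >= best_acc:  # ties favor the smaller feature set
            best_cols, best_acc = list(cols), acc
        cols.pop()  # remove the currently lowest-ranked feature
    return best_cols, best_acc

def make_toy_data(n=80, d=30, seed=1):
    """Synthetic stand-in for expression data: only features 0 and 1 carry signal."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = 1 if i % 2 == 0 else -1
        row = [rng.gauss(0.0, 1.0) for _ in range(d)]
        row[0] += 3.0 * label
        row[1] += 3.0 * label
        X.append(row)
        y.append(label)
    return X, y

X, y = make_toy_data()
X_train, y_train, X_val, y_val = X[:60], y[:60], X[60:], y[60:]
ranking = ensemble_rank(X_train, y_train)
best_cols, best_acc = backward_eliminate(ranking, X_train, y_train, X_val, y_val)
print("top-ranked features:", ranking[:5])
print("selected features:", best_cols, "validation accuracy:", best_acc)
```

On real expression data one would typically standardize each gene first and score the candidate sets by AUC on a held-out set, as the abstract reports; accuracy is used here only to keep the sketch short.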
Pages: 10