A support vector machine-recursive feature elimination feature selection method based on artificial contrast variables and mutual information

被引:153
|
作者
Lin, Xiaohui [2 ]
Yang, Fufang [2 ]
Zhou, Lina [1 ]
Yin, Peiyuan [1 ]
Kong, Hongwei [1 ]
Xing, Wenbin [3 ]
Lu, Xin [1 ]
Jia, Lewen [4 ]
Wang, Quancai [2 ]
Xu, Guowang [1 ]
机构
[1] Chinese Acad Sci, Dalian Inst Chem Phys, CAS Key Lab Separat Sci Analyt Chem, Dalian 116023, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Peoples R China
[3] Sixth Peoples Hosp, Dalian 116001, Peoples R China
[4] Dalian Med Univ, Affiliated Hosp 1, Dept Nephrol, Dalian 116011, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial contrast variables; Mutual information; SVM-RFE; Liver diseases; Metabolomics; HEPATOMA PLASMA-MEMBRANES; GENE SELECTION; SVM-RFE; HEPATOCELLULAR-CARCINOMA; REGENERATING LIVER; MASS-SPECTROMETRY; FATTY-ACIDS; L-CARNITINE; CLASSIFICATION; METABOLOMICS;
D O I
10.1016/j.jchromb.2012.05.020
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Filtering the discriminative metabolites from high dimension metabolome data is very important in metabolomics study. Support vector machine-recursive feature elimination (SVM-RFE) is an efficient feature selection technique and has shown promising applications in the analysis of the metabolome data. SVM-RFE measures the weights of the features according to the support vectors, noise and non-informative variables in the high dimension data may affect the hyper-plane of the SVM learning model. Hence we proposed a mutual information (MI)-SVM-RFE method which filters out noise and non-informative variables by means of artificial variables and MI, then conducts SVM-RFE to select the most discriminative features. A serum metabolomics data set from patients with chronic hepatitis B, cirrhosis and hepatocellular carcinoma analyzed by liquid chromatography-mass spectrometry (LC-MS) was used to demonstrate the validation of our method. An accuracy of 74.33 +/- 2.98% to distinguish among three liver diseases was obtained, better than 72.00 +/- 4.15% from the original SVM-RFE. Thirty-four ion features were defined to distinguish among the control and 3 liver diseases, 17 of them were identified. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:149 / 155
页数:7
相关论文
共 50 条
  • [1] Feature Selection Method Based on Mutual Information and Support Vector Machine
    Liu, Gang
    Yang, Chunlei
    Liu, Sen
    Xiao, Chunbao
    Song, Bin
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (06)
  • [2] Feature clustering based support vector machine recursive feature elimination for gene selection
    Huang, Xiaojuan
    Zhang, Li
    Wang, Bangjun
    Li, Fanzhang
    Zhang, Zhao
    [J]. APPLIED INTELLIGENCE, 2018, 48 (03) : 594 - 607
  • [3] Feature clustering based support vector machine recursive feature elimination for gene selection
    Xiaojuan Huang
    Li Zhang
    Bangjun Wang
    Fanzhang Li
    Zhao Zhang
    [J]. Applied Intelligence, 2018, 48 : 594 - 607
  • [4] Support Vector Machine-Recursive Feature Elimination for the Diagnosis of Parkinson Disease based on Speech Analysis
    Ma, Hengbo
    Tan, Tianyu
    Zhou, Hongpeng
    Gao, Tianyi
    [J]. 2016 SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2016, : 34 - 40
  • [5] An Enhanced Hybrid Feature Selection Technique Using Term Frequency-Inverse Document Frequency and Support Vector Machine-Recursive Feature Elimination for Sentiment Classification
    Nafis, Nur Syafiqah Mohd
    Awang, Suryanti
    [J]. IEEE ACCESS, 2021, 9 : 52177 - 52192
  • [6] Mass Diagnosis in Mammography with Mutual Information Based Feature Selection and Support Vector Machine
    Liu, Xiaoming
    Li, Bo
    Liu, Jun
    Xu, Xin
    Feng, Zhilin
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2012, 2012, 7390 : 1 - 8
  • [7] A hybrid feature selection method combining Gini index and support vector machine with recursive feature elimination for gene expression classification
    Almutiri, Talal
    Saeed, Faisal
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (01) : 41 - 62
  • [8] Hybrid adapted fast correlation FCBF-support vector machine recursive feature elimination for feature selection
    Djellali, Hayet
    Ghoualmi-Zine, Nacira
    Guessoum, Souad
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2020, 14 (03): : 269 - 279
  • [9] Raman spectra exploring breast tissues: Comparison of principal component analysis and support vector machine-recursive feature elimination
    Hu, Chengxu
    Wang, Juexin
    Zheng, Chao
    Xu, Shuping
    Zhang, Haipeng
    Liang, Yanchun
    Bi, Lirong
    Fan, Zhimin
    Han, Bing
    Xu, Weiqing
    [J]. MEDICAL PHYSICS, 2013, 40 (06)
  • [10] A Hybrid Feature Selection Approach for Parkinson’s Detection Based on Mutual Information Gain and Recursive Feature Elimination
    Rohit Lamba
    Tarun Gulati
    Anurag Jain
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 10263 - 10276