A novel multiclass gene selection method based on SVM/MLP cross validation

Cited by: 0
Authors
Zhang, Junying [1]
Zhang, Hongyi [1]
Liu, Shenling [1]
Wang, Yue Joseph [2]
Affiliations
[1] Xidian Univ, Sch Comp Sci & Engn, Xian 710071, Peoples R China
[2] Virginia Polytech Inst & State Univ, Dept Elect & Comp Engn, Alexandria, VA 22314 USA
Funding
National Science Foundation (USA);
Keywords
DNA microarray data; gene selection; curse of dimensionality; diagnostic genes; cross validation; SVM; MLP;
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Gene selection is one of the major challenges of biochip technology. It is needed to resolve the curse of dimensionality, which arises especially in DNA microarray datasets where there are thousands of genes and only a few experiments (samples), and for gene diagnosis, where a small gene subset is enough to diagnose diseases. This paper presents a gene selection method that trains linear SVM (support vector machine) and nonlinear MLP (multi-layer perceptron) classifiers and tests them with cross validation to find a gene subset that is optimal or suboptimal for the diagnosis of binary or multiple disease classes. Genes are first selected incrementally with a linear SVM classifier for the diagnosis of each binary disease class pair, with generalization ability tested by leave-one-out cross validation. The union of these per-pair subsets is used as the initial gene subset for discriminating all the disease classes. Genes are then deleted from it one by one, decrementally, by removing the gene which brings the greatest decrease of the generalization power after the removal, where generalization is measured by leave-one-out and leave-4-out cross validation. For real DNA microarray data with 2308 genes and only 64 labelled samples belonging to 4 disease classes, only 6 genes are selected as diagnostic genes. The diagnostic genes are tested with a 6-2-4 MLP under both leave-one-out and leave-4-out cross validation, resulting in no misclassification.
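The two-stage procedure in the abstract (per-pair incremental selection, union, then decremental deletion) can be illustrated with a small sketch. This is not the authors' implementation. It rests on several assumptions: a nearest-centroid classifier stands in for both the linear SVM and the MLP so that only the standard library is needed, only leave-one-out cross validation is used (no leave-4-out), the deletion step is read as dropping the gene whose removal costs the least accuracy and stopping once any removal would lower accuracy, and the function names and toy data are hypothetical.

```python
from itertools import combinations

def centroid_predict(train_X, train_y, x):
    """Nearest-centroid classifier (assumed stand-in for the linear SVM / MLP)."""
    sums, counts = {}, {}
    for row, label in zip(train_X, train_y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, v in enumerate(row):
            acc[j] += v
        counts[label] = counts.get(label, 0) + 1

    def dist(label):
        return sum((s / counts[label] - v) ** 2 for s, v in zip(sums[label], x))

    return min(sorted(sums), key=dist)

def loo_accuracy(X, y, genes):
    """Leave-one-out accuracy using only the listed gene columns."""
    if not genes:
        return 0.0
    sub = [[row[g] for g in genes] for row in X]
    hits = 0
    for i in range(len(sub)):
        train_X = sub[:i] + sub[i + 1:]
        train_y = y[:i] + y[i + 1:]
        hits += centroid_predict(train_X, train_y, sub[i]) == y[i]
    return hits / len(sub)

def forward_select(X, y):
    """Add one gene at a time while leave-one-out accuracy keeps improving."""
    chosen, best = [], 0.0
    while best < 1.0:
        scored = [(loo_accuracy(X, y, chosen + [g]), -g)
                  for g in range(len(X[0])) if g not in chosen]
        if not scored:
            break
        acc, neg_g = max(scored)  # ties go to the lowest gene index
        if acc <= best:
            break
        chosen.append(-neg_g)
        best = acc
    return chosen

def select_genes(X, y):
    # Stage 1: incremental selection for every pair of classes, then the union.
    pool = set()
    for a, b in combinations(sorted(set(y)), 2):
        idx = [i for i, label in enumerate(y) if label in (a, b)]
        pool.update(forward_select([X[i] for i in idx], [y[i] for i in idx]))
    genes = sorted(pool)
    # Stage 2: decremental deletion on all classes (assumed stopping rule:
    # stop as soon as every possible removal would lower accuracy).
    best = loo_accuracy(X, y, genes)
    while len(genes) > 1:
        acc, neg_d = max((loo_accuracy(X, y, [g for g in genes if g != d]), -d)
                         for d in genes)
        if acc < best:
            break
        genes.remove(-neg_d)
        best = acc
    return genes

# Toy data (hypothetical): gene 0 separates A from B and C, gene 1 separates
# C from A and B, genes 2 and 3 carry no class information.
X = [[0.0, 0.0, 5.0, 1.0], [0.2, 0.1, 5.0, 3.0], [0.1, 0.2, 5.0, 2.0],
     [1.0, 0.1, 5.0, 2.0], [1.2, 0.0, 5.0, 1.0], [1.1, 0.2, 5.0, 3.0],
     [1.0, 1.0, 5.0, 3.0], [1.1, 1.2, 5.0, 1.0], [1.2, 1.1, 5.0, 2.0]]
y = ["A", "A", "A", "B", "B", "B", "C", "C", "C"]
print(select_genes(X, y))
```

On this toy data the per-pair stage picks one gene for each class pair, and the deletion stage keeps both surviving genes because removing either one lowers the leave-one-out accuracy. Swapping `centroid_predict` for a real SVM or MLP would leave the selection loops unchanged.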
Pages: 2205 / +
Page count: 2
Related papers
  • [2] Gene association study with SVM, MLP and cross-validation for the diagnosis of diseases
    Zhang, Junying
    Liu, Shenling
    Wang, Yue
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2008, 18 (06) : 741 - 750
  • [3] A Novel Method of Feature Selection based on SVM
    Liu, Quanjin
    Zhao, Zhimin
    Li, Ying-Xin
    Yu, Xiaolei
    Wang, Yong
    JOURNAL OF COMPUTERS, 2013, 8 (08) : 2144 - 2149
  • [4] Granularity selection for cross-validation of SVM
    Liu, Yong
    Liao, Shizhong
    INFORMATION SCIENCES, 2017, 378 : 475 - 483
  • [5] Novel feature selection method based on NGA/PCA and SVM
    Faculty of Computer and Information, Hefei University of Technology, Hefei 230009, China
    Xitong Fangzhen Xuebao, 2007, 20 (4823-4826): : 4823 - 4826
  • [6] SVM-RFE Based Feature Selection and Taguchi Parameters Optimization for Multiclass SVM Classifier
    Huang, Mei-Ling
    Hung, Yung-Hsiang
    Lee, W. M.
    Li, R. K.
    Jiang, Bo-Ru
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [7] A Novel SVM-RFE for Gene Selection
    Tan, Jun-Yan
    Yang, Zhi-Xia
    Deng, Naiyang
    OPTIMIZATION AND SYSTEMS BIOLOGY, 2009, 11 : 237 - +
  • [8] Novel multiclass SVM-based binary decision tree classifier
    Osman, Hossam
    2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, VOLS 1-3, 2007, : 540 - 543
  • [9] A Novel Multiclass Classification Method with Gene Expression Programming
    Huang, Jiangtao
    Deng, Chuang
    WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 139 - +
  • [10] A Hybrid Feature Selection Method Using Multiclass SVM for Diagnosis of Erythemato-Squamous Disease
    Maryam
    Setiawan, Noor Akhmad
    Wahyunggoro, Oyas
    INTERNATIONAL CONFERENCE ON MATHEMATICS: PURE, APPLIED AND COMPUTATION: EMPOWERING ENGINEERING USING MATHEMATICS, 2017, 1867