An analytical method for multiclass molecular cancer classification

被引:54
|
作者
Rifkin, R [1 ]
Mukherjee, S
Tamayo, P
Ramaswamy, S
Yeang, CH
Angelo, M
Reich, M
Poggio, T
Lander, ES
Golub, TR
Mesirov, JP
机构
[1] MIT, Whitehead Inst, Ctr Genome Res, Cambridge, MA 02139 USA
[2] Dana Farber Canc Inst, Dept Adult Oncol, Boston, MA 02115 USA
[3] Dana Farber Canc Inst, Dept Pediat Oncol, Boston, MA 02115 USA
[4] MIT, Dept Biol, Cambridge, MA 02139 USA
[5] MIT, McGovern Inst, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
[6] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
[7] X Mine, Brisbane, CA 94005 USA
关键词
multiclass classification; support vector machine; tumor; molecular classification; pattern recognition; cancer; computational biology;
D O I
10.1137/S0036144502411986
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Modern cancer treatment relies upon microscopic tissue examination to classify tumors according to anatomical site of origin. This approach is effective but subjective and variable even among experienced clinicians and pathologists. Recently, DNA microarray-generated gene expression data has been used to build molecular cancer classifiers. Previous work from our group and others demonstrated methods for solving pairwise classification problems using such global gene expression patterns. However, classification across multiple primary tumor classes poses new methodological and computational challenges. In this paper we describe a computational methodology for multiclass prediction that combines class-specific (one vs. all) binary support vector machines. We apply this methodology to the diagnosis of multiple common adult malignancies using DNA microarray data from a collection of 198 tumor samples, spanning 14 of the most common tumor types. Overall classification accuracy is 78%, far exceeding the expected accuracy for random classification. In a large subset of the samples (80%), the algorithm attains 90% accuracy. The methodology described in this paper both demonstrates that accurate gene expression-based multiclass cancer diagnosis is possible and highlights some of the analytic challenges inherent in applying such strategies to biomedical research.
引用
收藏
页码:706 / 723
页数:18
相关论文
共 50 条
  • [1] Multiclass classification machine based on the analytical center
    Li, XQ
    Yue, JH
    Leng, YG
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1471 - 1474
  • [2] Multiclass classification based on the analytical center of version space
    Zeng, FZ
    Qiu, ZD
    Yue, JH
    Li, XQ
    CHINESE JOURNAL OF ELECTRONICS, 2005, 14 (01): : 83 - 86
  • [3] An Evolutionary Multitasking Method for Multiclass Classification
    Cheng, Fan
    Zhang, Congcong
    Zhang, Xingyi
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2022, 17 (04) : 54 - 69
  • [4] MULTICLASS, MULTIRESIDUE ANALYTICAL METHOD FOR PESTICIDES IN WATER
    THOMPSON, JF
    REID, SJ
    KANTOR, EJ
    ARCHIVES OF ENVIRONMENTAL CONTAMINATION AND TOXICOLOGY, 1977, 6 (2-3) : 143 - 157
  • [5] A multiclass classification method based on output design
    Qiang, Qi
    He, Qinming
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 15 - 19
  • [6] Decomposition Method for Neural Multiclass Classification Problem
    El Ayech, H.
    Trabelsi, A.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 15, 2006, 15 : 150 - 153
  • [7] Degree of Differential Prioritization Prediction for Multiclass Molecular Classification
    Ooi, Chia Huey
    Chetty, Madhu
    Teng, Shyh Wei
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2009, 28 (04): : 45 - 51
  • [8] PUGSVM: a caBIG™ analytical tool for multiclass gene selection and predictive classification
    Yu, Guoqiang
    Li, Huai
    Ha, Sook
    Shih, Ie-Ming
    Clarke, Robert
    Hoffman, Eric P.
    Madhavan, Subha
    Xuan, Jianhua
    Wang, Yue
    BIOINFORMATICS, 2011, 27 (05) : 736 - 738
  • [9] A multiclass classification method by distance mapping learning network
    Suzuki, K
    Hashimoto, S
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 393 - 397
  • [10] GENERALIZATION OF POTENTIAL FUNCTION METHOD TO MULTICLASS PATTERN CLASSIFICATION
    BABU, CC
    CHAN, WC
    INTERNATIONAL JOURNAL OF CONTROL, 1971, 13 (05) : 865 - &