Recognition of Multiple Imbalanced Cancer Types Based on DNA Microarray Data Using Ensemble Classifiers

被引:12
|
作者
Yu, Hualong [1 ]
Hong, Shufang [1 ]
Yang, Xibei [1 ]
Ni, Jun [2 ]
Dan, Yuanyuan [3 ]
Qin, Bin [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang 212003, Peoples R China
[2] Univ Iowa, Dept Radiol, Carver Coll Med, Iowa City, IA 52242 USA
[3] Jiangsu Univ Sci & Technol, Sch Biol & Chem Engn, Zhenjiang 212003, Peoples R China
基金
中国国家自然科学基金;
关键词
SUPPORT VECTOR MACHINES; GENE-EXPRESSION DATA; MOLECULAR CLASSIFICATION; REGULATORY NETWORK; CLASS PREDICTION; CARCINOMAS; BINARY;
D O I
10.1155/2013/239628
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Ensemble Classifiers Based on Kernel PCA for Cancer Data Classification
    Zhou, Jin
    Pan, Yuqi
    Chen, Yuehui
    Liu, Yang
    [J]. EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 955 - +
  • [32] Spoken language recognition using ensemble classifiers
    Ma, Bin
    Li, Haizhou
    Tong, Rong
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2053 - 2062
  • [33] Recognition of colon cells using ensemble of classifiers
    Kruk, M.
    Osowski, S.
    Koktysz, R.
    [J]. 2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 288 - +
  • [34] Video based face recognition using multiple classifiers
    Tang, XO
    Li, ZF
    [J]. SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, : 345 - 349
  • [35] FACIAL EXPRESSION RECOGNITION USING ENSEMBLE OF CLASSIFIERS
    Zavaschi, T. H. H.
    Koerich, A. L.
    Oliveira, L. E. S.
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1489 - 1492
  • [36] Improving Accelerometer-Based Activity Recognition by Using Ensemble of Classifiers
    Daghistani, Tahani
    Alshammari, Riyad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (05) : 128 - 133
  • [37] Iterative ensemble feature selection for multiclass classification of imbalanced microarray data
    Yang, Junshan
    Zhou, Jiarui
    Zhu, Zexuan
    Ma, Xiaoliang
    Ji, Zhen
    [J]. JOURNAL OF BIOLOGICAL RESEARCH-THESSALONIKI, 2016, 23
  • [38] Ensemble Classifiers for Acute Leukemia Classification Using Microarray Gene Expression Data under uncertainty
    Gamal, Mona
    Zaied, Abdel Nasser H.
    Rushdy, Ehab
    [J]. Neutrosophic Sets and Systems, 2022, 49 : 164 - 183
  • [39] Cancer Classification from Gene Expression Based Microarray Data Using SVM Ensemble
    Begum, Shemim
    Chakraborty, Debasis
    Sarkar, Ram
    [J]. 2015 International Conference on Condition Assessment Techniques in Electrical Systems (CATCON), 2015, : 13 - 16
  • [40] Cancer identification based on DNA microarray data
    Liu, Yihui
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 153 - +