Thinned ECOC Decomposition for Gene Expression Based Cancer Classification

被引:1
|
作者
Hatami, Nima [1 ]
机构
[1] Shahed Univ, Dept Elect Engn, Tehran, Iran
关键词
Multiple Classifier Systems (MCS); Thinning; Diversity; Error Correcting Output Codes (ECOC); Support Vector Machine; Cancer classification; Gene expression data;
D O I
10.1109/ISDA.2008.308
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer classification using gene expression data has the great importance in bioinformatics and is known to contain the keys for addressing the fundamental problems relating to cancer diagnosis and drug discovery. Error correcting output coding (ECOC) is a method to design Multiple Classifier Systems (MCS), which reduces a multi-class problem into some binary sub problem. A key issue in design of any ECOC ensemble is defining optimal code matrix with maximum discrimination power and minimum number of columns. This paper introduces a heuristic method for application dependent design of optimal ECOC matrix base on the thinning algorithm used in the ensemble design. The key idea of proposed method which called Thinned ECOC is to remove some redundant and unnecessary columns of any initial code matrix successively based on a metric defined for each column. Experimental results on two real datasets show the robustness of Thinned ECOC in comparison with the other existing code generation methods.
引用
收藏
页码:216 / 221
页数:6
相关论文
共 50 条
  • [1] Thinned-ECOC ensemble based on sequential code shrinking
    Hatami, Nima
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 936 - 947
  • [2] Gene expression based cancer classification
    Tarek, Sara
    Abd Elwahab, Reda
    Shoman, Mahmoud
    EGYPTIAN INFORMATICS JOURNAL, 2017, 18 (03) : 151 - 159
  • [3] Gene boosting for cancer classification based on gene expression profiles
    Hong, Jin-Hyuk
    Cho, Sung-Bae
    PATTERN RECOGNITION, 2009, 42 (09) : 1761 - 1767
  • [4] Dataset complexity and gene expression based cancer classification
    Okun, Oleg
    Priisalu, Helen
    APPLICATIONS OF FUZZY SETS THEORY, 2007, 4578 : 484 - +
  • [5] Classification of Cancer Types based on Gene Expression Data
    He, Yinchao
    Bockmon, Ryan
    Modey, Miracle
    Roscoe, Sarah
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2175 - 2182
  • [6] Multiclass cancer classification based on gene expression comparison
    Yang, Sitan
    Naiman, Daniel Q.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2014, 13 (04) : 477 - 496
  • [7] A Hierarchical Ensemble of ECOC for cancer classification based on multi-class microarray data
    Liu, Kun-Hong
    Zeng, Zhi-Hao
    Ng, Vincent To Yee
    INFORMATION SCIENCES, 2016, 349 : 102 - 118
  • [8] Cancer Classification Ensemble System Based on Gene Expression Profiles
    Tarek, Sara
    Elwahab, Reda Abd
    Shoman, Mahmoud
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [9] Gene Expression Profiles based Human Cancer Diseases Classification
    Salem, Hanaa
    Attiya, Gamal
    El-Fishawy, Nawal
    2015 11TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2015, : 181 - 187
  • [10] Cancer classification based on gene expression using neural networks
    Hu, H. P.
    Niu, Z. J.
    Bai, Y. P.
    Tan, X. H.
    GENETICS AND MOLECULAR RESEARCH, 2015, 14 (04) : 17605 - 17611