Ensemble learning-based classification of microarray cancer data on tree-based features

Cited by: 18
Authors
Dagnew, Guesh [1]
Shekar, B. H. [1]
Affiliation
[1] Mangalore Univ, Dept Studies & Res Comp Sci, Mangalore, Karnataka, India
Keywords
PARTICLE SWARM OPTIMIZATION; GENE-EXPRESSION DATA; FEATURE-SELECTION; ALGORITHM
DOI
10.1049/ccs2.12003
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Cancer is a group of related diseases with a high mortality rate, characterized by abnormal cell growth that attacks body tissues. Microarray cancer data is a prominent research topic across many disciplines, with work focused on problems related to the curse of dimensionality, small sample sizes, noisy data and class imbalance. A random forest (RF) tree-based feature selection method combined with ensemble learning based on hard voting and soft voting is proposed to classify microarray cancer data using six different base classifiers. The features selected by the RF are submitted to the base classifiers as the training set. An ensemble learning method is then applied to the base classifiers, with each base classifier predicting the class label individually. The final prediction on the test set is made by hard and soft voting, which use majority voting and weighted probability, respectively. The proposed ensemble learning method is validated on eight standard microarray cancer datasets, of which three are binary-class and the remaining five are multi-class. Experimental results show a classification accuracy of 1.00 on six of the datasets and 0.96 on the other two.
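The difference between the two voting rules named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the six base classifiers, so the three classifiers, their class probabilities, the labels "tumor" and "normal", and the function names `hard_vote` and `soft_vote` are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def hard_vote(predictions):
    """Majority voting: return the label predicted by the most classifiers.

    Ties are resolved in favor of the label that appears first.
    """
    return Counter(predictions).most_common(1)[0][0]


def soft_vote(probabilities, weights=None):
    """Weighted-probability voting.

    probabilities: one dict per classifier, mapping label -> probability.
    weights: optional per-classifier weights (equal weights if omitted).
    Returns the winning label and the weighted average score per label.
    """
    if weights is None:
        weights = [1.0] * len(probabilities)
    total = sum(weights)
    scores = {}
    for weight, probs in zip(weights, probabilities):
        for label, p in probs.items():
            scores[label] = scores.get(label, 0.0) + weight * p / total
    return max(scores, key=scores.get), scores


# Made-up outputs of three hypothetical base classifiers for one test sample.
probs = [
    {"tumor": 0.55, "normal": 0.45},
    {"tumor": 0.60, "normal": 0.40},
    {"tumor": 0.10, "normal": 0.90},
]

# Each classifier's individual prediction is its most probable label.
labels = [max(p, key=p.get) for p in probs]

hard_label = hard_vote(labels)
soft_label, soft_scores = soft_vote(probs)

print("hard voting:", hard_label)
print("soft voting:", soft_label, soft_scores)
```

In this made-up case the two rules disagree: two of the three classifiers lean weakly toward "tumor", so hard voting picks it, while the third classifier's confident "normal" prediction dominates the averaged probabilities under soft voting.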
Pages: 48 - 60
Page count: 13
Related papers
50 in total
  • [21] Ensemble classification of microarray data based on correlation analysis
    Yu, Hualong
    Gu, Guochang
    Liu, Haibo
    Shen, Jing
    Zhao, Jing
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (02) : 328 - 335
  • [22] Tree-based disease classification using protein data
    Zhu, HT
    Yu, CY
    Zhang, HP
    PROTEOMICS, 2003, 3 (09) : 1673 - 1677
  • [23] Deep learning-based tree classification using mobile LiDAR data
    Guan, Haiyan
    Yu, Yongtao
    Ji, Zheng
    Li, Jonathan
    Zhang, Qi
    REMOTE SENSING LETTERS, 2015, 6 (11) : 864 - 873
  • [24] Tree-based models for inductive classification on the Web Of Data
    Rizzo, Giuseppe
    d'Amato, Claudia
    Fanizzi, Nicola
    Esposito, Floriana
    JOURNAL OF WEB SEMANTICS, 2017, 45 : 1 - 22
  • [25] Tree-based classification and regression Part 3: Tree-based procedures
    Gunter, B
    QUALITY PROGRESS, 1998, 31 (02) : 121 - 123
  • [26] Evaluating Tree-based Ensemble Strategies for Imbalanced Network Attack Classification
    Soon, Hui Fern
    Amir, Amiza
    Nishizaki, Hiromitsu
    Zahri, Nik Adilah Hanin
    Kamarudin, Latifah Munirah
    Azemi, Saidatul Norlyana
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 1124 - 1134
  • [27] Suite of decision tree-based classification algorithms on cancer gene expression data
    Al Snousy, Mohmad Badr
    El-Deeb, Hesham Mohamed
    Badran, Khaled
    Al Khlil, Ibrahim Ali
    EGYPTIAN INFORMATICS JOURNAL, 2011, 12 (02) : 73 - 82
  • [28] Tree-based Ensemble Classifier Learning for Automatic Brain Glioma Segmentation
    Amiri, Samya
    Mahjoub, Mohamed Ali
    Rekik, Islem
    NEUROCOMPUTING, 2018, 313 : 135 - 142
  • [29] Adapted Deep Ensemble Learning-Based Voting Classifier for Osteosarcoma Cancer Classification
    Walid, Md. Abul Ala
    Mollick, Swarnali
    Shill, Pintu Chandra
    Baowaly, Mrinal Kanti
    Islam, Md. Rabiul
    Ahamad, Md. Martuza
    Othman, Manal A.
    Samad, Md Abdus
    DIAGNOSTICS, 2023, 13 (19)
  • [30] Natural mortality estimation using tree-based ensemble learning models
    Liu, Chanjuan
    Zhou, Shijie
    Wang, You-Gan
    Hu, Zhihua
    ICES JOURNAL OF MARINE SCIENCE, 2020, 77 (04) : 1414 - 1426