Ensemble learning-based classification of microarray cancer data on tree-based features

被引:18
|
作者
Dagnew, Guesh [1 ]
Shekar, B. H. [1 ]
机构
[1] Mangalore Univ, Dept Studies & Res Comp Sci, Mangalore, Karnataka, India
关键词
PARTICLE SWARM OPTIMIZATION; GENE-EXPRESSION DATA; FEATURE-SELECTION; ALGORITHM;
D O I
10.1049/ccs2.12003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cancer is a group of related diseases with high mortality rate characterized by abnormal cell growth which attacks the body tissues. Microarray cancer data is a prominent research topic across many disciplines focused to address problems related to the higher curse of dimensionality, a small number of samples, noisy data and imbalance class. A random forest (RF) tree-based feature selection and ensemble learning based on hard voting and soft voting is proposed to classify microarray cancer data using six different base classifiers. The selected features due to RF tree are submitted to the base classifiers as the training set. Then, an ensemble learning method is applied to the base classifiers in which case each base classifier predicts class label individually. The final prediction is carried out hard and soft voting techniques that use majority voting and weighted probability on the test set. The proposed ensemble learning method is validated on eight different standard microarray cancer datasets, of which three of the datasets are binary class and the remaining five datasets are multi-class datasets. Experimental results of the proposed method show 1.00 classification accuracy on six of the datasets and 0.96 on two of the datasets.
引用
下载
收藏
页码:48 / 60
页数:13
相关论文
共 50 条
  • [41] Query-Based Versus Tree-Based Classification: Application to Banking Data
    Masyutin, Alexey
    Kashnitsky, Yury
    FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 664 - 673
  • [42] Comparison of tree-based ensemble models for regression
    Park, Sangho
    Kim, Chanmin
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2022, 29 (05) : 561 - 590
  • [43] Tree-based classification of tabla strokes
    Deolekar, Subodh
    Abraham, Siby
    CURRENT SCIENCE, 2018, 115 (09): : 1724 - 1731
  • [44] Ensemble learning and hierarchical data representation for microarray classification
    Bosio, Mattia
    Bellot, Pau
    Salembier, Philippe
    Oliveras Verges, Albert
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2013,
  • [45] Tree-Based Vehicle Classification System
    Saripan, Kiatkachorn
    Nuthong, Chaiwat
    2017 14TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2017, : 439 - 442
  • [46] Tree-based signatures for shape classification
    Bauckhage, Christian
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2105 - 2108
  • [47] On the quality of tree-based protein classification
    Lazareva-Ulitsky, B
    Diemer, K
    Thomas, PD
    BIOINFORMATICS, 2005, 21 (09) : 1876 - 1890
  • [48] Learning in data-limited multimodal scenarios: Scandent decision forests and tree-based features
    Hor, Soheil
    Moradi, Mehdi
    MEDICAL IMAGE ANALYSIS, 2016, 34 : 30 - 41
  • [49] Forecasting regional in-situ thermal conductivity of soil based on tree-based ensemble learning
    Li, Xuquan
    Gong, Mingyu
    Dong, Jierui
    Zhou, Ziyi
    Han, Bo
    Yu, Huili
    INTERNATIONAL COMMUNICATIONS IN HEAT AND MASS TRANSFER, 2024, 159
  • [50] Potential of Ensemble Learning to Improve Tree-Based Classifiers for Landslide Susceptibility Mapping
    Song, Jiahui
    Wang, Yi
    Fang, Zhice
    Peng, Ling
    Hong, Haoyuan
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 4642 - 4662