A weighted ensemble-based active learning model to label microarray data

Cited by: 0
Authors
Rajonya De
Anuran Chakraborty
Agneet Chatterjee
Ram Sarkar
Affiliations
[1] Jadavpur University, Computer Science and Engineering
Keywords
Active learning; Classifier ensemble; Gene expression; Cancer classification
DOI
Not available
Abstract
Classification of cancerous genes from microarray data is an important research area in bioinformatics. Large amounts of microarray data are available, but labeling them is very costly. This paper proposes an active learning model, a semi-supervised classification approach, for labeling microarray data, so that predictions can be made even with a small amount of labeled data. Initially, a pool of unlabeled instances is given, from which some instances are randomly chosen for labeling. Subsequent instances to be labeled are then picked from the unlabeled pool by selection algorithms. The proposed method follows an ensemble approach: it combines the decisions of three classifiers to arrive at a consensus that gives a more accurate prediction of the class label, while ensuring that each individual classifier learns in an uncorrelated manner. Our method combines the heuristic techniques that an active learning algorithm uses to choose training samples with the multiple-learning paradigm attained by an ensemble, optimizing the search space by choosing efficiently from an already sparse learning pool. Evaluated on 10 microarray datasets, the proposed method achieves performance comparable with state-of-the-art methods. The code and datasets are available at https://github.com/anuran-Chakraborty/Active-learning.
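The loop described in the abstract (random initial labels, then repeated selection from the unlabeled pool, with three classifiers combined by weighted vote) can be illustrated with a minimal sketch. The abstract does not name the three classifiers, the weighting scheme, or the selection algorithms, so everything concrete here is an assumption made for illustration: nearest-centroid, 1-NN and 3-NN as ensemble members, leave-one-out accuracy as the vote weight, smallest weighted-vote margin as the query rule, and synthetic two-class data instead of microarray data. It is not the authors' implementation, which is in the repository linked above.

```python
import random

def dist2(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def fit_centroid(X, y):
    """Nearest-centroid classifier; returns a predict(x) function."""
    cents = {}
    for c in set(y):
        rows = [x for x, lab in zip(X, y) if lab == c]
        cents[c] = [sum(col) / len(rows) for col in zip(*rows)]
    return lambda x: min(sorted(cents), key=lambda c: dist2(x, cents[c]))

def make_knn(k):
    """k-nearest-neighbour factory; returns a fit(X, y) function."""
    def fit(X, y):
        def predict(x):
            near = sorted(zip(X, y), key=lambda p: dist2(x, p[0]))[:k]
            labs = [lab for _, lab in near]
            return max(sorted(set(labs)), key=labs.count)
        return predict
    return fit

# Assumed ensemble members; the abstract does not say which three are used.
FITTERS = [fit_centroid, make_knn(1), make_knn(3)]

def loo_accuracy(fit, X, y):
    """Leave-one-out accuracy on the labeled set (assumed vote weight)."""
    hits = 0
    for i in range(len(X)):
        model = fit(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        hits += model(X[i]) == y[i]
    return hits / len(X)

def vote(models, weights, x, classes):
    """Add each classifier's weight to the class it predicts."""
    scores = {c: 0.0 for c in classes}
    for model, w in zip(models, weights):
        scores[model(x)] += w
    return scores

def active_learn(X, oracle, classes, n_init=4, n_queries=10, seed=0):
    """Pool-based active learning; needs at least two classes."""
    rng = random.Random(seed)
    labeled = rng.sample(range(len(X)), n_init)  # random initial labels
    y = {i: oracle(i) for i in labeled}
    for step in range(n_queries + 1):
        Xl, yl = [X[i] for i in labeled], [y[i] for i in labeled]
        weights = [loo_accuracy(f, Xl, yl) + 1e-9 for f in FITTERS]
        models = [f(Xl, yl) for f in FITTERS]
        if step == n_queries:
            break  # final refit done, no further query
        pool = [i for i in range(len(X)) if i not in y]

        def margin(i):
            s = sorted(vote(models, weights, X[i], classes).values())
            return s[-1] - s[-2]  # small margin = ensemble is unsure

        query = min(pool, key=margin)  # assumed selection rule
        y[query] = oracle(query)       # ask the annotator for one label
        labeled.append(query)

    def predict(x):
        scores = vote(models, weights, x, classes)
        return max(classes, key=lambda c: scores[c])

    return predict, labeled, weights

# Synthetic stand-in for a labeled dataset: two Gaussian clusters.
rng = random.Random(1)
X = [[rng.gauss(m, 1.0), rng.gauss(m, 1.0)] for m in (0.0, 4.0) for _ in range(20)]
truth = [0] * 20 + [1] * 20
predict, labeled, weights = active_learn(X, lambda i: truth[i], classes=[0, 1])
accuracy = sum(predict(x) == t for x, t in zip(X, truth)) / len(X)
print(len(labeled), [round(w, 2) for w in weights], accuracy)
```

Here `oracle` stands in for the costly human labeling step: it is called once per queried instance, so the number of calls (`n_init + n_queries`) is the labeling budget. Swapping in different classifiers or a different query rule only requires changing `FITTERS` or `margin`.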
Pages: 2427-2441
Page count: 14