A weighted ensemble-based active learning model to label microarray data

被引:0
|
作者
Rajonya De
Anuran Chakraborty
Agneet Chatterjee
Ram Sarkar
机构
[1] Jadavpur University,Computer Science and Engineering
关键词
Active learning; Classifier ensemble; Gene expression; Cancer classification;
D O I
暂无
中图分类号
学科分类号
摘要
Classification of cancerous genes from microarray data is an important research area in bioinformatics. Large amount of microarray data are available, but it is very costly to label them. This paper proposes an active learning model, a semi-supervised classification approach, to label the microarray data using which predictions can be made even with lesser amount of labeled data. Initially, a pool of unlabeled instances is given from which some instances are randomly chosen for labeling. Successive selection of instances to be labeled from unlabeled pool is determined by selection algorithms. The proposed method is devised following an ensemble approach to combine the decisions of three classifiers in order to arrive at a consensus which provides a more accurate prediction of the class label to ensure that each individual classifier learns in an uncorrelated manner. Our method combines the heuristic techniques used by an active learning algorithm to choose training samples with the multiple learning paradigm attained by an ensemble to optimize the search space by choosing efficiently from an already sparse learning pool. On evaluating the proposed method on 10 microarray datasets, we achieve performance which is comparable with state-of-the-art methods. The code and datasets are given at https://github.com/anuran-Chakraborty/Active-learning.
引用
收藏
页码:2427 / 2441
页数:14
相关论文
共 50 条
  • [1] A weighted ensemble-based active learning model to label microarray data
    De, Rajonya
    Chakraborty, Anuran
    Chatterjee, Agneet
    Sarkar, Ram
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (10) : 2427 - 2441
  • [2] An Active Learning Approach for Ensemble-based Data Stream Mining
    Alabdulrahman, Rabaa
    Viktor, Herna
    Paquet, Eric
    [J]. KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 275 - 282
  • [3] Ensemble-based active learning for parse selection
    Osborne, M
    Baldridge, J
    [J]. HLT-NAACL 2004: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, 2004, : 89 - 96
  • [4] An Ensemble-based Active Learning for Breast Cancer Classification
    Lee, Sanghoon
    Amgad, Mohamed
    Masoud, Mohamed
    Subramanian, Rajasekaran
    Gutman, David
    Cooper, Lee
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2549 - 2553
  • [5] Multi-class ensemble-based active learning
    Koerner, Christine
    Wrobel, Stefan
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 687 - 694
  • [6] Deep Anomaly Detection with Ensemble-Based Active Learning
    Tang, Xuning
    Astle, Yihua Shi
    Freeman, Craig
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1663 - 1670
  • [7] An ensemble-based incremental learning approach to data fusion
    Parikh, Devi
    Polikar, Robi
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02): : 437 - 450
  • [8] Ensemble-based Classifiers for Cancer Classification Using Human Tumor Microarray Data
    Margoosian, Argin
    Abouei, Jamshid
    [J]. 2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [9] Ensemble-based data assimilation
    Zhang, Fuqing
    Snyder, Chris
    [J]. BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2007, 88 (04) : 565 - 568
  • [10] An Ensemble-based Approach to Fast Classification of Multi-label Data Streams
    Kong, Xiangnan
    Yu, Philip S.
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING (COLLABORATECOM), 2011, : 95 - 104