Imbalanced Class Learning in Epigenetics

被引:13
|
作者
Haque, M. Muksitul [1 ,2 ]
Skinner, Michael K. [1 ]
Holder, Lawrence B. [2 ]
机构
[1] Washington State Univ, Sch Biol Sci, Ctr Reprod Biol, Pullman, WA 99164 USA
[2] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164 USA
关键词
biology; computational molecular biology; DNA; genomics; machine earning; TRANSGENERATIONAL INHERITANCE; CLASSIFICATION; DISEASE; TARGETS;
D O I
10.1089/cmb.2014.0008
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In machine learning, one of the important criteria for higher classification accuracy is a balanced dataset. Datasets with a large ratio between minority and majority classes face hindrance in learning using any classifier. Datasets having a magnitude difference in number of instances between the target concept result in an imbalanced class distribution. Such datasets can range from biological data, sensor data, medical diagnostics, or any other domain where labeling any instances of the minority class can be time-consuming or costly or the data may not be easily available. The current study investigates a number of imbalanced class algorithms for solving the imbalanced class distribution present in epigenetic datasets. Epigenetic (DNA methylation) datasets inherently come with few differentially DNA methylated regions (DMR) and with a higher number of non-DMR sites. For this class imbalance problem, a number of algorithms are compared, including the TAN+AdaBoost algorithm. Experiments performed on four epigenetic datasets and several known datasets show that an imbalanced dataset can have similar accuracy as a regular learner on a balanced dataset.
引用
收藏
页码:492 / 507
页数:16
相关论文
共 50 条
  • [1] Boundary Focal Loss for Class Imbalanced Learning
    Lin, Weizhong
    Wu, Peng
    Xiao, Xuan
    2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021), 2021,
  • [2] Defying Imbalanced Forgetting in Class Incremental Learning
    Xu, Shixiong
    Meng, Gaofeng
    Nie, Xing
    Ni, Bolin
    Fan, Bin
    Xiang, Shiming
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 16211 - 16219
  • [3] Fairness-aware Class Imbalanced Learning
    Subramanian, Shivashankar
    Rahimi, Afshin
    Baldwin, Timothy
    Cohn, Trevor
    Frermann, Lea
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2045 - 2051
  • [4] Adapting MultiBoost Ensemble for Class Imbalanced Learning
    Mustafa, Ghulam
    Niu, Zhendong
    Chen, Jie
    2015 IEEE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2015, : 12 - 17
  • [5] Adjusting Decision Boundary for Class Imbalanced Learning
    Kim, Byungju
    Kim, Junmo
    IEEE ACCESS, 2020, 8 : 81674 - 81685
  • [6] Minority Class Oriented Active Learning for Imbalanced Datasets
    Aggarwal, Umang
    Popescu, Adrian
    Hudelot, Celine
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9920 - 9927
  • [7] Class Rectification Hard Mining for Imbalanced Deep Learning
    Dong, Qi
    Gong, Shaogang
    Zhu, Xiatian
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1869 - 1878
  • [8] Margin calibration in SVM class-imbalanced learning
    Yang, Chan-Yun
    Yang, Jr-Syu
    Wang, Jian-Jun
    NEUROCOMPUTING, 2009, 73 (1-3) : 397 - 411
  • [9] Dynamically balancing class losses in imbalanced deep learning
    Zhao, Yaochi
    Liu, Shiguang
    Hu, Zhuhua
    ELECTRONICS LETTERS, 2022, 58 (05) : 203 - 206
  • [10] Prototypical Classifier for Robust Class-Imbalanced Learning
    Wei, Tong
    Shi, Jiang-Xin
    Li, Yu-Feng
    Zhang, Min-Ling
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 44 - 57