Transfer learning for class imbalance problems with inadequate data

被引:0
|
作者
Samir Al-Stouhi
Chandan K. Reddy
机构
[1] Honda Automobile Technology Research,Department of Computer Science
[2] Wayne State University,undefined
来源
关键词
Rare class; Transfer learning; Class imbalance; AdaBoost; Weighted majority algorithm; HealthCare informatics; Text mining;
D O I
暂无
中图分类号
学科分类号
摘要
A fundamental problem in data mining is to effectively build robust classifiers in the presence of skewed data distributions. Class imbalance classifiers are trained specifically for skewed distribution datasets. Existing methods assume an ample supply of training examples as a fundamental prerequisite for constructing an effective classifier. However, when sufficient data are not readily available, the development of a representative classification algorithm becomes even more difficult due to the unequal distribution between classes. We provide a unified framework that will potentially take advantage of auxiliary data using a transfer learning mechanism and simultaneously build a robust classifier to tackle this imbalance issue in the presence of few training samples in a particular target domain of interest. Transfer learning methods use auxiliary data to augment learning when training examples are not sufficient and in this paper we will develop a method that is optimized to simultaneously augment the training data and induce balance into skewed datasets. We propose a novel boosting-based instance transfer classifier with a label-dependent update mechanism that simultaneously compensates for class imbalance and incorporates samples from an auxiliary domain to improve classification. We provide theoretical and empirical validation of our method and apply to healthcare and text classification applications.
引用
收藏
页码:201 / 228
页数:27
相关论文
共 50 条
  • [21] Class-Imbalance Adversarial Transfer Learning Network for Cross-Domain Fault Diagnosis with Imbalanced Data
    Kuang, Jiachen
    Xu, Guanghua
    Tao, Tangfei
    Wu, Qingqiang
    IEEE Transactions on Instrumentation and Measurement, 2022, 71
  • [22] Class-Imbalance Adversarial Transfer Learning Network for Cross-Domain Fault Diagnosis With Imbalanced Data
    Kuang, Jiachen
    Xu, Guanghua
    Tao, Tangfei
    Wu, Qingqiang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [23] A Comparative Study of One-Class Classifiers in Machine Learning Problems with Extreme Class Imbalance
    Sotiropoulos, Dionysios
    Giannoulis, Christos
    Tsihrintzis, George A.
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 362 - 364
  • [24] A New Performance Measure for Class Imbalance Learning. Application to Bioinformatics Problems
    Batuwita, Rukshan
    Palade, Vasile
    EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 545 - 550
  • [25] On the existence of a threshold in class imbalance problems
    Silva, Evandro J. R.
    Zanchettin, Cleber
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2714 - 2719
  • [26] Properties of a GP Active Learning Framework for Streaming Data with Class Imbalance
    Khanchi, Sara
    Heywood, Malcolm I.
    Zincir-Heywood, A. Nur
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 945 - 952
  • [27] Transfer learning with one-class data
    Chen, Jixu
    Liu, Xiaoming
    PATTERN RECOGNITION LETTERS, 2014, 37 : 32 - 40
  • [28] A Learning Framework for Online Class Imbalance Learning
    Wang, Shuo
    Minku, Leandro L.
    Yao, Xin
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND ENSEMBLE LEARNING (CIEL), 2013, : 36 - 45
  • [29] The class imbalance problem in deep learning
    Ghosh, Kushankur
    Bellinger, Colin
    Corizzo, Roberto
    Branco, Paula
    Krawczyk, Bartosz
    Japkowicz, Nathalie
    MACHINE LEARNING, 2024, 113 (07) : 4845 - 4901
  • [30] On dynamic ensemble selection and data preprocessing for multi-class imbalance learning
    Cruz, Rafael M. O.
    Sabourin, Robert
    Cavalcanti, George D. C.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 189 - 194