Transfer learning for class imbalance problems with inadequate data

被引:0
|
作者
Samir Al-Stouhi
Chandan K. Reddy
机构
[1] Honda Automobile Technology Research,Department of Computer Science
[2] Wayne State University,undefined
来源
关键词
Rare class; Transfer learning; Class imbalance; AdaBoost; Weighted majority algorithm; HealthCare informatics; Text mining;
D O I
暂无
中图分类号
学科分类号
摘要
A fundamental problem in data mining is to effectively build robust classifiers in the presence of skewed data distributions. Class imbalance classifiers are trained specifically for skewed distribution datasets. Existing methods assume an ample supply of training examples as a fundamental prerequisite for constructing an effective classifier. However, when sufficient data are not readily available, the development of a representative classification algorithm becomes even more difficult due to the unequal distribution between classes. We provide a unified framework that will potentially take advantage of auxiliary data using a transfer learning mechanism and simultaneously build a robust classifier to tackle this imbalance issue in the presence of few training samples in a particular target domain of interest. Transfer learning methods use auxiliary data to augment learning when training examples are not sufficient and in this paper we will develop a method that is optimized to simultaneously augment the training data and induce balance into skewed datasets. We propose a novel boosting-based instance transfer classifier with a label-dependent update mechanism that simultaneously compensates for class imbalance and incorporates samples from an auxiliary domain to improve classification. We provide theoretical and empirical validation of our method and apply to healthcare and text classification applications.
引用
收藏
页码:201 / 228
页数:27
相关论文
共 50 条
  • [41] On the Performance of Oversampling Techniques for Class Imbalance Problems
    Kong, Jiawen
    Rios, Thiago
    Kowalczyk, Wojtek
    Menzel, Stefan
    Back, Thomas
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 84 - 96
  • [42] Systematic review of class imbalance problems in manufacturing
    de Giorgio, Andrea
    Cola, Gabriele
    Wang, Lihui
    JOURNAL OF MANUFACTURING SYSTEMS, 2023, 71 : 620 - 644
  • [43] Model validation failure in class imbalance problems
    Kang, Seokho
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 146
  • [44] MTSbag: A Method to Solve Class Imbalance Problems
    Hsiao, Yu-Hsiang
    Su, Chao-Ton
    Fu, Pin-Cheng
    Chen, Mu-Chen
    2018 7TH INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2018), 2018, : 524 - 529
  • [45] A hybrid approach to accelerate the classification accuracy of cervical cancer data with class imbalance problems
    Manoharan, J. Samuel
    Braveen, M.
    Subramanian, G. Ganesan
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2021, 25 (3-4) : 234 - 261
  • [46] Feature Selection with Class Hierarchy for Imbalance Problems
    Horio, Tomoya
    Kudo, Mineichi
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2021, 13055 : 229 - 238
  • [47] SWSEL: Sliding Window-based Selective Ensemble Learning for class-imbalance problems
    Dai, Qi
    Liu, Jian-wei
    Yang, Jia-Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [48] FedGen: Personalized federated learning with data generation for enhanced model customization and class imbalance
    Zhao, Peng
    Guo, Shaocong
    Li, Yanan
    Yang, Shusen
    Ren, Xuebin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 164
  • [49] Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
    Seol, Mihye
    Kim, Taejoon
    SENSORS, 2023, 23 (03)
  • [50] Measuring the class-imbalance extent of multi-class problems
    Ortigosa-Hernandez, Jonathan
    Inza, Inaki
    Lozano, Jose A.
    PATTERN RECOGNITION LETTERS, 2017, 98 : 32 - 38