Finding the Proverbial Needle: Improving Minority Class Identification Under Extreme Class Imbalance

被引:0
|
作者
Trent Geisler
Herman Ray
Ying Xie
机构
[1] United States Military Academy,Department of Systems Engineering
[2] Kennesaw State University,College of Computing and Software Engineering, School of Data Science and Analytics
[3] Kennesaw State University,College of Computing and Software Engineering, Department of Information Technology
来源
Journal of Classification | 2023年 / 40卷
关键词
Statistical machine learning; Imbalanced learning; Logistic regression; Binary classification; Weighted loss function;
D O I
暂无
中图分类号
学科分类号
摘要
Imbalanced learning problems typically consist of data with skewed class distributions, coupled with large misclassification costs for the rare events. For binary classification, logistic regression is a common supervised learning technique chosen to perform this task. Unfortunately, the model performs poorly on classification tasks when class distributions are highly imbalanced. To improve this generalization, we implement a novel instance-level weighting methodology for the minority class in the loss function. We build our method from a recently published, locally weighted log-likelihood objective function, where each of the minority class weights are learned from the data. We improve upon this previous approach by creating a convex and hyperparameter-free loss function that improves generalization performance for datasets exhibiting extreme class imbalance.
引用
收藏
页码:192 / 212
页数:20
相关论文
共 50 条
  • [21] Transfer synthetic over-sampling for class-imbalance learning with limited minority class data
    Xu-Ying Liu
    Sheng-Tao Wang
    Min-Ling Zhang
    [J]. Frontiers of Computer Science, 2019, 13 : 996 - 1009
  • [22] Decision tree induction based on minority entropy for the class imbalance problem
    Boonchuay, Kesinee
    Sinapiromsaran, Krung
    Lursinsap, Chidchanok
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (03) : 769 - 782
  • [23] Decision tree induction based on minority entropy for the class imbalance problem
    Kesinee Boonchuay
    Krung Sinapiromsaran
    Chidchanok Lursinsap
    [J]. Pattern Analysis and Applications, 2017, 20 : 769 - 782
  • [24] Extreme minority class detection in imbalanced data for network intrusion
    Milosevic, Marija S.
    Ciric, Vladimir M.
    [J]. COMPUTERS & SECURITY, 2022, 123
  • [25] Detection of Korean Phishing Messages Using Biased Discriminant Analysis under Extreme Class Imbalance Problem
    Kim, Siyoon
    Park, Jeongmin
    Ahn, Hyun
    Lee, Yonggeol
    [J]. INFORMATION, 2024, 15 (05)
  • [26] OBMI: oversampling borderline minority instances by a two-stage Tomek link-finding procedure for class imbalance problem
    Leng, Qiangkui
    Guo, Jiamei
    Tao, Jiaqing
    Meng, Xiangfu
    Wang, Changzhong
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 4775 - 4792
  • [27] Improving classification of mature microRNA by solving class imbalance problem
    Wang, Ying
    Li, Xiaoye
    Tao, Bairui
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [28] Improving classification of mature microRNA by solving class imbalance problem
    Ying Wang
    Xiaoye Li
    Bairui Tao
    [J]. Scientific Reports, 6
  • [29] Improving Performance Prediction on Education Data with Noise and Class Imbalance
    Radwan, Akram M.
    Cataltepe, Zehra
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2018, 24 (04): : 777 - 784
  • [30] Weighted Online Sequential Extreme Learning Machine for Class Imbalance Learning
    Mirza, Bilal
    Lin, Zhiping
    Toh, Kar-Ann
    [J]. NEURAL PROCESSING LETTERS, 2013, 38 (03) : 465 - 486