Finding the Proverbial Needle: Improving Minority Class Identification Under Extreme Class Imbalance

被引:0
|
作者
Trent Geisler
Herman Ray
Ying Xie
机构
[1] United States Military Academy,Department of Systems Engineering
[2] Kennesaw State University,College of Computing and Software Engineering, School of Data Science and Analytics
[3] Kennesaw State University,College of Computing and Software Engineering, Department of Information Technology
来源
Journal of Classification | 2023年 / 40卷
关键词
Statistical machine learning; Imbalanced learning; Logistic regression; Binary classification; Weighted loss function;
D O I
暂无
中图分类号
学科分类号
摘要
Imbalanced learning problems typically consist of data with skewed class distributions, coupled with large misclassification costs for the rare events. For binary classification, logistic regression is a common supervised learning technique chosen to perform this task. Unfortunately, the model performs poorly on classification tasks when class distributions are highly imbalanced. To improve this generalization, we implement a novel instance-level weighting methodology for the minority class in the loss function. We build our method from a recently published, locally weighted log-likelihood objective function, where each of the minority class weights are learned from the data. We improve upon this previous approach by creating a convex and hyperparameter-free loss function that improves generalization performance for datasets exhibiting extreme class imbalance.
引用
收藏
页码:192 / 212
页数:20
相关论文
共 50 条
  • [1] Finding the Proverbial Needle: Improving Minority Class Identification Under Extreme Class Imbalance
    Geisler, Trent
    Ray, Herman
    Xie, Ying
    [J]. JOURNAL OF CLASSIFICATION, 2023, 40 (01) : 192 - 212
  • [2] Needle in a Haystack: Label-Efficient Evaluation under Extreme Class Imbalance
    Marchant, Neil G.
    Rubinstein, Benjamin I. P.
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1180 - 1190
  • [3] Handling Class Imbalance by Estimating Minority Class Statistics
    Ansari, Faizanuddin
    Das, Swagatam
    Shamsolmoali, Pourya
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [4] Hyperparameter Optimisation for Improving Classification under Class Imbalance
    Kong, Jiawen
    Kowalczyk, Wojtek
    Duc Anh Nguyen
    Back, Thomas
    Menzel, Stefan
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 3072 - 3078
  • [5] Class imbalance and the curse of minority hubs
    Tomasev, Nenad
    Mladenic, Dunja
    [J]. KNOWLEDGE-BASED SYSTEMS, 2013, 53 : 157 - 172
  • [6] Code Quality Prediction Under Super Extreme Class Imbalance
    Lee, Noah
    Abreu, Rui
    Nagappan, Nachiappan
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2022), 2022, : 99 - 104
  • [7] A Machine Learning Approach for Dark-Matter Particle Identification Under Extreme Class Imbalance
    Sutrisno, Raymond
    Vilalta, Ricardo
    Renshaw, Andrew
    [J]. ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XXVIII, 2019, 523 : 115 - 118
  • [8] Extreme Learning Machine prediction under high class imbalance in bioinformatics
    Rodriguez, T.
    Di Persia, L. E.
    Milone, D. H.
    Stegmayer, G.
    [J]. 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [9] Resampling Techniques for Learning Under Extreme Verification Latency with Class Imbalance
    Frederickson, Christopher
    Polikar, Robi
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [10] Equilibrating the Recognition of the Minority Class in the Imbalance Context
    Cleofas-Sanchez, L.
    Camacho-Nieto, O.
    Sanchez-Garreta, J. S.
    Yanez-Marquez, C.
    Valdovinos-Rosas, R. M.
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2014, 8 (01): : 27 - 36