Density-based weighting for imbalanced regression

被引:0
|
作者
Michael Steininger
Konstantin Kobs
Padraig Davidson
Anna Krause
Andreas Hotho
机构
[1] University of Würzburg,Chair of Computer Science X
来源
Machine Learning | 2021年 / 110卷
关键词
Imbalanced regression; Cost-sensitive learning; Sample weighting; Kernel-density estimation; Supervised learning;
D O I
暂无
中图分类号
学科分类号
摘要
In many real world settings, imbalanced data impedes model performance of learning algorithms, like neural networks, mostly for rare cases. This is especially problematic for tasks focusing on these rare occurrences. For example, when estimating precipitation, extreme rainfall events are scarce but important considering their potential consequences. While there are numerous well studied solutions for classification settings, most of them cannot be applied to regression easily. Of the few solutions for regression tasks, barely any have explored cost-sensitive learning which is known to have advantages compared to sampling-based methods in classification tasks. In this work, we propose a sample weighting approach for imbalanced regression datasets called DenseWeight and a cost-sensitive learning approach for neural network regression with imbalanced data called DenseLoss based on our weighting scheme. DenseWeight weights data points according to their target value rarities through kernel density estimation (KDE). DenseLoss adjusts each data point’s influence on the loss according to DenseWeight, giving rare data points more influence on model training compared to common data points. We show on multiple differently distributed datasets that DenseLoss significantly improves model performance for rare data points through its density-based weighting scheme. Additionally, we compare DenseLoss to the state-of-the-art method SMOGN, finding that our method mostly yields better performance. Our approach provides more control over model training as it enables us to actively decide on the trade-off between focusing on common or rare cases through a single hyperparameter, allowing the training of better models for rare data points.
引用
收藏
页码:2187 / 2211
页数:24
相关论文
共 50 条
  • [1] Density-based weighting for imbalanced regression
    Steininger, Michael
    Kobs, Konstantin
    Davidson, Padraig
    Krause, Anna
    Hotho, Andreas
    [J]. MACHINE LEARNING, 2021, 110 (08) : 2187 - 2211
  • [2] Density-Based Logistic Regression
    Chen, Wenlin
    Chen, Yixin
    Mao, Yi
    Guo, Baolong
    [J]. 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 140 - 148
  • [3] A Density-Based Random Forest for Imbalanced Data Classification
    Dong, Jia
    Qian, Quan
    [J]. FUTURE INTERNET, 2022, 14 (03):
  • [4] Density-Based Discriminative Nonnegative Representation Model for Imbalanced Classification
    Li, Yanting
    Wang, Shuai
    Jin, Junwei
    Tao, Hongwei
    Nan, Jiaofen
    Wu, Huaiguang
    Chen, C. L. Philip
    [J]. NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [5] Two density-based sampling approaches for imbalanced and overlapping data
    Mayabadi, Sima
    Saadatfar, Hamid
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [6] Density-Based Discriminative Nonnegative Representation Model for Imbalanced Classification
    Yanting Li
    Shuai Wang
    Junwei Jin
    Hongwei Tao
    Jiaofen Nan
    Huaiguang Wu
    C. L. Philip Chen
    [J]. Neural Processing Letters, 56
  • [7] Kernel Density-Based Linear Regression Estimate
    Yao, Weixin
    Zhao, Zhibiao
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2013, 42 (24) : 4499 - 4512
  • [8] Hybrid density-based adaptive weighted collaborative representation for imbalanced learning
    Yanting Li
    Shuai Wang
    Junwei Jin
    Hongwei Tao
    Chuang Han
    C. L. Philip Chen
    [J]. Applied Intelligence, 2024, 54 : 4334 - 4351
  • [9] A gravitational density-based mass sharing method for imbalanced data classification
    Rahmati, Farshad
    Nezamabadi-pour, Hossein
    Nikpour, Bahareh
    [J]. SN APPLIED SCIENCES, 2020, 2 (02)
  • [10] Hybrid density-based adaptive weighted collaborative representation for imbalanced learning
    Li, Yanting
    Wang, Shuai
    Jin, Junwei
    Tao, Hongwei
    Han, Chuang
    Chen, C. L. Philip
    [J]. APPLIED INTELLIGENCE, 2024, 54 (05) : 4334 - 4351