IRDA: Implicit data augmentation for deep imbalanced regression

被引:1
|
作者
Zhu, Weiyao [1 ]
Wu, Ou [1 ]
Yang, Nan [1 ]
机构
[1] Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
关键词
Deep imbalanced regression; Implicit data augmentation; Regularization; Regression loss;
D O I
10.1016/j.ins.2024.120873
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imbalanced data distributions are prevalent in real -world classification and regression tasks. Data augmentation is a commonly employed technique to mitigate this issue, with implicit methods gaining attention for their effectiveness and efficiency. However, implicit data augmentation methods have not been extensively explored in the context of regression tasks. To address this gap, we introduce IRDA, a novel learning method for regression that incorporates implicit data augmentation. Our approach includes developing a new augmentation strategy specifically tailored for deep imbalanced regression tasks, and a regression loss function that is suitable for real -world data with imbalanced label distributions and non -uniformly distributed features. We derive an easily computable surrogate loss and propose two implicit data augmentation algorithms, one incorporating meta -learning and one without. Additionally, we provide regularization perspective to offer a deeper understanding of IRDA. We evaluate IRDA on five datasets, including a large-scale dataset, demonstrating its effectiveness in mitigating the adverse effects of imbalanced data distribution and its adaptability to various regression tasks.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Leveraging GANs data augmentation for imbalanced medical image classification
    Ding, Hongwei
    Huang, Nana
    Cui, Xiaohui
    APPLIED SOFT COMPUTING, 2024, 165
  • [32] Few-shot imbalanced classification based on data augmentation
    Xuewei Chao
    Lixin Zhang
    Multimedia Systems, 2023, 29 : 2843 - 2851
  • [33] Let Multi-classification Help Deep Imbalanced Regression
    Lin, Dekun
    Peng, Tailai
    Chen, Rui
    Xie, Xinran
    Cui, Zhe
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT III, 2024, 15018 : 430 - 447
  • [34] Handling imbalanced textual data: an attention-based data augmentation approach
    Sah, Amit Kumar
    Abulaish, Muhammad
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [35] An imbalanced data learning approach for tool wear monitoring based on data augmentation
    Zhang, Bowen
    Liu, Xianli
    Yue, Caixu
    Liu, Shaoyang
    Li, Xuebing
    Liang, Steven Y.
    Wang, Lihui
    JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (01) : 399 - 420
  • [36] A Study on the Impact of Data Characteristics in Imbalanced Regression Tasks
    Branco, Paula
    Torgo, Luis
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 193 - 202
  • [37] An imbalanced data learning approach for tool wear monitoring based on data augmentation
    Zhang, Bowen
    Liu, Xianli
    Yue, Caixu
    Liu, Shaoyang
    Li, Xuebing
    Liang, Steven Y.
    Wang, Lihui
    JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (01) : 399 - 420
  • [38] Multi-output regression for imbalanced data stream
    Peng, Tao
    Sellami, Sana
    Boucelma, Omar
    Chbeir, Richard
    EXPERT SYSTEMS, 2023, 40 (10)
  • [39] Chebyshev approaches for imbalanced data streams regression models
    Ehsan Aminian
    Rita P. Ribeiro
    João Gama
    Data Mining and Knowledge Discovery, 2021, 35 : 2389 - 2466
  • [40] Chebyshev approaches for imbalanced data streams regression models
    Aminian, Ehsan
    Ribeiro, Rita P.
    Gama, Joao
    DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (06) : 2389 - 2466