A novel oversampling technique based on the manifold distance for class imbalance learning

被引:2
|
作者
Guo, Yinan [1 ]
Jiao, Botao [1 ]
Yang, Lingkai [1 ]
Cheng, Jian [2 ]
Yang, Shengxiang [3 ]
Tang, Fengzhen [4 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou, Jiangsu, Peoples R China
[2] China Coal Res Inst, Beijing 100013, Peoples R China
[3] De Montfort Univ, Leicester LE1 9BH, Leics, England
[4] Shenyang Inst Automat, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
class imbalance learning; oversampling; manifold learning; overlapping; small disjunction; OPTIMIZATION; ENSEMBLE;
D O I
10.1504/IJBIC.2021.119197
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Oversampling is a popular problem-solver for class imbalance learning by generating more minority samples to balance the dataset size of different classes. However, resampling in original space is ineffective for the imbalance datasets with class overlapping or small disjunction. Based on this, a novel oversampling technique based on manifold distance is proposed, in which a new minority sample is produced in terms of the distances among neighbours in manifold space, rather than the Euclidean distance among them. After mapping the original data to its manifold structure, the overlapped majority and minority samples will lie in areas easily being partitioned. In addition, the new samples are generated based on the neighbours locating nearby in manifold space, avoiding the adverse effect of the disjoint minority classes. Following that, an adaptive adjustment method is presented to determine the number of the newly generated minority samples according to the distribution density of the matched-pair data. The experimental results on 48 imbalanced datasets indicate that the proposed oversampling technique has the better classification accuracy.
引用
收藏
页码:131 / 142
页数:12
相关论文
共 50 条
  • [31] Oversampling Methods to Handle the Class Imbalance Problem: A Review
    Sharma, Harsh
    Gosain, Anushika
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 96 - 110
  • [32] CCO: A Cluster Core-Based Oversampling Technique for Improved Class-Imbalanced Learning
    Mondal, Priyobrata
    Ansari, Faizanuddin
    Das, Swagatam
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 1 - 13
  • [33] Weight Decision Algorithm for Oversampling Technique on Class-Imbalanced Learning
    Kang, Young-Il
    Won, Sangchul
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 182 - 186
  • [34] Synthetic minority oversampling technique for multiclass imbalance problems
    Zhu, Tuanfei
    Lin, Yaping
    Liu, Yonghe
    PATTERN RECOGNITION, 2017, 72 : 327 - 340
  • [35] A Tailored Particle Swarm and Egyptian Vulture Optimization-Based Synthetic Minority-Oversampling Technique for Class Imbalance Problem
    Rout, Subhashree
    Mallick, Pradeep Kumar
    Reddy, Annapareddy V. N.
    Kumar, Sachin
    INFORMATION, 2022, 13 (08)
  • [36] Manifold-based synthetic oversampling with manifold conformance estimation
    Bellinger, Colin
    Drummond, Christopher
    Japkowicz, Nathalie
    MACHINE LEARNING, 2018, 107 (03) : 605 - 637
  • [37] Manifold-based synthetic oversampling with manifold conformance estimation
    Colin Bellinger
    Christopher Drummond
    Nathalie Japkowicz
    Machine Learning, 2018, 107 : 605 - 637
  • [38] Minority Sub-Region Estimation-Based Oversampling for Imbalance Learning
    Sun, Yi
    Cai, Lijun
    Liao, Bo
    Zhu, Wen
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (05) : 2324 - 2334
  • [39] Manifold cluster-based evolutionary ensemble imbalance learning
    Guo, Yinan
    Feng, Jiawei
    Jiao, Botao
    Yang, Linkai
    Lu, Hui
    Yu, Zekuan
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159
  • [40] Manifold cluster-based evolutionary ensemble imbalance learning
    Guo, Yinan
    Feng, Jiawei
    Jiao, Botao
    Yang, Linkai
    Lu, Hui
    Yu, Zekuan
    Computers and Industrial Engineering, 2021, 159