Maximal Information Coefficient-Based Undersampling Method for Highly-Imbalanced Learning

被引:0
|
作者
Qin, Haiou [1 ,2 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Jiangxi Prov Key Lab Smart Water Conservancy, Nanchang 330099, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Microwave integrated circuits; Generative adversarial networks; Noise measurement; Machine learning algorithms; Classification algorithms; Training; Software packages; Shape; Sensitivity; Sampling methods; Imbalanced classification; imbalanced learning; maximal information coefficient; maximal information coefficient-based undersampling; undersampling; MACHINE;
D O I
10.1109/ACCESS.2025.3525475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning from highly-imbalanced datasets is still a big challenge in the field of machine learning because models created by general learning algorithms are weak in recognizing the samples from the minority class correctly. Undersampling is an alternative kind of methods to deal with imbalanced learning. In this paper, we propose a new undersampling method based on maximal information coefficient (including two algorithms MICU-1 and MICU-2) to rebalance the datasets. In order to evaluate the effectiveness of the method, 20 highly- imbalanced datasets are used for the benchmarks. Results show that compared with other undersampling methods, maximal information coefficient-based undersampling method are competitive in terms of G-mean and F-measure.
引用
收藏
页码:4126 / 4135
页数:10
相关论文
共 50 条
  • [41] Correlation Coefficient-Based Information Criterion for Quantification of Dependence Characteristics in Hydrological Time Series
    Xie, Ping
    Huo, Jingqun
    Sang, Yan-Fang
    Li, Yaqing
    Chen, Jie
    Wu, Ziyi
    Singh, Vijay P.
    WATER RESOURCES RESEARCH, 2022, 58 (07)
  • [42] Moving correlation coefficient-based method for the identification of periodicities in hydrologic time series
    Wu, Linqian
    Xie, Ping
    Wu, Ziyi
    Sang, Yanfang
    Chen, Jie
    Zhao, Yuxi
    Li, Yaqing
    Wang, Lu
    CHINESE SCIENCE BULLETIN-CHINESE, 2019, 64 (24): : 2549 - 2560
  • [43] Combined oversampling and undersampling method based on slow-start algorithm for imbalanced network traffic
    Seunghyun Park
    Hyunhee Park
    Computing, 2021, 103 : 401 - 424
  • [44] Combined oversampling and undersampling method based on slow-start algorithm for imbalanced network traffic
    Park, Seunghyun
    Park, Hyunhee
    COMPUTING, 2021, 103 (03) : 401 - 424
  • [45] Anomaly Detection Method Based on Clustering Undersampling and Ensemble Learning
    Huan, Wenming
    Lin, Haitao
    Lie, Haixue
    Zhou, Yan
    Wang, Yiming
    PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 980 - 984
  • [46] Sharp learning rates of coefficient-based lq-regularized regression with indefinite kernels
    LV ShaoGao
    SHI DaiMin
    XIAO QuanWu
    ZHANG MingShan
    ScienceChina(Mathematics), 2013, 56 (08) : 1560 - 1577
  • [47] Sharp learning rates of coefficient-based lq-regularized regression with indefinite kernels
    ShaoGao Lv
    DaiMin Shi
    QuanWu Xiao
    MingShan Zhang
    Science China Mathematics, 2013, 56 : 1557 - 1574
  • [48] Railway Accidents Analysis and Prevention Based on the Maximal Information Coefficient
    Shao Fubo
    Li Kepig
    STATISTIC APPLICATION IN MODERN SOCIETY, 2015, : 213 - 218
  • [49] Improved heuristic equivalent search algorithm based on Maximal Information Coefficient for Bayesian Network Structure Learning
    Zhang, Yinghua
    Zhang, Wensheng
    Xie, Yuan
    NEUROCOMPUTING, 2013, 117 : 186 - 195
  • [50] Highly Imbalanced Railway Station Structural Damage Monitoring Based on Cluster-Based Undersampling and Siamese Artificial Neural Network
    Chen, Yanchun
    Zhang, Hong
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (03) : 3915 - 3933