Maximal Information Coefficient-Based Undersampling Method for Highly-Imbalanced Learning

被引:0
|
作者
Qin, Haiou [1 ,2 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Jiangxi Prov Key Lab Smart Water Conservancy, Nanchang 330099, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Microwave integrated circuits; Generative adversarial networks; Noise measurement; Machine learning algorithms; Classification algorithms; Training; Software packages; Shape; Sensitivity; Sampling methods; Imbalanced classification; imbalanced learning; maximal information coefficient; maximal information coefficient-based undersampling; undersampling; MACHINE;
D O I
10.1109/ACCESS.2025.3525475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning from highly-imbalanced datasets is still a big challenge in the field of machine learning because models created by general learning algorithms are weak in recognizing the samples from the minority class correctly. Undersampling is an alternative kind of methods to deal with imbalanced learning. In this paper, we propose a new undersampling method based on maximal information coefficient (including two algorithms MICU-1 and MICU-2) to rebalance the datasets. In order to evaluate the effectiveness of the method, 20 highly- imbalanced datasets are used for the benchmarks. Results show that compared with other undersampling methods, maximal information coefficient-based undersampling method are competitive in terms of G-mean and F-measure.
引用
收藏
页码:4126 / 4135
页数:10
相关论文
共 50 条
  • [1] A Novel Approach for Unsupervised Learning of Highly-Imbalanced Data
    Kennedy, Robert K. L.
    Salekshahrezaee, Zahra
    Khoshgoftaar, Taghi M.
    2022 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE, COGMI, 2022, : 52 - 58
  • [2] Maximal Information Coefficient-Based Two-Stage Feature Selection Method for Railway Condition Monitoring
    Wen, Tao
    Dong, Deyi
    Chen, Qianyu
    Chen, Lei
    Roberts, Clive
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (07) : 2681 - 2690
  • [3] Classification of Alzheimer's Disease Using Maximal Information Coefficient-Based Functional Connectivity with an Extreme Learning Machine
    Chauhan, Nishant
    Choi, Byung-Jae
    BRAIN SCIENCES, 2023, 13 (07)
  • [4] Novel DBN Structure Learning Method Based on Maximal Information Coefficient
    Li, Guo-Liang
    Xing, Li-Ning
    Chen, Ying-Wu
    FUZZY SYSTEMS AND DATA MINING II, 2016, 293 : 290 - 298
  • [5] Iterative cleaning and learning of big highly-imbalanced fraud data using unsupervised learning
    Kennedy, Robert K. L.
    Salekshahrezaee, Zahra
    Villanustre, Flavio
    Khoshgoftaar, Taghi M.
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [6] The Proposal of Undersampling Method for Learning from Imbalanced Datasets
    Bach, Malgorzata
    Werner, Aleksandra
    Palt, Mateusz
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 125 - 134
  • [7] LDAMSS: Fast and efficient undersampling method for imbalanced learning
    Liang, Ting
    Xu, Jie
    Zou, Bin
    Wang, Zhan
    Zeng, Jingjing
    APPLIED INTELLIGENCE, 2022, 52 (06) : 6794 - 6811
  • [8] LDAMSS: Fast and efficient undersampling method for imbalanced learning
    Ting Liang
    Jie Xu
    Bin Zou
    Zhan Wang
    Jingjing Zeng
    Applied Intelligence, 2022, 52 : 6794 - 6811
  • [9] Iterative cleaning and learning of big highly-imbalanced fraud data using unsupervised learning
    Robert K. L. Kennedy
    Zahra Salekshahrezaee
    Flavio Villanustre
    Taghi M. Khoshgoftaar
    Journal of Big Data, 10
  • [10] MICOP: Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data
    Iuchi, Hitoshi
    Sugimoto, Masahiro
    Tomita, Masaru
    BMC BIOINFORMATICS, 2018, 19