Maximal Information Coefficient-Based Undersampling Method for Highly-Imbalanced Learning

被引:0
|
作者
Qin, Haiou [1 ,2 ]
机构
[1] Nanchang Inst Technol, Sch Informat Engn, Nanchang 330099, Peoples R China
[2] Jiangxi Prov Key Lab Smart Water Conservancy, Nanchang 330099, Peoples R China
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Microwave integrated circuits; Generative adversarial networks; Noise measurement; Machine learning algorithms; Classification algorithms; Training; Software packages; Shape; Sensitivity; Sampling methods; Imbalanced classification; imbalanced learning; maximal information coefficient; maximal information coefficient-based undersampling; undersampling; MACHINE;
D O I
10.1109/ACCESS.2025.3525475
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning from highly-imbalanced datasets is still a big challenge in the field of machine learning because models created by general learning algorithms are weak in recognizing the samples from the minority class correctly. Undersampling is an alternative kind of methods to deal with imbalanced learning. In this paper, we propose a new undersampling method based on maximal information coefficient (including two algorithms MICU-1 and MICU-2) to rebalance the datasets. In order to evaluate the effectiveness of the method, 20 highly- imbalanced datasets are used for the benchmarks. Results show that compared with other undersampling methods, maximal information coefficient-based undersampling method are competitive in terms of G-mean and F-measure.
引用
收藏
页码:4126 / 4135
页数:10
相关论文
共 50 条
  • [31] Altitude control of aircraft using coefficient-based policy method
    Jiang, Ju
    Gong, Huajun
    Liu, Jianye
    Xu, Haiyan
    Chen, Ye
    2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 348 - +
  • [32] A novel progressively undersampling method based on the density peaks sequence for imbalanced data
    Xie, Xiaoying
    Liu, Huawen
    Zeng, Shouzhen
    Lin, Lingbin
    Li, Wen
    KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [33] A Novel Bayesian Network Structure Learning Algorithm based on Maximal Information Coefficient
    Zhang, Yinghua
    Hu, Qiping
    Zhang, Wensheng
    Liu, Jin
    2012 IEEE FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2012, : 862 - 867
  • [34] A New Attribute Selection Method Based on Maximal Information Coefficient and Automatic Clustering
    Ji, Haijin
    Huang, Song
    Wu, Yaning
    Hui, Zhanwei
    Lv, Xuewei
    2017 FOURTH INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND THEIR APPLICATIONS (DSA 2017), 2017, : 22 - 28
  • [35] A novel method for identifying SNP disease association based on maximal information coefficient
    Liu, H. M.
    Rao, N.
    Yang, D.
    Yang, L.
    Li, Y.
    Ou, F.
    GENETICS AND MOLECULAR RESEARCH, 2014, 13 (04) : 10863 - 10877
  • [36] Undersampling based on generalized learning vector quantization and natural nearest neighbors for imbalanced data
    Wang, Long-Hui
    Dai, Qi
    Wang, Jia-You
    Du, Tony
    Chen, Lifang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [37] Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method
    Mawalim, Candy Olivia
    Unoki, Masashi
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1627 - 1633
  • [38] An efficient wavelength selection method based on the maximal information coefficient for multivariate spectral calibration
    Huang, Xin
    Luo, Yi-Ping
    Xia, Li
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 194
  • [39] Feature selection for IoT based on maximal information coefficient
    Sun, Guanglu
    Li, Jiabin
    Dai, Jian
    Song, Zhichao
    Lang, Fei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 89 : 606 - 616
  • [40] TCIC_FS: Total correlation information coefficient-based feature selection method for high-dimensional data
    Qiu, Ping
    Niu, Zhendong
    KNOWLEDGE-BASED SYSTEMS, 2021, 231 (231)