Optimization of diabetes prediction methods based on combinatorial balancing algorithm

Cited by: 0
Authors
Shao, HuiZhi [1, 2]
Liu, Xiang [2 ]
Zong, DaShuai [2 ]
Song, QingJun [2 ]
Affiliations
[1] Jinan Engn Polytech, Jinan, Shandong, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An, Shandong, Peoples R China
Source
NUTRITION & DIABETES | 2024 / Volume 14 / Issue 01
Funding
National Natural Science Foundation of China
DOI
10.1038/s41387-024-00324-z
Chinese Library Classification (CLC) number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: Diabetes, as a significant disease affecting public health, requires early detection for effective management and intervention. However, imbalanced datasets pose a challenge to accurate diabetes prediction. This imbalance often results in models performing poorly in predicting minority classes, affecting overall diagnostic performance.

Objectives: To address this issue, this study employs a combination of Synthetic Minority Over-sampling Technique (SMOTE) and Random Under-Sampling (RUS) for data balancing and uses Optuna for hyperparameter optimization of machine learning models. This approach aims to fill the gap in current research concerning data balancing and model optimization, thereby improving prediction accuracy and computational efficiency.

Methods: First, the study uses SMOTE and RUS methods to process the imbalanced diabetes dataset, balancing the data distribution. Then, Optuna is utilized to optimize the hyperparameters of the LightGBM model to enhance its performance. During the experiment, the effectiveness of the proposed methods is evaluated by comparing the training results of the dataset before and after balancing.

Results: The experimental results show that the enhanced LightGBM-Optuna model improves the accuracy from 97.07% to 97.11%, and the precision from 97.17% to 98.99%. The time required for a single search is only 2.5 seconds. These results demonstrate the superiority of the proposed method in handling imbalanced datasets and optimizing model performance.

Conclusions: The study indicates that combining SMOTE and RUS data balancing algorithms with Optuna for hyperparameter optimization can effectively enhance machine learning models, especially in dealing with imbalanced datasets for diabetes prediction.
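The balancing step described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the function names (`smote`, `random_under_sample`, `balance`), the choice of the midpoint between the two class sizes as the target, the convention that label 1 is the minority class, and the toy two-feature data are all assumptions made for illustration. In practice a library such as imbalanced-learn (its `SMOTE` and `RandomUnderSampler` classes) would normally be used, and the Optuna search over LightGBM hyperparameters is not covered here.

```python
import random
from collections import Counter


def smote(minority, n_new, k=5, rng=None):
    """Create n_new synthetic points, each interpolated between a minority
    sample and one of its k nearest minority neighbours (the SMOTE idea)."""
    rng = rng or random.Random(0)
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by squared Euclidean distance.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic


def random_under_sample(majority, n_keep, rng=None):
    """Keep n_keep majority samples chosen uniformly without replacement."""
    rng = rng or random.Random(0)
    return rng.sample(majority, n_keep)


def balance(X, y, target=None, k=5, seed=0):
    """Oversample class 1 with SMOTE and undersample class 0 with RUS so
    both classes end up with `target` samples (assumed default: midpoint)."""
    rng = random.Random(seed)
    minority = [x for x, label in zip(X, y) if label == 1]
    majority = [x for x, label in zip(X, y) if label == 0]
    if target is None:
        target = (len(minority) + len(majority)) // 2
    minority = minority + smote(minority, max(0, target - len(minority)), k, rng)
    majority = random_under_sample(majority, min(target, len(majority)), rng)
    X_bal = majority + minority
    y_bal = [0] * len(majority) + [1] * len(minority)
    return X_bal, y_bal


# Toy imbalanced data (hypothetical): 90 majority points, 10 minority points.
data_rng = random.Random(42)
X = [(data_rng.random(), data_rng.random()) for _ in range(90)]
X += [(10 + data_rng.random(), 10 + data_rng.random()) for _ in range(10)]
y = [0] * 90 + [1] * 10

X_bal, y_bal = balance(X, y)
print(Counter(y), "->", Counter(y_bal))
```

Combining the two techniques means the minority class does not have to be inflated all the way to the majority size, and the majority class does not have to be cut all the way down to the minority size; the midpoint target used here is only one possible compromise.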
Pages: 13