Data Mining Techniques for Early Diagnosis of Diabetes: A Comparative Study

被引:15
|
作者
Chaves, Luis [1 ]
Marques, Goncalo [1 ]
机构
[1] Polytech Coimbra, ESTGOH, Rua Gen Santos Costa, P-3400124 Oliveira Do Hosp, Portugal
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 05期
关键词
diabetes; COVID-19; SARS-CoV-2; data mining; machine learning;
D O I
10.3390/app11052218
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Diabetes is a life-long condition that is well-known in the 21st century. Once known as a disease of the West, the rise of diabetes has been fed by a nutrition shift, rapid urbanization and increasingly sedentary lifestyles. In late 2019, a new public health concern was emerging (COVID-19), with a particular hazard concerning people living with diabetes. Medical institutes have been collecting data for years. We expect to achieve predictions for pathological complications, which hopefully will prevent the loss of lives and improve the quality of life using data mining processes. This work proposes a comparative study of data mining techniques for early diagnosis of diabetes. We use a publicly accessible data set containing 520 instances, each with 17 attributes. Naive Bayes, Neural Network, AdaBoost, k-Nearest Neighbors, Random Forest and Support Vector Machine methods have been tested. The results suggest that Neural Networks should be used for diabetes prediction. The proposed model presents an AUC of 98.3% and 98.1% accuracy, an F1-Score, Precision and Sensitivity of 98.4% and a Specificity of 97.5%.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [1] Early diagnosis of diabetes mellitus using data mining and machine learning techniques
    Deepa, K.
    Kumar, C. Ranjeeth
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (03) : 3999 - 4011
  • [2] Early prediction of diabetes by applying data mining techniques: A retrospective cohort study
    Al Yousef, Mohammed Zeyad
    Yasky, Adel Fouad
    Al Shammari, Riyad
    Ferwana, Mazen S.
    [J]. MEDICINE, 2022, 101 (29) : E29588
  • [3] Comparative Study of Streaming Data Mining Techniques
    Khan, Shabia Shabir
    Peer, M. A.
    Quadri, S. M. K.
    [J]. 2014 INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2014, : 209 - 214
  • [4] Diagnosis of Diabetes by Applying Data Mining Classification Techniques Comparison of Three Data Mining Algorithms
    Daghistani, Tahani
    Alshammari, Riyad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (07) : 329 - 332
  • [5] Applications of Clustering Techniques in Data Mining: A Comparative Study
    Faizan, Muhammad
    Zuhairi, Megat F.
    Ismail, Shahrinaz
    Sultan, Sara
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (12) : 146 - 153
  • [6] Criteria for a comparative study of visualization techniques in data mining
    Redpath, R
    Srinivasan, B
    [J]. INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2003, : 609 - 620
  • [7] A comparative study for outlier detection techniques in data mining
    Abu Bakar, Zuriana
    Mohemad, Rosmayati
    Ahmad, Akbar
    Deris, Mustafa Mat
    [J]. 2006 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2006, : 360 - +
  • [8] Mining educational data to predict students performanceA comparative study of data mining techniques
    Khaledun Nahar
    Boishakhe Islam Shova
    Tahmina Ria
    Humayara Binte Rashid
    A. H. M. Saiful Islam
    [J]. Education and Information Technologies, 2021, 26 : 6051 - 6067
  • [9] Mining educational data to predict students performance A comparative study of data mining techniques
    Nahar, Khaledun
    Shova, Boishakhe Islam
    Ria, Tahmina
    Rashid, Humayara Binte
    Islam, A. H. M. Saiful
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2021, 26 (05) : 6051 - 6067
  • [10] Text Based Diagnosis of COVID-19 Using Data Mining Techniques: A Comparative Study
    Gupta, Aadarsh
    Valecha, Aastha
    Mishra, Sapna
    Gandhi, Tapan
    [J]. 2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,