Pima Indians diabetes mellitus classification based on machine learning (ML) algorithms

被引:72
|
作者
Chang, Victor [1 ]
Bailey, Jozeene [2 ]
Xu, Qianwen Ariel [2 ]
Sun, Zhili [3 ]
机构
[1] Aston Univ, Aston Business Sch, Dept Operat & Informat Management, Birmingham, W Midlands, England
[2] Teesside Univ, Sch Comp & Digital Technol, Cybersecur Informat Syst & AI Res Grp, Middlesbrough, Cleveland, England
[3] Univ Surrey, Inst Commun Syst ICS, 5G & 6G Innovat Ctr 5G & 6GIC, Guildford, Surrey, England
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 22期
关键词
Diabetes mellitus; The Internet of Medical Things (IoMT); Machine learning; Interpretable artificial intelligence; DIAGNOSIS; INTERNET;
D O I
10.1007/s00521-022-07049-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an e-diagnosis system based on machine learning (ML) algorithms to be implemented on the Internet of Medical Things (IoMT) environment, particularly for diagnosing diabetes mellitus (type 2 diabetes). However, the ML applications tend to be mistrusted because of their inability to show the internal decision-making process, resulting in slow uptake by end-users within certain healthcare sectors. This research delineates the use of three interpretable supervised ML models: Naive Bayes classifier, random forest classifier, and J48 decision tree models to be trained and tested using the Pima Indians diabetes dataset in R programming language. The performance of each algorithm is analyzed to determine the one with the best accuracy, precision, sensitivity, and specificity. An assessment of the decision process is also made to improve the model. It can be concluded that a Naive Bayes model works well with a more fine-tuned selection of features for binary classification, while random forest works better with more features.
引用
收藏
页码:16157 / 16173
页数:17
相关论文
共 50 条
  • [31] Non-insulin-dependent diabetes mellitus in populations at risk: the Pima Indians
    Charles, MA
    Eschwege, E
    Bennett, PH
    DIABETES & METABOLISM, 1997, 23 : 6 - 9
  • [32] Genetic algorithm based feature selection and MOE fuzzy classification algorithm on Pima Indians Diabetes dataset
    Vaishali, R.
    Sasikala, R.
    Ramasubbareddy, S.
    Remya, S.
    Nalluri, Sravani
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON COMPUTING NETWORKING AND INFORMATICS (ICCNI 2017), 2017,
  • [33] DIAGNOSIS OF DIABETES MELLITUS USING STATISTICAL METHODS AND MACHINE LEARNING ALGORITHMS
    Pekel, Ebru
    Ozcan, Tuncay
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2018, 36 (04): : 1263 - 1280
  • [34] An Extensive Survey on Recent Machine Learning Algorithms for Diabetes Mellitus Prediction
    Selvi, R. Thanga
    Muthulakshmi, I
    INTELLIGENT COMMUNICATION TECHNOLOGIES AND VIRTUAL MOBILE NETWORKS, ICICV 2019, 2020, 33 : 328 - 335
  • [35] Machine learning algorithms for early diagnosis of diabetes mellitus: A comparative study
    Rawat, Vandana
    Joshi, Shivangi
    Gupta, Shikhar
    Singh, Devesh Pratap
    Singh, Neelam
    MATERIALS TODAY-PROCEEDINGS, 2022, 56 : 502 - 506
  • [36] Predicting complications of diabetes mellitus using advanced machine learning algorithms
    Ljubic, Branimir
    Hai, Ameen Abdel
    Stanojevic, Marija
    Diaz, Wilson
    Polimac, Daniel
    Pavlovski, Martin
    Obradovic, Zoran
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (09) : 1343 - 1351
  • [37] Performance Assessment of Different Machine Learning Algorithms in Predicting Diabetes Mellitus
    Nishat, Mirza Muntasir
    Faisal, Fahim
    Mahbub, Md Ashif
    Mahbub, Md Hasib
    Islam, Shuvo
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2021, 14 (01): : 74 - 82
  • [38] Reduced early insulin secretion in the etiology of type 2 diabetes mellitus in Pima Indians
    Bogardus, C
    Tataranni, PA
    DIABETES, 2002, 51 : S262 - S264
  • [39] GLOMERULAR STRUCTURE IN PIMA-INDIANS WITH TYPE-II DIABETES-MELLITUS
    PAGTALUNAN, ME
    MILLER, PL
    NELSON, RG
    MYERS, BD
    COPLON, N
    MEYER, TW
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 1994, 5 (03): : 381 - 381
  • [40] Performance Analysis of Machine Learning Algorithms for Big Data Classification: ML and Al-Based Algorithms for Big Data Analysis
    Punia, Sanjeev Kumar
    Kumar, Manoj
    Stephan, Thompson
    Deverajan, Ganesh Gopal
    Patan, Rizwan
    INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS, 2021, 12 (04) : 60 - 75