The Reporting Quality of Machine Learning Studies on Pediatric Diabetes Mellitus: Systematic Review

被引:2
|
作者
Zrubka, Zsombor [1 ,8 ]
Kertesz, Gabor [2 ]
Gulacsi, Laszlo [1 ]
Czere, Janos [3 ]
Holgyesi, Aron [1 ,4 ]
Nezhad, Hossein Motahari [1 ,5 ]
Mosavi, Amir [2 ]
Kovacs, Levente [6 ]
Butte, Atul J. [7 ]
Pentek, Marta [1 ]
机构
[1] Obuda Univ, Univ Res & Innovat Ctr, HECON Hlth Econ Res Ctr, Budapest, Hungary
[2] Obuda Univ, John Neumann Fac Informat, Budapest, Hungary
[3] Obuda Univ, Doctoral Sch Innovat Management, Budapest, Hungary
[4] Semmelweis Univ, Doctoral Sch Mol Med, Budapest, Hungary
[5] Corvinus Univ Budapest, Doctoral Sch Business & Management, Budapest, Hungary
[6] Univ Res & Innovat Ctr, Obuda Univ, Physiol Controls Res Ctr, Budapest, Hungary
[7] Univ Calif San Francisco, Bakar Computat Hlth Sci Inst, San Francisco, CA USA
[8] Obuda Univ, Univ Res & Innovat Ctr, HECON Hlth Econ Res Ctr, Becsi ut 96-b, H-1034 Budapest, Hungary
关键词
diabetes mellitus; children; adolescent; pediatric; machine learning; Minimum Information About Clinical Artificial Intelligence Modelling; MI-CLAIM; reporting quality; ARTIFICIAL-INTELLIGENCE; TYPE-1; GUIDELINES; SEARCH; YOUNG; COMPLICATIONS; EPIDEMIOLOGY; ASSOCIATION; TECHNOLOGY; CARE;
D O I
10.2196/47430
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Diabetes mellitus (DM) is a major health concern among children with the widespread adoption of advanced technologies. However, concerns are growing about the transparency, replicability, biasedness, and overall validity of artificial intelligence studies in medicine. Objective: We aimed to systematically review the reporting quality of machine learning (ML) studies of pediatric DM using the Minimum Information About Clinical Artificial Intelligence Modelling (MI-CLAIM) checklist, a general reporting guideline for medical artificial intelligence studies. Methods: We searched the PubMed and Web of Science databases from 2016 to 2020. Studies were included if the use of ML was reported in children with DM aged 2 to 18 years, including studies on complications, screening studies, and in silico samples. In studies following the ML workflow of training, validation, and testing of results, reporting quality was assessed via MI-CLAIM by consensus judgments of independent reviewer pairs. Positive answers to the 17 binary items regarding sufficient reporting were qualitatively summarized and counted as a proxy measure of reporting quality. The synthesis of results included testing the association of reporting quality with publication and data type, participants (human or in silico), research goals, level of code sharing, and the scientific field of publication (medical or engineering), as well as with expert judgments of clinical impact and reproducibility. Results: After screening 1043 records, 28 studies were included. The sample size of the training cohort ranged from 5 to 561. Six studies featured only in silico patients. The reporting quality was low, with great variation among the 21 studies assessed using MI-CLAIM. The number of items with sufficient reporting ranged from 4 to 12 (mean 7.43, SD 2.62). The items on research questions and data characterization were reported adequately most often, whereas items on patient characteristics and model examination were reported adequately least often. The representativeness of the training and test cohorts to real-world settings and the adequacy of model performance evaluation were the most difficult to judge. Reporting quality improved over time (r=0.50; P=.02); it was higher than average in prognostic biomarker and risk factor studies (P=.04) and lower in noninvasive hypoglycemia detection studies (P=.006), higher in studies published in medical versus engineering journals (P=.004), and higher in studies sharing any code of the ML pipeline versus not sharing (P=.003). The association between expert judgments and MI-CLAIM ratings was not significant. Conclusions: The reporting quality of ML studies in the pediatric population with DM was generally low. Important details for clinicians, such as patient characteristics; comparison with the state-of-the-art solution; and model examination for valid, unbiased, and robust results, were often the weak points of reporting. To assess their clinical utility, the reporting standards of ML studies must evolve, and algorithms for this challenging population must become more transparent and replicable.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Reporting quality of studies using machine learning models for medical diagnosis: a systematic review
    Yusuf, Mohamed
    Atal, Ignacio
    Li, Jacques
    Smith, Philip
    Ravaud, Philippe
    Fergie, Martin
    Callaghan, Michael
    Selfe, James
    [J]. BMJ OPEN, 2020, 10 (03): : e034568
  • [2] Protocol for a systematic review on the methodological and reporting quality of prediction model studies using machine learning techniques
    Navarro, Constanza L. Andaur
    Damen, Johanna A. A. G.
    Takada, Toshihiko
    Nijman, Steven W. J.
    Dhiman, Paula
    Ma, Jie
    Collins, Gary S.
    Bajpai, Ram
    Riley, Richard D.
    Moons, Karel G. M.
    Hooft, Lotty
    [J]. BMJ OPEN, 2020, 10 (11):
  • [3] A systematic review on machine learning in sellar region diseases: quality and reporting items
    Qiao, Nidan
    [J]. ENDOCRINE CONNECTIONS, 2019, 8 (07): : 952 - 960
  • [4] Reporting quality in systematic reviews of in vitro studies: a systematic review
    Elshafay, Abdelrahman
    Omran, Esraa Salah
    Abdelkhalek, Mariam
    El-Badry, Mohamed Omar
    Eisa, Heba Gamal
    Fala, Salma Y.
    Dang, Thao
    Ghanem, Mohammad A. T.
    Elbadawy, Maha
    Elhady, Mohamed Tamer
    Nguyen Lam Vuong
    Hirayama, Kenji
    Nguyen Tien Huy
    [J]. CURRENT MEDICAL RESEARCH AND OPINION, 2019, 35 (09) : 1631 - 1641
  • [5] Reporting and methodological quality of systematic reviews and meta-analysis with protocols in Diabetes Mellitus Type II: A systematic review
    Rainkie, Daniel Christopher
    Abedini, Zeinab Salman
    Abdelkader, Nada Nabil
    [J]. PLOS ONE, 2020, 15 (12):
  • [6] Applications of Machine Learning in Pediatric Hydrocephalus: A Systematic Review
    Pahwa, Bhavya
    Bali, Ojasvini
    Goyal, Sarvesh
    Kedia, Shweta
    [J]. NEUROLOGY INDIA, 2021, 69 : S568 - +
  • [7] Assessment of the Reporting Quality of Randomized Controlled Trials on the Treatment of Diabetes Mellitus with Traditional Chinese Medicine: A Systematic Review
    Wang, Ping
    Xu, Qin
    Sun, Qi
    Fan, Fang-fang
    Guo, Xue-rui
    Guo, Fei
    [J]. PLOS ONE, 2013, 8 (07):
  • [8] Quality of reporting for qualitative studies in pediatric urologydA scoping review
    Mccloskey, Kyle
    Neuzil, Kevin
    Basak, Ram
    Chan, Katherine H.
    [J]. JOURNAL OF PEDIATRIC UROLOGY, 2023, 19 (05) : 643 - 651
  • [9] Machine Learning Models to Forecast Outcomes of Pituitary Surgery: A Systematic Review in Quality of Reporting and Current Evidence
    Rech, Matheus M.
    de Macedo Filho, Leonardo
    White, Alexandra J.
    Perez-Vega, Carlos
    Samson, Susan L.
    Chaichana, Kaisorn L.
    Olomu, Osarenoma U.
    Quinones-Hinojosa, Alfredo
    Almeida, Joao Paulo
    [J]. BRAIN SCIENCES, 2023, 13 (03)
  • [10] Machine learning and artificial intelligence based Diabetes Mellitus detection and self-management: A systematic review
    Chaki, Jyotismita
    Ganesh, S. Thillai
    Cidham, S. K.
    Theertan, S. Ananda
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 3204 - 3225