Predicting osteoarthritis in adults using statistical data mining and machine learning

被引:5
|
作者
Bertoncelli, Carlo M. [1 ,2 ,3 ]
Altamura, Paola [4 ]
Bagui, Sikha [1 ]
Bagui, Subhash [1 ]
Vieira, Edgar Ramos [5 ]
Costantini, Stefania [3 ]
Monticone, Marco [6 ,7 ,8 ]
Solla, Federico [2 ]
Bertoncelli, Domenico [1 ,3 ]
机构
[1] Univ West Florida, Hal Marcus Coll Sci & Engn, Dept Comp Sci, Pensacola, FL 32514 USA
[2] Lenval Univ, Pediat Hosp Nice, Dept Pediat Orthopaed Surg, Nice, France
[3] Univ Aquila, Dept Informat Engn Comp Sci & Math, Laquila, Italy
[4] Univ G dAnnunzio, Dept Med Chem & Pharmaceut Technol, Chieti, Italy
[5] Florida Int Univ, Dept Phys Therapy, Miami, FL 33199 USA
[6] Univ Cagliari, Dept Med Sci & Publ Hlth, Cagliari, Italy
[7] Univ Cagliari, Dept Phys Med & Rehabil, Cagliari, Italy
[8] Univ Cagliari, G Brotzu Hosp, Dept Neurosci & Rehabil, Neurorehabil Unit, Cagliari, Italy
关键词
arthritis; machine learning; osteoarthritis; statistical data mining; KNEE OSTEOARTHRITIS; EPIDEMIOLOGY; MODEL; VALIDATION; SCOLIOSIS; ARTHRITIS; BURDEN; STATE; HIP;
D O I
10.1177/1759720X221104935
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Osteoarthritis (OA) has traditionally been considered a disease of older adults (>= 65 years old), but it may appear in younger adults. However, the risk factors for OA in younger adults need to be further evaluated. Objectives: To develop a prediction model for identifying risk factors of OA in subjects aged 20-50 years and compare the performance of different machine learning models. Methods: We included data from 52,512 participants of the National Health and Nutrition Examination Survey; of those, we analyzed only subjects aged 20-50 years (n = 19,133), with or without OA. The supervised machine learning model 'Deep PredictMed' based on logistic regression, deep neural network (DNN), and support vector machine was used for identifying demographic and personal characteristics that are associated with OA. Finally, we compared the performance of the different models. Results: Being a female (p < 0.001), older age (p < 0.001), a smoker (p < 0.001), higher body mass index (p < 0.001), high blood pressure (p < 0.001), race/ethnicity (lowest risk among Mexican Americans, p = 0.01), and physical and mental limitations (p < 0.001) were associated with having OA. Best predictive performance yielded a 75% area under the receiver operating characteristic curve. Conclusion: Sex (female), age (older), smoking (yes), body mass index (higher), blood pressure (high), race/ethnicity, and physical and mental limitations are risk factors for having OA in adults aged 20-50 years. The best predictive performance was achieved using DNN algorithms.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Predicting Student Performance Using Machine Learning in fNIRS Data
    Oku, Amanda Yumi Ambriola
    Sato, Joao Ricardo
    FRONTIERS IN HUMAN NEUROSCIENCE, 2021, 15
  • [22] Predicting liver cancer on epigenomics data using machine learning
    Vekariya, Vishalkumar
    Passi, Kalpdrum
    Jain, Chakresh Kumar
    FRONTIERS IN BIOINFORMATICS, 2022, 2
  • [23] Predicting drug shortages using pharmacy data and machine learning
    Pall, Raman
    Gauthier, Yvan
    Auer, Sofia
    Mowaswes, Walid
    HEALTH CARE MANAGEMENT SCIENCE, 2023, 26 (03) : 395 - 411
  • [24] Prediction of an educational institute learning environment using machine learning and data mining
    Shoaib, Muhammad
    Sayed, Nasir
    Amara, Nedra
    Latif, Abdul
    Azam, Sikandar
    Muhammad, Sajjad
    EDUCATION AND INFORMATION TECHNOLOGIES, 2022, 27 (07) : 9099 - 9123
  • [25] Prediction of an educational institute learning environment using machine learning and data mining
    Muhammad Shoaib
    Nasir Sayed
    Nedra Amara
    Abdul Latif
    Sikandar Azam
    Sajjad Muhammad
    Education and Information Technologies, 2022, 27 : 9099 - 9123
  • [26] Predicting the occurrence of surgical site infections using text mining and machine learning
    da Silva, Daniel A.
    ten Caten, Carla S.
    dos Santos, Rodrigo P.
    Fogliatto, Flavio S.
    Hsuan, Juliana
    PLOS ONE, 2019, 14 (12):
  • [27] THE PERVASIVENESS OF DATA MINING AND MACHINE LEARNING
    Ramakrishnan, Naren
    COMPUTER, 2009, 42 (08) : 28 - 29
  • [28] Statistical and Machine Learning Models for Remote Sensing Data Mining-Recent Advancements
    Das, Monidipa
    Ghosh, Soumya K.
    Chowdary, Vemuri M.
    Mitra, Pabitra
    Rijal, Santosh
    REMOTE SENSING, 2022, 14 (08)
  • [29] Forecasting, Data Mining and Machine Learning
    OPERATIONS RESEARCH PROCEEDINGS 2010, 2011, : 1 - 1
  • [30] Machine learning and data mining in manufacturing
    Dogan, Alican
    Birant, Derya
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 166