Predicting osteoarthritis in adults using statistical data mining and machine learning

被引:5
|
作者
Bertoncelli, Carlo M. [1 ,2 ,3 ]
Altamura, Paola [4 ]
Bagui, Sikha [1 ]
Bagui, Subhash [1 ]
Vieira, Edgar Ramos [5 ]
Costantini, Stefania [3 ]
Monticone, Marco [6 ,7 ,8 ]
Solla, Federico [2 ]
Bertoncelli, Domenico [1 ,3 ]
机构
[1] Univ West Florida, Hal Marcus Coll Sci & Engn, Dept Comp Sci, Pensacola, FL 32514 USA
[2] Lenval Univ, Pediat Hosp Nice, Dept Pediat Orthopaed Surg, Nice, France
[3] Univ Aquila, Dept Informat Engn Comp Sci & Math, Laquila, Italy
[4] Univ G dAnnunzio, Dept Med Chem & Pharmaceut Technol, Chieti, Italy
[5] Florida Int Univ, Dept Phys Therapy, Miami, FL 33199 USA
[6] Univ Cagliari, Dept Med Sci & Publ Hlth, Cagliari, Italy
[7] Univ Cagliari, Dept Phys Med & Rehabil, Cagliari, Italy
[8] Univ Cagliari, G Brotzu Hosp, Dept Neurosci & Rehabil, Neurorehabil Unit, Cagliari, Italy
关键词
arthritis; machine learning; osteoarthritis; statistical data mining; KNEE OSTEOARTHRITIS; EPIDEMIOLOGY; MODEL; VALIDATION; SCOLIOSIS; ARTHRITIS; BURDEN; STATE; HIP;
D O I
10.1177/1759720X221104935
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Osteoarthritis (OA) has traditionally been considered a disease of older adults (>= 65 years old), but it may appear in younger adults. However, the risk factors for OA in younger adults need to be further evaluated. Objectives: To develop a prediction model for identifying risk factors of OA in subjects aged 20-50 years and compare the performance of different machine learning models. Methods: We included data from 52,512 participants of the National Health and Nutrition Examination Survey; of those, we analyzed only subjects aged 20-50 years (n = 19,133), with or without OA. The supervised machine learning model 'Deep PredictMed' based on logistic regression, deep neural network (DNN), and support vector machine was used for identifying demographic and personal characteristics that are associated with OA. Finally, we compared the performance of the different models. Results: Being a female (p < 0.001), older age (p < 0.001), a smoker (p < 0.001), higher body mass index (p < 0.001), high blood pressure (p < 0.001), race/ethnicity (lowest risk among Mexican Americans, p = 0.01), and physical and mental limitations (p < 0.001) were associated with having OA. Best predictive performance yielded a 75% area under the receiver operating characteristic curve. Conclusion: Sex (female), age (older), smoking (yes), body mass index (higher), blood pressure (high), race/ethnicity, and physical and mental limitations are risk factors for having OA in adults aged 20-50 years. The best predictive performance was achieved using DNN algorithms.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Algorithms for Data Mining and Machine Learning
    Schulz, Volker H.
    SIAM REVIEW, 2020, 62 (03) : 739 - 739
  • [32] DATA MINING AND MACHINE LEARNING IN ASTRONOMY
    Ball, Nicholas M.
    Brunner, Robert J.
    INTERNATIONAL JOURNAL OF MODERN PHYSICS D, 2010, 19 (07): : 1049 - 1106
  • [33] Machine learning and robust data mining
    Croux, Christophe
    Leuven, K. U.
    Gallopoulos, Efstratios
    Van Aelst, Stefan
    Zha, Hongyuan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 151 - 154
  • [34] Machine learning for mining imbalanced data
    Arafat, Md. Yasir
    Hoque, Sabera
    Xu, Shuxiang
    Farid, Dewan Md
    IAENG International Journal of Computer Science, 2019, 46 (02) : 332 - 348
  • [35] Fuzzy machine learning and data mining
    Huellermeier, Eyke
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 269 - 283
  • [36] Machine learning for data mining in medicine
    Lavrac, N
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 1999, 1620 : 47 - 62
  • [37] Data Mining and Machine Learning Models for Predicting Drug Likeness and Their Disease or Organ Category
    Yosipof, Abraham
    Guedes, Rita C.
    Garcia-Sosa, Alfonso T.
    FRONTIERS IN CHEMISTRY, 2018, 6
  • [38] Detecting Adverse Drug Reaction with Data Mining And Predicting its Severity With Machine Learning
    Islam, Tanvir
    Hussain, Nadib
    Islam, Samiul
    Chakrabarty, Amitabha
    2018 IEEE REGION 10 HUMANITARIAN TECHNOLOGY CONFERENCE (R10-HTC), 2018,
  • [39] Predicting Malicious Software in IoT Environment Based on Machine Learning and Data Mining Techniques
    Alharbi, Abdulmohsen
    Hamid, Abdul
    Lahza, Husam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 497 - 506
  • [40] Predicting the Dynamic Behaviour of a Concrete Dam using Statistical and Machine Learning Models
    Pereira, Sérgio
    Mata, Juan
    Magalhães, Filipe
    Gomes, Jorge
    Cunha, Álvaro
    e-Journal of Nondestructive Testing, 2024, 29 (07):