Predicting osteoarthritis in adults using statistical data mining and machine learning

被引:5
|
作者
Bertoncelli, Carlo M. [1 ,2 ,3 ]
Altamura, Paola [4 ]
Bagui, Sikha [1 ]
Bagui, Subhash [1 ]
Vieira, Edgar Ramos [5 ]
Costantini, Stefania [3 ]
Monticone, Marco [6 ,7 ,8 ]
Solla, Federico [2 ]
Bertoncelli, Domenico [1 ,3 ]
机构
[1] Univ West Florida, Hal Marcus Coll Sci & Engn, Dept Comp Sci, Pensacola, FL 32514 USA
[2] Lenval Univ, Pediat Hosp Nice, Dept Pediat Orthopaed Surg, Nice, France
[3] Univ Aquila, Dept Informat Engn Comp Sci & Math, Laquila, Italy
[4] Univ G dAnnunzio, Dept Med Chem & Pharmaceut Technol, Chieti, Italy
[5] Florida Int Univ, Dept Phys Therapy, Miami, FL 33199 USA
[6] Univ Cagliari, Dept Med Sci & Publ Hlth, Cagliari, Italy
[7] Univ Cagliari, Dept Phys Med & Rehabil, Cagliari, Italy
[8] Univ Cagliari, G Brotzu Hosp, Dept Neurosci & Rehabil, Neurorehabil Unit, Cagliari, Italy
关键词
arthritis; machine learning; osteoarthritis; statistical data mining; KNEE OSTEOARTHRITIS; EPIDEMIOLOGY; MODEL; VALIDATION; SCOLIOSIS; ARTHRITIS; BURDEN; STATE; HIP;
D O I
10.1177/1759720X221104935
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Osteoarthritis (OA) has traditionally been considered a disease of older adults (>= 65 years old), but it may appear in younger adults. However, the risk factors for OA in younger adults need to be further evaluated. Objectives: To develop a prediction model for identifying risk factors of OA in subjects aged 20-50 years and compare the performance of different machine learning models. Methods: We included data from 52,512 participants of the National Health and Nutrition Examination Survey; of those, we analyzed only subjects aged 20-50 years (n = 19,133), with or without OA. The supervised machine learning model 'Deep PredictMed' based on logistic regression, deep neural network (DNN), and support vector machine was used for identifying demographic and personal characteristics that are associated with OA. Finally, we compared the performance of the different models. Results: Being a female (p < 0.001), older age (p < 0.001), a smoker (p < 0.001), higher body mass index (p < 0.001), high blood pressure (p < 0.001), race/ethnicity (lowest risk among Mexican Americans, p = 0.01), and physical and mental limitations (p < 0.001) were associated with having OA. Best predictive performance yielded a 75% area under the receiver operating characteristic curve. Conclusion: Sex (female), age (older), smoking (yes), body mass index (higher), blood pressure (high), race/ethnicity, and physical and mental limitations are risk factors for having OA in adults aged 20-50 years. The best predictive performance was achieved using DNN algorithms.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Educational data mining for predicting students' academic performance using machine learning algorithms
    Dabhade, Pranav
    Agarwal, Ravina
    Alameen, K. P.
    Fathima, A. T.
    Sridharan, R.
    Gopakumar, G.
    MATERIALS TODAY-PROCEEDINGS, 2021, 47 : 5260 - 5267
  • [2] A study on predicting crime rates through machine learning and data mining using text
    Saeed, Ruaa Mohammed
    Abdulmohsin, Husam Ali
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [3] Predicting game-induced emotions using EEG, data mining and machine learning
    Min Xuan Lim
    Jason Teo
    Bulletin of the National Research Centre, 48 (1)
  • [4] Predicting the Level of Safety Feeling of Bangladeshi Internet users using Data Mining and Machine Learning
    Alam, Md. Safiul
    Roy, Anirban
    Majumder, Partha Protim
    Khushbu, Sharun Akter
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 725 - 739
  • [5] Predicting and mitigating cyber threats through data mining and machine learning
    Samia, Nusrat
    Saha, Sajal
    Haque, Anwar
    COMPUTER COMMUNICATIONS, 2024, 228
  • [6] Mining of soil data for predicting the paddy productivity by machine learning techniques
    Ajitha Antony
    Ramanathan Karuppasamy
    Paddy and Water Environment, 2023, 21 : 231 - 242
  • [7] Mining of soil data for predicting the paddy productivity by machine learning techniques
    Antony, Ajitha
    Karuppasamy, Ramanathan
    PADDY AND WATER ENVIRONMENT, 2023, 21 (02) : 231 - 242
  • [8] Continuous acoustic data mining using machine learning
    de la SELLE, Théotime
    Deschanel, Stéphanie
    Weiss, Jérôme
    e-Journal of Nondestructive Testing, 2024, 29 (10):
  • [9] Mining Process Control Data Using Machine Learning
    Nasr, Emad S. Abouel
    Al-Mubaid, Hisham
    CIE: 2009 INTERNATIONAL CONFERENCE ON COMPUTERS AND INDUSTRIAL ENGINEERING, VOLS 1-3, 2009, : 1434 - +
  • [10] Machine learning and data mining
    Mitchell, TM
    COMMUNICATIONS OF THE ACM, 1999, 42 (11) : 30 - 36