Sarcopenia feature selection and risk prediction using machine learning A cross-sectional study

被引:30
|
作者
Kang, Yang-Jae [1 ,2 ]
Yoo, Jun-Il [3 ]
Ha, Yong-chan [4 ]
机构
[1] Gyeongsang Natl Univ Hosp, PMBBRC, Div Appl Life Sci Dept, Jinju, South Korea
[2] Gyeongsang Natl Univ Hosp, Div Life Sci Dept, Jinju, South Korea
[3] Gyeongsang Natl Univ Hosp, Dept Orthopaed Surg, 90 Chilamdong, Jinju 660702, Gyeongnamdo, South Korea
[4] Chung Ang Univ, Coll Med, Dept Orthopaed Surg, Seoul, South Korea
关键词
feature selection; machine learning; risk prediction; sarcopenia; MUSCLE MASS; HEALTH;
D O I
10.1097/MD.0000000000017699
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The purpose of this study was to verify the usefulness of machine learning (ML) for selection of risk factors and development of predictive models for patients with sarcopenia. We collected medical records from Korean postmenopausal women based on Korea National Health and Nutrition Examination Surveys. A training data set compiled from simple survey data was used to construct models based on popular ML algorithms (e.g., support vector machine, random forest [RF], and logistic regression). A total of 4020 patients >= 65 years of age were enrolled in this study. The study population consisted of 1698 (42.2%) male and 2322 (57.8%) female patients. The 10 most important risk factors in men were bodymass index (BMI), red blood cell (RBC) count, blood urea nitrogen (BUN), vitamin D, ferritin, fiber intake (g/d), primary diastolic blood pressure, white blood cell (WBC) count, fat intake (g/d), age, glutamic-pyruvic transaminase, niacin intake (mg/d), protein intake (g/d), fasting blood sugar, and water intake (g/d). The 10 most important risk factors in women were BMI, water intake (g/d), WBC, RBC count, iron intake (mg/d), BUN, high-density lipoprotein, protein intake (g/d), fiber consumption (g/d), vitamin C intake (mg/d), parathyroid hormone, niacin intake (mg/d), carotene intake (mg/d), potassiumintake (mg/d), calcium intake (mg/d), sodiumintake (mg/d), retinol intake (mg/d), and age. A receiver operating characteristic (ROC) curve analysis found that the area under the ROC curve for each ML model was not significantly different within a gender. The most cost-effective method in clinical practice is to make feature selection using RF models and expert knowledge and to make disease prediction using verification by several ML models. However, the developed prediction model should be validated using additional studies.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Predicting the risk of hypertension using machine learning algorithms: A cross sectional study in Ethiopia
    Islam, Md. Merajul
    Alam, Md. Jahangir
    Maniruzzaman, Md
    Ahmed, N. A. M. Faisal
    Ali, Md Sujan
    Rahman, Md. Jahanur
    Roy, Dulal Chandra
    PLOS ONE, 2023, 18 (08):
  • [32] Machine Learning Prediction of Tongue Pressure in Elderly Patients with Head and Neck Tumor: A Cross-Sectional Study
    Han, Xuewei
    Bai, Ziyi
    Mogushi, Kaoru
    Hase, Takeshi
    Takeuchi, Katsuyuki
    Iida, Yoritsugu
    Sumita, Yuka I.
    Wakabayashi, Noriyuki
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (08)
  • [33] Prevalence of physical inactivity and risk of sarcopenia in primary care. Cross-sectional study
    Martin, Laura Illamola
    Granados, Antonio Granados
    Melenchon, Albert Sanllorente
    Cristobal, Juan Jose Rodriguez
    Hernandez, Mireia Broto
    ATENCION PRIMARIA, 2024, 56 (11):
  • [34] Risk phenotype for sarcopenia in older adults from Amazonas, Brazil; a cross-sectional study
    de Lima, Alex Barreto
    Torres-Costoso, Ana
    Zymbal, Vera
    Gouveia, Elvio Rubio
    Baptista, Fatima
    PLOS ONE, 2023, 18 (10):
  • [35] Use of feature importance statistics to accurately predict asthma attacks using machine learning: A cross-sectional cohort study of the US population
    Huang, Alexander A.
    Huang, Samuel Y.
    PLOS ONE, 2023, 18 (11):
  • [36] Feature selection and classification in breast cancer prediction using IoT and machine learning
    Gopal, V. Nanda
    Al-Turjman, Fadi
    Kumar, R.
    Anand, L.
    Rajesh, M.
    MEASUREMENT, 2021, 178
  • [37] Antiprotozoal peptide prediction using machine learning with effective feature selection techniques
    Periwal, Neha
    Arora, Pooja
    Thakur, Ananya
    Agrawal, Lakshay
    Goyal, Yash
    Rathore, Anand S.
    Anand, Harsimrat Singh
    Kaur, Baljeet
    Sood, Vikas
    HELIYON, 2024, 10 (16)
  • [38] Prediction and feature selection of low birth weight using machine learning algorithms
    Reza, Tasneem Binte
    Salma, Nahid
    JOURNAL OF HEALTH POPULATION AND NUTRITION, 2024, 43 (01)
  • [39] Cross-sectional Observational Study of Typical in-utero Fetal Movements using Machine Learning
    Vasung, Lana
    Xu, Junshen
    Abaci-Turk, Esra
    Zhou, Cindy
    Holland, Elizabeth
    Barth, William H. H.
    Barnewolt, Carol
    Connolly, Susan
    Estroff, Judy
    Golland, Polina
    Feldman, Henry A. A.
    Adalsteinsson, Elfar
    Grant, P. Ellen
    DEVELOPMENTAL NEUROSCIENCE, 2023, 45 (03) : 105 - 114
  • [40] Using advanced machine learning algorithms to predict academic major completion: A cross-sectional study
    Kordbagheri, Alireza
    Kordbagheri, Mohammadreza
    Tayim, Natalie
    Fakhrou, Abdulnaser
    Davoudi, Mohammadreza
    Computers in Biology and Medicine, 2025, 184