Use of machine learning to identify risk factors for insomnia

被引:28
|
作者
Huang, Alexander A. A. [1 ]
Huang, Samuel Y. Y. [2 ]
机构
[1] Northwestern Univ, Feinberg Sch Med, Chicago, IL 60611 USA
[2] Virginia Commonwealth Univ, Sch Med, Richmond, VA USA
来源
PLOS ONE | 2023年 / 18卷 / 04期
关键词
SLEEP DISORDERS; EPIDEMIOLOGY; MEDICATION; DEPRESSION; CHILDREN; DURATION; ALCOHOL; QUALITY; ANXIETY; LIFE;
D O I
10.1371/journal.pone.0282622
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
ImportanceSleep is critical to a person's physical and mental health, but there are few studies systematically assessing risk factors for sleep disorders. ObjectiveThe objective of this study was to identify risk factors for a sleep disorder through machine-learning and assess this methodology. Design, setting, and participantsA retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data. MethodsA physician diagnosis of insomnia was the outcome of this study. Univariate logistic models, with insomnia as the outcome, were used to identify covariates that were associated with insomnia. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the cover statistic to identify risk factors for insomnia. Shapely Additive Explanations (SHAP) were utilized to visualize the relationship between these potential risk factors and insomnia. ResultsOf the 7,929 patients that met the inclusion criteria in this study, 4,055 (51% were female, 3,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of another race. The machine learning model had 64 out of a total of 684 features that were found to be significant on univariate analysis (P<0.0001 used). These were fitted into the XGBoost model and an AUROC = 0.87, Sensitivity = 0.77, Specificity = 0.77 were observed. The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were the Patient Health Questionnaire depression survey (PHQ-9) (Cover = 31.1%), age (Cover = 7.54%), physician recommendation of exercise (Cover = 3.86%), weight (Cover = 2.99%), and waist circumference (Cover = 2.70%). ConclusionMachine learning models can effectively predict risk for a sleep disorder using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Combining Machine Learning and Traditional Statistical Modeling to Identify Risk Factors of Hospital Mortality and Directionality for Severe ARDS
    Agrawal, A.
    Shaheen, I.
    Narasimhan, M.
    Qiu, M.
    Hirsch, J.
    Zhang, M.
    Hajizadeh, N.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2019, 199
  • [32] Using machine learning to identify risk factors for short-term complications following thumb carpometacarpal arthroplasty
    Shah, Rohan M.
    Khazanchi, Rushmin
    Bajaj, Anitesh
    Rana, Krishi
    Malhotra, Saaz
    Wolf, Jennifer Moriatis
    JOURNAL OF HAND AND MICROSURGERY, 2024, 16 (05)
  • [33] A Machine Learning Approach to Identify Risk Factors for Post-Operative Complications After Tetralogy of Fallot Repair
    Faerber, Jennifer A.
    Huang, Jing
    Zhang, Xuemei
    Song, Lihai
    Mascio, Christopher
    Ravishankar, Chitra
    McGowan, Francis X.
    O'Byrne, Michael
    Goldmuntz, Elizabeth
    Mercer-Rosa, Laura M.
    CIRCULATION, 2019, 140
  • [34] Machine learning analysis to identify the association between risk factors and onset of nosocomial diarrhea: a retrospective cohort study
    Kurisu, Ken
    Yoshiuchi, Kazuhiro
    Ogino, Kei
    Oda, Toshimi
    PEERJ, 2019, 7
  • [35] Machine learning and network-based models to identify genetic risk factors to the progression and survival of colorectal cancer
    Hossain, Md Jakir
    Chowdhury, Utpala Nanda
    Islam, M. Babul
    Uddin, Shahadat
    Ahmed, Mohammad Boshir
    Quinn, Julian M. W.
    Moni, Mohammad Ali
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
  • [36] A MACHINE LEARNING APPROACH TO IDENTIFY FACTORS ASSOCIATED WITH ADOLESCENT SLEEP OUTCOMES
    Ricketts, Emily
    Kaplan, Katherine
    McMakin, Dana
    Patriarca, Guadalupe
    Mathew, Gina
    McGrew, Tylor
    Chang, Anne-Marie
    Hale, Lauren
    SLEEP, 2024, 47
  • [37] Machine Learning to identify factors that affect Human Systolic Blood Pressure
    Pechprasarn, Suejit
    Sukkasem, Chayanisa
    Sasivimolkul, Suvicha
    Suvarnaphaet, Phitsini
    2019 12TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON 2019), 2019,
  • [38] Use of machine learning to identify protective factors for death from COVID19 in the ICU: a retrospective study
    Dos Santos, Lander
    Silva, Lincoln Luis
    Pelloso, Fernando Castilho
    Maia, Vinicius
    Pujals, Constanza
    Borghesan, Deise Helena
    Carvalho, Maria Dalva
    Pedroso, Raissa Bocchi
    Pelloso, Sandra Marisa
    PEERJ, 2024, 12 : 2 - 22
  • [39] Using Machine Learning To Identify Factors That Govern Amorphization of Irradiated Pyrochlores
    Pilania, Ghanshyam
    Whittle, Karl R.
    Jiang, Chao
    Grimes, Robin W.
    Stanek, Christopher R.
    Sickafus, Kurt E.
    Uberuaga, Blas Pedro
    CHEMISTRY OF MATERIALS, 2017, 29 (06) : 2574 - 2583
  • [40] The use of supervised machine learning techniques to identify factors influencing vitamin D bio-enrichment of pork
    Rosbotham, E. J.
    Rankin, D.
    Gill, C. I. R.
    McDonald, E. J.
    McRoberts, W. C.
    Neill, H. R.
    Boland, R.
    Pourshahidi, L. K.
    PROCEEDINGS OF THE NUTRITION SOCIETY, 2021, 80 (OCE3)