Using machine learning to explore the efficacy of administrative variables in prediction of subjective-wellbeing outcomes in New Zealand

Authors
Anantha Narayanan [1]
Tom Stewart [1]
Scott Duncan [1]
Gail Pacheco [2]
Affiliations
[1] Auckland University of Technology, School of Sport and Recreation
[2] Auckland University of Technology, Faculty of Business, Economics and Law
Keywords
Subjective wellbeing; Machine learning; Predictive models; Administrative data; Census
DOI
10.1038/s41598-025-90852-0
Abstract
The growing acknowledgment of population wellbeing as a key indicator of societal prosperity has propelled governments worldwide to devise policies aimed at improving their citizens' overall wellbeing. In New Zealand, the General Social Survey provides wellbeing metrics for a representative subset of the population (~10,000 individuals). However, this sample size only provides a surface-level understanding of the country's wellbeing landscape, limiting our ability to comprehensively assess the impacts of governmental policies, particularly on smaller subgroups who may be of high policy interest. Overcoming this limitation requires comprehensive population-level wellbeing data. Leveraging New Zealand's Integrated Data Infrastructure, this study developed three predictive models (Stepwise Linear Regression, Elastic Net Regression, and Random Forest) and evaluated their efficacy in predicting subjective wellbeing outcomes (life satisfaction, life worthwhileness, family wellbeing, and mental wellbeing) using census-level administrative variables as predictors. Our results demonstrated the Random Forest model's effectiveness in predicting subjective wellbeing, reflected in low RMSE values (~1.5). Nonetheless, the models exhibited low R² values, suggesting limited explanatory capacity for the nuanced variability in outcome variables. While achieving reasonable predictive accuracy, our findings underscore the necessity for further model refinements to enhance the prediction of subjective wellbeing outcomes.
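The abstract reports a low RMSE (~1.5) alongside low R² values, which can look contradictory. The following minimal sketch, on purely synthetic data, illustrates how both can hold at once when the outcome varies only modestly around its mean. Nothing here comes from the study: the outcome mean (7.5), the signal and noise spreads, and the names `rmse` and `r_squared` are assumptions chosen only to make the arithmetic visible.

```python
import numpy as np


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)


# Synthetic wellbeing-like scores (assumed values, not study data).
rng = np.random.default_rng(0)
n = 10_000
signal = rng.normal(0.0, 0.5, n)   # part of the outcome the predictors capture
noise = rng.normal(0.0, 1.5, n)    # part the predictors cannot capture
y = 7.5 + signal + noise

# A hypothetical model that recovers the predictable part exactly.
y_model = 7.5 + signal
# A baseline that predicts the sample mean for everyone.
y_baseline = np.full(n, y.mean())

model_rmse, model_r2 = rmse(y, y_model), r_squared(y, y_model)
base_rmse, base_r2 = rmse(y, y_baseline), r_squared(y, y_baseline)

print(f"model:    RMSE={model_rmse:.2f}  R2={model_r2:.2f}")
print(f"baseline: RMSE={base_rmse:.2f}  R2={base_r2:.2f}")
```

In this sketch the mean-only baseline already achieves an RMSE close to the model's, because the RMSE of a mean-only prediction equals the outcome's standard deviation. RMSE is therefore best read relative to the spread of the outcome, while R² shows how much of that spread the predictors account for.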