An overview of modern machine learning methods for effect measure modification analyses in high-dimensional settings

Cited by: 0
Authors
Cheung, Michael [1]
Dimitrova, Anna [1]
Benmarhnia, Tarik [1]
Affiliations
[1] Univ Calif San Diego, Scripps Inst Oceanog, San Diego, CA USA
Keywords
Effect measure modification; Heterogeneity; Machine learning; Generalized random forest; Bayesian additive regression trees; Bayesian causal forest; Metalearner; CAUSAL INFERENCE; CHILD UNDERNUTRITION; ASSOCIATION; REGRESSION; SELECTION;
DOI
10.1016/j.ssmph.2025.101764
Chinese Library Classification number
R1 [Preventive Medicine, Hygiene]
Discipline classification code
1004; 120402
Abstract
A primary concern of public health researchers is identifying and quantifying heterogeneous exposure effects across population subgroups. Understanding the magnitude and direction of these effects on a given scale enables researchers to recommend policy prescriptions and assess the external validity of findings. Traditional methods for effect measure modification analyses require manual model specification, which is often impractical or infeasible in high-dimensional settings. Recent developments in machine learning aim to solve this issue by using data-driven approaches to estimate heterogeneous exposure effects. However, these methods do not directly identify effect modifiers or estimate the corresponding subgroup effects. Consequently, additional analysis techniques are required to use these methods in the context of effect measure modification analyses. Although no data-driven method or technique can identify effect modifiers and domain expertise is still required, such methods may play an important role in discovering vulnerable subgroups when prior knowledge is not available. We summarize these machine learning methods, provide the intuition behind them, and discuss how they may be employed for effect measure modification analyses, as a reference for public health researchers. We discuss their implementation in R with annotated syntax and demonstrate their application in a case study assessing the heterogeneous effects of drought on stunting among children in the Demographic and Health Surveys data set.
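The point that heterogeneous-effect estimators need a follow-up step to yield subgroup effects can be illustrated with a minimal sketch. It is not the authors' code: the article works in R with methods such as generalized random forests and Bayesian additive regression trees, whereas this uses a plain linear T-learner (one simple kind of metalearner) in Python on simulated data. The variable names (`m` for a candidate modifier, `z` for another covariate, `a` for exposure) and the data-generating values are assumptions made only for illustration.

```python
import numpy as np

def t_learner_cate(X, a, y):
    """Linear T-learner: fit one outcome model per exposure arm,
    then difference the two predictions for every individual."""
    X1 = np.column_stack([np.ones(len(X)), X])  # add intercept column
    beta_exposed, *_ = np.linalg.lstsq(X1[a == 1], y[a == 1], rcond=None)
    beta_unexposed, *_ = np.linalg.lstsq(X1[a == 0], y[a == 0], rcond=None)
    return X1 @ beta_exposed - X1 @ beta_unexposed

# Simulated data (assumed, not from the article): the exposure effect
# is 1.0 when m == 0 and 3.0 when m == 1; exposure is randomized.
rng = np.random.default_rng(0)
n = 4000
m = rng.integers(0, 2, n)      # hypothetical binary candidate modifier
z = rng.normal(size=n)         # hypothetical covariate, not a modifier
a = rng.integers(0, 2, n)      # binary exposure
y = 1.0 + 0.5 * z + a * (1.0 + 2.0 * m) + rng.normal(scale=0.5, size=n)

X = np.column_stack([m, z])
cate = t_learner_cate(X, a, y)  # individual-level effect estimates

# The additional analysis step: average the individual-level estimates
# within levels of the candidate modifier to get subgroup effects.
effect_m0 = cate[m == 0].mean()
effect_m1 = cate[m == 1].mean()
print(f"estimated effect, m=0: {effect_m0:.2f}")
print(f"estimated effect, m=1: {effect_m1:.2f}")
```

The learner itself only returns one estimate per individual; the subgroup contrast appears only after averaging within levels of a modifier that the analyst chose, which is where domain expertise enters.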
Pages: 14