An overview of modern machine learning methods for effect measure modification analyses in high-dimensional settings

Authors
Cheung, Michael [1]
Dimitrova, Anna [1]
Benmarhnia, Tarik [1]
Affiliations
[1] Univ Calif San Diego, Scripps Inst Oceanog, San Diego, CA USA
Keywords
Effect measure modification; Heterogeneity; Machine learning; Generalized random forest; Bayesian additive regression trees; Bayesian causal forest; Metalearner; Causal inference; Child undernutrition; Association; Regression; Selection
DOI
10.1016/j.ssmph.2025.101764
Chinese Library Classification number
R1 [Preventive medicine, hygiene]
Discipline classification codes
1004; 120402
Abstract
A primary concern of public health researchers is identifying and quantifying heterogeneous exposure effects across population subgroups. Understanding the magnitude and direction of these effects on a given scale enables researchers to recommend policy prescriptions and assess the external validity of findings. Traditional methods for effect measure modification analyses require manual model specification, which is often impractical or infeasible in high-dimensional settings. Recent developments in machine learning aim to solve this issue by using data-driven approaches to estimate heterogeneous exposure effects. However, these methods do not directly identify effect modifiers or estimate the corresponding subgroup effects. Consequently, additional analysis techniques are required to use these methods in the context of effect measure modification analyses. While no data-driven method or technique can identify effect modifiers, and domain expertise is still required, these tools may serve an important role in the discovery of vulnerable subgroups when prior knowledge is not available. We summarize and provide the intuition behind these machine learning methods and discuss how they may be employed for effect measure modification analyses, to serve as a reference for public health researchers. We discuss their implementation in R with annotated syntax and demonstrate their application in a case study assessing the heterogeneous effects of drought on stunting among children in the Demographic and Health Survey data set.
Pages: 14