Invited Commentary: Demystifying Statistical Inference When Using Machine Learning in Causal Research

被引:0
|
作者
Balzer, Laura B. [1 ,2 ]
Westling, Ted [3 ]
机构
[1] Univ Massachusetts Amherst, Dept Biostat & Epidemiol, 427 Arnold House, Amherst, MA 01003 USA
[2] Univ Massachusetts Amherst, Dept Biostat & Epidemiol, Amherst, MA USA
[3] Univ Massachusetts Amherst, Dept Math & Stat, Amherst, MA USA
关键词
causal inference; cross-fitting; cross-validation; doubly robust; machine learning; nonparametric; Super Learner; TMLE;
D O I
暂无
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
In this issue, Naimi et al. (Am J Epidemiol. XXXX;XXX(XX):XXXX-XXXX) discuss a critical topic in public health and beyond: obtaining valid statistical inference when using machine learning in causal research. In doing so, the authors review recent prominent methodological work and recommend: 1) doubly robust estimators, such as targeted maximum likelihood estimation (TMLE); 2) ensemble methods, such as Super Learner, to combine predictions from a diverse library of algorithms; and 3) sample splitting to reduce bias and improve inference. We largely agree with these recommendations. In this commentary, we highlight the critical importance of the Super Learner library. Specifically, in both simulation settings considered by the authors, we demonstrate that reductions in bias and improvements in confidence-interval coverage can be achieved using TMLE without sample splitting and with a Super Learner library that excludes tree-based methods but includes regression splines. Whether extremely data-adaptive algorithms and sample splitting are needed depends on the specific problem and should be informed by simulations reflecting the specific application. More research is needed on practical recommendations for selecting among these options in common situations arising in epidemiology.
引用
下载
收藏
页数:5
相关论文
共 50 条
  • [21] Recent Developments in Causal Inference and Machine Learning
    Brand, Jennie E.
    Zhou, Xiang
    Xie, Yu
    ANNUAL REVIEW OF SOCIOLOGY, 2023, 49 : 81 - 110
  • [22] Machine Learning and Causal Inference for Policy Evaluation
    Athey, Susan
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 5 - 6
  • [23] Developing a novel causal inference algorithm for personalized biomedical causal graph learning using meta machine learning
    Wu, Hang
    Shi, Wenqi
    Wang, May D.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [24] Statistical methods of causal inference in sports science research
    Li, Jinchao
    Open Cybernetics and Systemics Journal, 2015, 9 : 2070 - 2074
  • [25] A Theory of Statistical Inference for Matching Methods in Causal Research
    Iacus, Stefano M.
    King, Gary
    Porro, Giuseppe
    POLITICAL ANALYSIS, 2019, 27 (01) : 46 - 68
  • [26] Reflection on modern methods: when worlds collide-prediction, machine learning and causal inference
    Blakely, Tony
    Lynch, John
    Simons, Koen
    Bentley, Rebecca
    Rose, Sherri
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (06) : 2058 - 2064
  • [27] Causal ML: Python']Python package for causal inference machine learning
    Zhao, Yang
    Liu, Qing
    SOFTWAREX, 2023, 21
  • [28] Making the most of AI and machine learning in organizations and strategy research: Supervised machine learning, causal inference, and matching models
    Rathje, Jason
    Katila, Riitta
    Reineke, Philipp
    STRATEGIC MANAGEMENT JOURNAL, 2024, : 1926 - 1953
  • [29] Invited Commentary: Cross-Sectional Studies and Causal Inference-It's Complicated
    Barnett, Tracie A.
    Koushik, Anita
    Schuster, Tibor
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (04) : 517 - 519
  • [30] A Roadmap for Using Causal Inference and Machine Learning to Personalize Asthma Medication Selection
    Nkoy, Flory L.
    Stone, Bryan L.
    Zhang, Yue
    Luo, Gang
    JMIR MEDICAL INFORMATICS, 2024, 12