Invited Commentary: Demystifying Statistical Inference When Using Machine Learning in Causal Research

Cited by: 0
Authors
Balzer, Laura B. [1 ,2 ]
Westling, Ted [3 ]
Affiliations
[1] Univ Massachusetts Amherst, Dept Biostat & Epidemiol, 427 Arnold House, Amherst, MA 01003 USA
[2] Univ Massachusetts Amherst, Dept Biostat & Epidemiol, Amherst, MA USA
[3] Univ Massachusetts Amherst, Dept Math & Stat, Amherst, MA USA
Keywords
causal inference; cross-fitting; cross-validation; doubly robust; machine learning; nonparametric; Super Learner; TMLE;
DOI
Not available
Chinese Library Classification number
R1 [Preventive medicine, hygiene];
Subject classification number
1004 ; 120402 ;
Abstract
In this issue, Naimi et al. (Am J Epidemiol. XXXX;XXX(XX):XXXX-XXXX) discuss a critical topic in public health and beyond: obtaining valid statistical inference when using machine learning in causal research. In doing so, the authors review recent prominent methodological work and recommend: 1) doubly robust estimators, such as targeted maximum likelihood estimation (TMLE); 2) ensemble methods, such as Super Learner, to combine predictions from a diverse library of algorithms; and 3) sample splitting to reduce bias and improve inference. We largely agree with these recommendations. In this commentary, we highlight the critical importance of the Super Learner library. Specifically, in both simulation settings considered by the authors, we demonstrate that reductions in bias and improvements in confidence-interval coverage can be achieved using TMLE without sample splitting and with a Super Learner library that excludes tree-based methods but includes regression splines. Whether extremely data-adaptive algorithms and sample splitting are needed depends on the specific problem and should be informed by simulations reflecting the specific application. More research is needed on practical recommendations for selecting among these options in common situations arising in epidemiology.
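To make two of the ingredients named in the abstract concrete (a doubly robust estimator and sample splitting), the following is a minimal sketch rather than the authors' method. It swaps in a cross-fitted augmented inverse probability weighting (AIPW) estimator in place of TMLE, and plain logistic and linear regressions in place of a Super Learner library. The simulated data, the true effect of 1.0, the truncation bounds on the propensity score, and all function names are assumptions made for illustration only.

```python
import numpy as np

def _design(X):
    # Add an intercept column to the covariate matrix.
    return np.column_stack([np.ones(len(X)), X])

def fit_logistic(X, a, n_iter=25):
    # Logistic regression fitted by Newton-Raphson (stand-in for a learner library).
    Xd = _design(X)
    beta = np.zeros(Xd.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-Xd @ beta))
        hess = (Xd * (p * (1.0 - p))[:, None]).T @ Xd
        beta += np.linalg.solve(hess, Xd.T @ (a - p))
    return beta

def predict_logistic(beta, X):
    return 1.0 / (1.0 + np.exp(-_design(X) @ beta))

def fit_linear(X, y):
    # Ordinary least squares for the outcome regression.
    return np.linalg.lstsq(_design(X), y, rcond=None)[0]

def predict_linear(coef, X):
    return _design(X) @ coef

def cross_fit_aipw(W, A, Y, n_folds=5, seed=0):
    # Cross-fitted AIPW estimate of the average treatment effect:
    # nuisance models are fit on the training folds and evaluated on the held-out fold.
    rng = np.random.default_rng(seed)
    n = len(Y)
    folds = rng.permutation(n) % n_folds
    phi = np.empty(n)
    for k in range(n_folds):
        train, test = folds != k, folds == k
        # Propensity score, truncated away from 0 and 1 (bounds are an assumption).
        g = np.clip(predict_logistic(fit_logistic(W[train], A[train]), W[test]), 0.01, 0.99)
        # Outcome regressions fit separately in treated and untreated training units.
        tr1, tr0 = train & (A == 1), train & (A == 0)
        q1 = predict_linear(fit_linear(W[tr1], Y[tr1]), W[test])
        q0 = predict_linear(fit_linear(W[tr0], Y[tr0]), W[test])
        # Doubly robust (influence-function-based) contribution of each held-out unit.
        phi[test] = (q1 - q0
                     + A[test] / g * (Y[test] - q1)
                     - (1 - A[test]) / (1 - g) * (Y[test] - q0))
    ate = phi.mean()
    se = phi.std(ddof=1) / np.sqrt(n)
    return ate, (ate - 1.96 * se, ate + 1.96 * se)

# Simulated example with a known average treatment effect of 1.0 (illustrative only).
rng = np.random.default_rng(42)
n = 2000
W = rng.normal(size=(n, 2))
p_treat = 1.0 / (1.0 + np.exp(-(0.5 * W[:, 0] - 0.5 * W[:, 1])))
A = rng.binomial(1, p_treat).astype(float)
Y = 1.0 * A + W[:, 0] + 0.5 * W[:, 1] + rng.normal(size=n)

ate, ci = cross_fit_aipw(W, A, Y)
print(f"ATE estimate: {ate:.3f}, 95% CI: ({ci[0]:.3f}, {ci[1]:.3f})")
```

Because both nuisance models are correctly specified in this simulation, the sketch says nothing about the commentary's central question of which learner library to use or whether sample splitting is needed; it only shows where those choices enter the estimator.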
Pages: 5
Related papers
50 in total
  • [1] Invited Commentary: Demystifying Statistical Inference When Using Machine Learning in Causal Research
    Balzer, Laura B.
    Westling, Ted
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (09) : 1545 - 1549
  • [2] INVITED COMMENTARY: DEMYSTIFYING STATISTICAL INFERENCE WHEN USING MACHINE LEARNING IN CAUSAL RESEARCH (vol 192, pg 1545, 2023)
    Balzer, L. B.
    Westling, T.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (09) : 1607 - 1608
  • [3] When Causal Inference Meets Graph Machine Learning
    Ma, Jing
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22676 - 22676
  • [4] Invited Commentary: Machine Learning in Causal Inference-How Do I Love Thee? Let Me Count the Ways
    Balzer, Laura B.
    Petersen, Maya L.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2021, 190 (08) : 1483 - 1487
  • [5] JUDGMENT AND CAUSAL INFERENCE - CRITERIA IN EPIDEMIOLOGIC STUDIES - INVITED COMMENTARY
    WINKELSTEIN, W
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 1995, 141 (08) : 699 - 700
  • [6] CURRENT APPLICATIONS OF MACHINE LEARNING FOR CAUSAL INFERENCE IN HEALTHCARE RESEARCH USING OBSERVATIONAL DATA
    Onasanya, O.
    Hoffman, S.
    Harris, K.
    Dixon, R.
    Grabner, M.
    VALUE IN HEALTH, 2024, 27 (06) : S266 - S266
  • [7] Invited Commentary: Making Causal Inference More Social and (Social) Epidemiology More Causal
    Jackson, John W.
    Arah, Onyebuchi A.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2020, 189 (03) : 179 - 182
  • [8] Machine learning in causal inference for epidemiology
    Chiara Moccia
    Giovenale Moirano
    Maja Popovic
    Costanza Pizzi
    Piero Fariselli
    Lorenzo Richiardi
    Claus Thorn Ekstrøm
    Milena Maule
    European Journal of Epidemiology, 2024, 39 (10) : 1097 - 1108
  • [9] On the relationship of machine learning with causal inference
    Lin, Sheng-Hsuan
    Ikram, Mohammad Arfan
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2020, 35 (02) : 183 - 185
  • [10] Machine learning for causal inference in Biostatistics
    Rose, Sherri
    Rizopoulos, Dimitris
    BIOSTATISTICS, 2020, 21 (02) : 336 - 338