Invited Commentary: Demystifying Statistical Inference When Using Machine Learning in Causal Research

被引:0
|
作者
Balzer, Laura B. [1 ,2 ]
Westling, Ted [3 ]
机构
[1] Univ Massachusetts Amherst, Dept Biostat & Epidemiol, 427 Arnold House, Amherst, MA 01003 USA
[2] Univ Massachusetts Amherst, Dept Biostat & Epidemiol, Amherst, MA USA
[3] Univ Massachusetts Amherst, Dept Math & Stat, Amherst, MA USA
关键词
causal inference; cross-fitting; cross-validation; doubly robust; machine learning; nonparametric; Super Learner; TMLE;
D O I
暂无
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
In this issue, Naimi et al. (Am J Epidemiol. XXXX;XXX(XX):XXXX-XXXX) discuss a critical topic in public health and beyond: obtaining valid statistical inference when using machine learning in causal research. In doing so, the authors review recent prominent methodological work and recommend: 1) doubly robust estimators, such as targeted maximum likelihood estimation (TMLE); 2) ensemble methods, such as Super Learner, to combine predictions from a diverse library of algorithms; and 3) sample splitting to reduce bias and improve inference. We largely agree with these recommendations. In this commentary, we highlight the critical importance of the Super Learner library. Specifically, in both simulation settings considered by the authors, we demonstrate that reductions in bias and improvements in confidence-interval coverage can be achieved using TMLE without sample splitting and with a Super Learner library that excludes tree-based methods but includes regression splines. Whether extremely data-adaptive algorithms and sample splitting are needed depends on the specific problem and should be informed by simulations reflecting the specific application. More research is needed on practical recommendations for selecting among these options in common situations arising in epidemiology.
引用
下载
收藏
页数:5
相关论文
共 50 条
  • [41] Statistical Appliance Inference in the Smart Grid by Machine Learning
    Bilgin, Zeki
    Tomur, Emrah
    Ersoy, Mehmet Akif
    Soykan, Elif Ustundag
    2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (IEEE PIMRC WORKSHOPS), 2019,
  • [42] Statistical inference and statistical learning in economic research - selected challenges
    Jajuga, Krzysztof
    Pociecha, Jozef
    Szreder, Miroslaw
    EKONOMISTA, 2024, (02): : 138 - 154
  • [43] Invited Commentary: What Social Epidemiology Brings to the Table-Reconciling Social Epidemiology and Causal Inference
    Robinson, Whitney R.
    Bailey, Zinzi D.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2020, 189 (03) : 171 - 174
  • [44] Invited Commentary: Agent-Based Models for Causal Inference-Reweighting Data and Theory in Epidemiology
    Hernan, Miguel A.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2015, 181 (02) : 103 - 105
  • [45] Invited Commentary: Causal Inference Across Space and Time-Quixotic Quest, Worthy Goal, or Both?
    Edwards, Jessie K.
    Lesko, Catherine R.
    Keil, Alexander P.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2017, 186 (02) : 143 - 145
  • [46] Critical issues in statistical causal inference for observational physics education research
    Adlakha, Vidushi
    Kuo, Eric
    PHYSICAL REVIEW PHYSICS EDUCATION RESEARCH, 2023, 19 (02):
  • [47] Optimal subset selection for causal inference using machine learning ensembles and particle swarm optimization
    Dhruv Sharma
    Christopher Willy
    John Bischoff
    Complex & Intelligent Systems, 2021, 7 : 41 - 59
  • [48] Finding predictors and causes of cardiac surgery ICU readmission using machine learning and causal inference
    Lee, J.
    Ahmidi, N.
    Srinivasan, R.
    Alejo, D.
    Dinatale, J.
    Schena, S.
    Whitman, G.
    Sussman, M.
    Shpitser, I.
    EUROPEAN HEART JOURNAL, 2019, 40 : 3575 - 3575
  • [49] Optimal subset selection for causal inference using machine learning ensembles and particle swarm optimization
    Sharma, Dhruv
    Willy, Christopher
    Bischoff, John
    COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (01) : 41 - 59
  • [50] Interpretable Machine Learning Techniques for Causal Inference Using Balancing Scores As Meta-features
    Nohara, Yasunobu
    Iihara, Koji
    Nakashima, Naoki
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4042 - 4045