Causal Bandits for Linear Structural Equation Models

被引:0
|
作者
Varici, Burak [1 ]
Shanmugam, Karthikeyan [2 ]
Sattigeri, Prasanna [3 ]
Tajer, Ali [1 ]
机构
[1] Rensselaer Polytech Inst, Elect Comp & Syst Engn, Troy, NY 12180 USA
[2] Google Res India, Bangalore, India
[3] IBM Res, MIT IBM Watson Lab, Cambridge, MA 02142 USA
关键词
Causal bandits; multi-armed bandits; causality; linear SEMs; cumulative regret;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies the problem of designing an optimal sequence of interventions in a causal graphical model to minimize cumulative regret with respect to the best intervention in hindsight. This is, naturally, posed as a causal bandit problem. The focus is on causal bandits for linear structural equation models (SEMs) and soft interventions. It is assumed that the graph's structure is known and has N nodes. Two linear mechanisms, one soft intervention and one observational, are assumed for each node, giving rise to 2N possible interventions. The majority of the existing causal bandit algorithms assume that at least the interventional distributions of the reward node's parents are fully specified. However, there are 2N such distributions (one corresponding to each intervention), acquiring which becomes prohibitive even in moderate-sized graphs. This paper dispenses with the assumption of knowing these distributions or their marginals. Two algorithms are proposed for the frequentist (UCB-based) and Bayesian (Thompson sampling-based) settings. The key idea of these algorithms is to avoid directly estimating the 2N reward distributions and instead estimate the parameters that fully specify the SEMs (linear in N) and use them to compute the rewards. In both algorithms, under boundedness assumptions on noise and the parameter space, the cumulative regrets scal
引用
收藏
页数:59
相关论文
共 50 条
  • [41] Bayesian identification of structural coefficients in causal models and the causal false-positive risk of confounders and colliders in linear Markovian models
    Kelter, Riko
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [42] Empirical likelihood for linear structural equation models with dependent errors
    Wang, Y. Samuel
    Drton, Mathias
    [J]. STAT, 2017, 6 (01): : 434 - 447
  • [43] Bayesian identification of structural coefficients in causal models and the causal false-positive risk of confounders and colliders in linear Markovian models
    Riko Kelter
    [J]. BMC Medical Research Methodology, 22
  • [44] Causal Bandits with Propagating Inference
    Yabe, Akihiro
    Hatano, Daisuke
    Sumita, Hanna
    Ito, Shinji
    Kakimura, Naonori
    Fukunaga, Takuro
    Kawarabayashi, Ken-ichi
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [45] Confounded Budgeted Causal Bandits
    Jamshidi, Fateme
    Etesami, Jalal
    Kiyavash, Negar
    [J]. CAUSAL LEARNING AND REASONING, VOL 236, 2024, 236 : 423 - 461
  • [46] Causal Language in Structural Equation Modeling
    Sanchez-Iglesias, Ivan
    Aguayo-Estremera, Raimundo
    Miguel-Alvaro, Alejandro
    Paniagua, David
    [J]. REVISTA IBEROAMERICANA DE DIAGNOSTICO Y EVALUACION-E AVALIACAO PSICOLOGICA, 2022, 5 (66): : 35 - 51
  • [47] Constraint-based Causal Discovery for Non-Linear Structural Causal Models with Cycles and Latent Confounders
    Forre, Patrick
    Mooij, Joris M.
    [J]. UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 269 - 278
  • [48] Linear Causal Modeling With Structural Equations
    Markus, Keith A.
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2012, 19 (04) : 703 - 710
  • [49] Structure Learning of Linear Gaussian Structural Equation Models with Weak Edges
    Eigenmann, Marco F.
    Nandy, Preetam
    Maathuis, Marloes H.
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [50] Learning linear structural equation models in polynomial time and sample complexity
    Ghoshal, Asish
    Honorio, Jean
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84