Reinforcement Learning of Causal Variables Using Mediation Analysis

被引:0
|
作者
Herlau, Tue [1 ]
Larsen, Rasmus [2 ]
机构
[1] Tech Univ Denmark, DK-2800 Lyngby, Denmark
[2] Alexandra Inst, DK-2300 Copenhagen, Denmark
关键词
STATISTICS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the problem of acquiring causal representations and concepts in a reinforcement learning setting. Our approach defines a causal variable as being both manipulable by a policy, and able to predict the outcome. We thereby obtain a parsimonious causal graph in which interventions occur at the level of policies. The approach avoids defining a generative model of the data, prior pre-processing, or learning the transition kernel of the Markov decision process. Instead, causal variables and policies are determined by maximizing a new optimization target inspired by mediation analysis, which differs from the expected return. The maximization is accomplished using a generalization of Bellman's equation which is shown to converge, and the method finds meaningful causal representations in a simulated environment.
引用
收藏
页码:6910 / 6917
页数:8
相关论文
共 50 条
  • [31] Causal Mediation Analysis with Hidden Confounders
    Cheng, Lu
    Guo, Ruocheng
    Liu, Huan
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 113 - 122
  • [32] On Causal Mediation Analysis with a Survival Outcome
    Tchetgen, Eric J. Tchetgen
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2011, 7 (01):
  • [33] Causal Mediation Analysis in Sibling Studies
    Wood, Mollie
    Nordeng, Hedvig
    Hernandez-Diaz, Sonia
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 88 - 89
  • [34] Using mediation analysis to identify causal mechanisms in disease management interventions
    Linden A.
    Karlson K.B.
    [J]. Health Services and Outcomes Research Methodology, 2013, 13 (2-4) : 86 - 108
  • [35] Investigating Gender Bias in Language Models Using Causal Mediation Analysis
    Vig, Jesse
    Gehrmann, Sebastian
    Belinkov, Yonatan
    Qian, Sharon
    Nevo, Daniel
    Singer, Yaron
    Shieber, Stuart
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [36] Commentary: Causal Effects in Mediation Modeling: An Introduction with Applications to Latent Variables
    Coman, Emil N.
    Thoemmes, Felix
    Fifield, Judith
    [J]. FRONTIERS IN PSYCHOLOGY, 2017, 8
  • [37] REINFORCEMENT VARIABLES AND LEARNING IN RETARDATES AND NORMALS
    GUTHRIE, GM
    MEYN, CF
    QUACKENB.JF
    BUTLER, AJ
    [J]. PENNSYLVANIA PSYCHIATRIC QUARTERLY, 1966, 6 (03): : 20 - &
  • [38] Causal Mediation Analysis in the Presence of Post-treatment Confounding Variables: A Monte Carlo Simulation Study
    Kisbu-Sakarya, Yasemin
    MacKinnon, David P.
    Valente, Matthew J.
    Cetinkaya, Esra
    [J]. FRONTIERS IN PSYCHOLOGY, 2020, 11
  • [39] Causal Discovery by Graph Attention Reinforcement Learning
    Yang, Dezhi
    Yu, Guoxian
    Wang, Jun
    Yan, Zhongmin
    Guo, Maozu
    [J]. PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 28 - 36
  • [40] Explainable Reinforcement Learning through a Causal Lens
    Madumal, Prashan
    Miller, Tim
    Sonenberg, Liz
    Vetere, Frank
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 2493 - 2500