Initial State Interventions for Deconfounded Imitation Learning

被引:0
|
作者
Pfrommer, Samuel [1 ]
Bai, Yatong [1 ]
Lee, Hyunin [1 ]
Sojoudi, Somayeh [1 ]
机构
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
关键词
D O I
10.1109/CDC49753.2023.10383252
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Imitation learning suffers from causal confusion. This phenomenon occurs when learned policies attend to features that do not causally influence the expert actions but are instead spuriously correlated. Causally confused agents produce low open-loop supervised loss but poor closed-loop performance upon deployment. We consider the problem of masking observed confounders in a disentangled representation of the observation space. Our novel masking algorithm leverages the usual ability to intervene in the initial system state, avoiding any requirement involving expert querying, expert reward functions, or causal graph specification. Under certain assumptions, we theoretically prove that this algorithm is conservative in the sense that it does not incorrectly mask observations that causally influence the expert; furthermore, intervening on the initial state serves to strictly reduce excess conservatism. The masking algorithm is applied to behavior cloning for two illustrative control systems: CartPole and Reacher.
引用
收藏
页码:2312 / 2319
页数:8
相关论文
共 50 条
  • [21] Domain-Adversarial and -Conditional State Space Model for Imitation Learning
    Okumura, Ryo
    Okada, Masashi
    Taniguchi, Tadahiro
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5179 - 5186
  • [22] Adaptive scheduling for Internet of Vehicles using deconfounded graph transfer learning
    Liu, Xiuwen
    Wang, Shuo
    Chen, Yanjiao
    COMPUTER NETWORKS, 2025, 256
  • [23] Contrastive Initial State Buffer for Reinforcement Learning
    Messikommer, Nico
    Song, Yunlong
    Scaramuzza, Davide
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2866 - 2872
  • [24] Social Learning and Imitation
    Wile
    AMERICAN JOURNAL OF ORTHOPSYCHIATRY, 1942, 12 (04) : 743 - 743
  • [25] Imitation and the effort of learning
    Williams, Justin H. G.
    BEHAVIORAL AND BRAIN SCIENCES, 2008, 31 (01) : 40 - +
  • [26] Social Learning and Imitation
    Flugel, J. C.
    INTERNATIONAL JOURNAL OF PSYCHOANALYSIS, 1943, 24 : 85 - 87
  • [27] Quantum Imitation Learning
    Cheng, Zhihao
    Zhang, Kaining
    Shen, Li
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14190 - 14204
  • [28] Social Learning and Imitation
    不详
    PSYCHIATRIC QUARTERLY, 1942, 16 (04) : 820 - 821
  • [29] Relational Learning by Imitation
    Bombini, Grazia
    Di Mauro, Nicola
    Basile, Teresa M. A.
    Ferilli, Stefano
    Esposito, Floriana
    AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2009, 5559 : 273 - 282
  • [30] Deconfounded Opponent Intention Inference for Football Multi-Player Policy Learning
    Wang, Shijie
    Pan, Yi
    Pu, Zhiqiang
    Liu, Boyin
    Yi, Jianqiang
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 8054 - 8061