Ordering-Based Causal Discovery with Reinforcement Learning

Cited by: 0
Authors
Wang, Xiaoqiang [1]
Du, Yali [2]
Zhu, Shengyu [3]
Ke, Liangjun [1]
Chen, Zhitang [3]
Hao, Jianye [3, 4]
Wang, Jun [2]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Automat Sci & Engn, State Key Lab Mfg Syst Engn, Xian, Peoples R China
[2] UCL, London, England
[3] Huawei Noahs Ark Lab, Quebec City, PQ, Canada
[4] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Discovering causal relations among a set of variables is a long-standing question in many empirical sciences. Recently, Reinforcement Learning (RL) has achieved promising results in causal discovery from observational data. However, searching the space of directed graphs and enforcing acyclicity by implicit penalties tend to be inefficient and restrict the existing RL-based method to small-scale problems. In this work, we propose a novel RL-based approach for causal discovery that incorporates RL into the ordering-based paradigm. Specifically, we formulate the ordering search problem as a multi-step Markov decision process, implement the ordering generating process with an encoder-decoder architecture, and finally use RL to optimize the proposed model based on the reward mechanisms designed for each ordering. A generated ordering is then processed using variable selection to obtain the final causal graph. We analyze the consistency and computational complexity of the proposed method, and empirically show that a pretrained model can be exploited to accelerate training. Experimental results on both synthetic and real data sets show that the proposed method achieves much improved performance over the existing RL-based method.
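The last step described in the abstract, turning a variable ordering into a graph by variable selection, can be illustrated with a minimal sketch. It assumes linear relations with additive noise and uses thresholded least-squares coefficients as a stand-in for variable selection; the function name `ordering_to_dag`, the threshold value, and the toy data are hypothetical and are not taken from the paper. The RL-based ordering search itself is not shown.

```python
import numpy as np

def ordering_to_dag(X, ordering, threshold=0.3):
    """Hypothetical helper: regress each variable on its predecessors in
    `ordering` and keep predecessors whose coefficient magnitude exceeds
    `threshold`. adj[i, j] = 1 denotes an edge i -> j."""
    d = X.shape[1]
    adj = np.zeros((d, d), dtype=int)
    for pos, j in enumerate(ordering):
        parents = list(ordering[:pos])  # only earlier variables may be parents
        if not parents:
            continue
        coef, *_ = np.linalg.lstsq(X[:, parents], X[:, j], rcond=None)
        for p, w in zip(parents, coef):
            if abs(w) > threshold:
                adj[p, j] = 1
    return adj

# Toy data (assumed for illustration): a chain x0 -> x1 -> x2.
rng = np.random.default_rng(0)
n = 2000
x0 = rng.normal(size=n)
x1 = 2.0 * x0 + rng.normal(size=n)
x2 = -1.5 * x1 + rng.normal(size=n)
X = np.column_stack([x0, x1, x2])

adj = ordering_to_dag(X, ordering=[0, 1, 2])
print(adj)
```

Because edges only ever point from earlier to later positions in the ordering, the result is acyclic by construction, which is why searching over orderings avoids the explicit acyclicity penalties the abstract mentions.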
Pages: 3566-3573
Number of pages: 8