Incorporating Causal Graphical Prior Knowledge into Predictive Modeling via Simple Data Augmentation

被引:0
|
作者
Teshima, Takeshi [1 ,2 ]
Sugiyama, Masashi [1 ,2 ]
机构
[1] Univ Tokyo, Grad Sch Frontier Sci, Tokyo, Japan
[2] RIKEN, Wako, Saitama, Japan
来源
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 161 | 2021年 / 161卷
关键词
DISCOVERY; NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Causal graphs (CGs) are compact representations of the knowledge of the data generating processes behind the data distributions. When a CG is available, e.g., from the domain knowledge, we can infer the conditional independence (CI) relations that should hold in the data distribution. However, it is not straightforward how to incorporate this knowledge into predictive modeling. In this work, we propose a model-agnostic data augmentation method that allows us to exploit the prior knowledge of the CI encoded in a CG for supervised machine learning. We theoretically justify the proposed method by providing an excess risk bound indicating that the proposed method suppresses overfitting by reducing the apparent complexity of the predictor hypothesis class. Using real-world data with CGs provided by domain experts, we experimentally show that the proposed method is effective in improving the prediction accuracy, especially in the small-data regime.
引用
收藏
页码:86 / 96
页数:11
相关论文
共 50 条
  • [31] Automatic Semantic Modeling for Structural Data Source with the Prior Knowledge from Knowledge Base
    Xu, Jiakang
    Mayer, Wolfgang
    Zhang, Hongyu
    He, Keqing
    Feng, Zaiwen
    MATHEMATICS, 2022, 10 (24)
  • [32] Incorporating I Ching Knowledge Into Prediction Task via Data Mining
    Liu, Wenjie
    Chen, Sai
    Huang, Guoyao
    Lu, Lingfeng
    Li, Huakang
    Sun, Guozi
    JOURNAL OF DATABASE MANAGEMENT, 2023, 34 (03)
  • [33] Point-PC: Point cloud completion guided by prior knowledge via causal inference
    Gao, Xuesong
    Jiao, Chuanqi
    Chen, Ruidong
    Wang, Weijie
    Nie, Weizhi
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2025,
  • [34] Bayesian regression discontinuity designs: incorporating clinical knowledge in the causal analysis of primary care data
    Geneletti, Sara
    O'Keeffe, Aidan G.
    Sharples, Linda D.
    Richardson, Sylvia
    Baio, Gianluca
    STATISTICS IN MEDICINE, 2015, 34 (15) : 2334 - 2352
  • [35] Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO
    Yiming Zuo
    Yi Cui
    Guoqiang Yu
    Ruijiang Li
    Habtom W. Ressom
    BMC Bioinformatics, 18
  • [36] Incorporating prior biological knowledge for network-based differential gene expression analysis using differentially weighted graphical LASSO
    Zuo, Yiming
    Cui, Yi
    Yu, Guoqiang
    Li, Ruijiang
    Ressom, Habtom W.
    BMC BIOINFORMATICS, 2017, 18
  • [37] Neural network analysis of neutron and X-ray reflectivity data incorporating prior knowledge
    Munteanu, Valentin
    Starostin, Vladimir
    Greco, Alessandro
    Pithan, Linus
    Gerlach, Alexander
    Hinderhofer, Alexander
    Kowarik, Stefan
    Schreiber, Frank
    Journal of Applied Crystallography, 2024, 57 (Pt 2) : 456 - 469
  • [38] Neural network analysis of neutron and X-ray reflectivity data incorporating prior knowledge
    Munteanu, Valentin
    Starostin, Vladimir
    Greco, Alessandro
    Pithan, Linus
    Gerlach, Alexander
    Hinderhofer, Alexander
    Kowarik, Stefan
    Schreiber, Frank
    JOURNAL OF APPLIED CRYSTALLOGRAPHY, 2024, 57 : 456 - 469
  • [39] Knowledge Base Question Generation via Data Augmentation with Dynamic-Prompt
    Zhao, Long
    Xu, Yin
    Wang, Yanyan
    Li, Fei
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT VII, 2024, 15022 : 249 - 261
  • [40] Biomedical event causal relation extraction with deep knowledge fusion and Roberta-based data augmentation
    Li, Lishuang
    Xiang, Yi
    Hao, Jing
    METHODS, 2024, 231 : 8 - 14