Out-of-Distribution Generalization With Causal Feature Separation

被引:3
|
作者
Wang, Haotian [1 ,2 ]
Kuang, Kun [3 ]
Lan, Long [1 ,2 ]
Wang, Zige [4 ]
Huang, Wanrong [1 ,2 ]
Wu, Fei [5 ]
Yang, Wenjing [1 ,2 ]
机构
[1] Natl Univ Def Technol, Inst Quantum Informat, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Comp Sci & Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci & Technol, Key Lab Corneal Dis Res Zhejiang Prov, Hangzhou 310027, Zhejiang, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
[5] Zhejiang Univ, Inst Artificial Intelligence, Shanghai Inst Adv Study, Shanghai AI Lab, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Causal features separation; OOD generalization; selection bias; stable prediction; PREDICTION; SELECTION;
D O I
10.1109/TKDE.2023.3312255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Driven by empirical risk minimization, machine learning algorithm tends to exploit subtle statistical correlations existing in the training environment for prediction, while the spurious correlations are unstable across environments, leading to poor generalization performance. Accordingly, the problem of the Out-of-distribution (OOD) generalization aims to exploit an invariant/stable relationship between features and outcomes that generalizes well on all possible environments. To address the spurious correlation induced by the selection bias, in this article, we propose a novel Clique-based Causal Feature Separation (CCFS) algorithm by explicitly incorporating the causal structure to identify causal features of outcome for OOD generalization. Specifically, the proposed CCFS algorithm identifies the largest clique in the learned causal skeleton. Theoretically, we guarantee that either the largest clique or the rest of the causal skeleton is exactly the set of all causal features of the outcome. Finally, we separate the causal features from the non-causal ones with a sample-reweighting decorrelator for OOD prediction. Extensive experiments validate the effectiveness of the proposed CCFS method on both causal feature identification and OOD generalization tasks.
引用
收藏
页码:1758 / 1772
页数:15
相关论文
共 50 条
  • [41] CausPref: Causal Preference Learning for Out-of-Distribution Recommendation
    He, Yue
    Wang, Zimu
    Cui, Peng
    Zou, Hao
    Zhang, Yafeng
    Cui, Qiang
    Jiang, Yong
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 410 - 421
  • [42] Learning Causal Semantic Representation for Out-of-Distribution Prediction
    Liu, Chang
    Sun, Xinwei
    Wang, Jindong
    Tang, Haoyue
    Li, Tao
    Qin, Tao
    Chen, Wei
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] Supervision Adaptation Balancing In-Distribution Generalization and Out-of-Distribution Detection
    Zhao, Zhilin
    Cao, Longbing
    Lin, Kun-Yu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15743 - 15758
  • [44] Out-of-Distribution Generalization by Neural-Symbolic Joint Training
    Liu, Anji
    Xu, Hongming
    Van den Broeck, Guy
    Liang, Yitao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 12252 - 12259
  • [45] Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data
    Ramachandran, Sai Niranjan
    Mukhopadhyay, Rudrabha
    Agarwal, Madhav
    Jawahar, C. V.
    Namboodiri, Vinay
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14767 - 14775
  • [46] An Out-of-Distribution Generalization Framework Based on Variational Backdoor Adjustment
    Su, Hang
    Wang, Wei
    MATHEMATICS, 2024, 12 (01)
  • [47] Targeted Data-driven Regularization for Out-of-Distribution Generalization
    Kamani, Mohammad Mahdi
    Farhang, Sadegh
    Mahdavi, Mehrdad
    Wang, James Z.
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 882 - 891
  • [48] The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
    Hendrycks, Dan
    Basart, Steven
    Mu, Norman
    Kadavath, Saurav
    Wang, Frank
    Dorundo, Evan
    Desai, Rahul
    Zhu, Tyler
    Parajuli, Samyak
    Guo, Mike
    Song, Dawn
    Steinhardt, Jacob
    Gilmer, Justin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8320 - 8329
  • [49] Improving Out-of-Distribution Generalization by Adversarial Training with Structured Priors
    Wang, Qixun
    Wang, Yifei
    Zhu, Hong
    Wang, Yisen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalization
    Yang, Ling
    Zheng, Jiayi
    Wang, Heyuan
    Liu, Zhongyi
    Huang, Zhilin
    Hong, Shenda
    Zhang, Wentao
    Cui, Bin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 682 - 693