Out-of-Distribution Generalization With Causal Feature Separation

被引:3
|
作者
Wang, Haotian [1 ,2 ]
Kuang, Kun [3 ]
Lan, Long [1 ,2 ]
Wang, Zige [4 ]
Huang, Wanrong [1 ,2 ]
Wu, Fei [5 ]
Yang, Wenjing [1 ,2 ]
机构
[1] Natl Univ Def Technol, Inst Quantum Informat, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Comp Sci & Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci & Technol, Key Lab Corneal Dis Res Zhejiang Prov, Hangzhou 310027, Zhejiang, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
[5] Zhejiang Univ, Inst Artificial Intelligence, Shanghai Inst Adv Study, Shanghai AI Lab, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Causal features separation; OOD generalization; selection bias; stable prediction; PREDICTION; SELECTION;
D O I
10.1109/TKDE.2023.3312255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Driven by empirical risk minimization, machine learning algorithm tends to exploit subtle statistical correlations existing in the training environment for prediction, while the spurious correlations are unstable across environments, leading to poor generalization performance. Accordingly, the problem of the Out-of-distribution (OOD) generalization aims to exploit an invariant/stable relationship between features and outcomes that generalizes well on all possible environments. To address the spurious correlation induced by the selection bias, in this article, we propose a novel Clique-based Causal Feature Separation (CCFS) algorithm by explicitly incorporating the causal structure to identify causal features of outcome for OOD generalization. Specifically, the proposed CCFS algorithm identifies the largest clique in the learned causal skeleton. Theoretically, we guarantee that either the largest clique or the rest of the causal skeleton is exactly the set of all causal features of the outcome. Finally, we separate the causal features from the non-causal ones with a sample-reweighting decorrelator for OOD prediction. Extensive experiments validate the effectiveness of the proposed CCFS method on both causal feature identification and OOD generalization tasks.
引用
收藏
页码:1758 / 1772
页数:15
相关论文
共 50 条
  • [1] Causal softmax for out-of-distribution generalization
    Luo, Jing
    Zhao, Wanqing
    Peng, Jinye
    DIGITAL SIGNAL PROCESSING, 2025, 156
  • [2] Out-of-distribution Generalization with Causal Invariant Transformations
    Wang, Ruoyu
    Yi, Mingyang
    Chen, Zhitang
    Zhu, Shengyu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 375 - 385
  • [3] Deep Relevant Feature Focusing for Out-of-Distribution Generalization
    Wang, Fawu
    Zhang, Kang
    Liu, Zhengyu
    Yuan, Xia
    Zhao, Chunxia
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 245 - 253
  • [4] Understanding and Improving Feature Learning for Out-of-Distribution Generalization
    Chen, Yongqiang
    Huang, Wei
    Zhou, Kaiwen
    Bian, Yatao
    Han, Bo
    Cheng, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning
    Sui, Yongduo
    Mao, Wenyu
    Wang, Shuyao
    Wang, Xiang
    Wu, Jiancan
    He, Xiangnan
    Chua, Tat-Seng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (05)
  • [6] Certifiable Out-of-Distribution Generalization
    Ye, Nanyang
    Zhu, Lin
    Wang, Jia
    Zeng, Zhaoyu
    Shao, Jiayao
    Peng, Chensheng
    Pan, Bikang
    Li, Kaican
    Zhu, Jun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10927 - 10935
  • [7] Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization
    Gui, Shurui
    Liu, Meng
    Li, Xiner
    Luo, Youzhi
    Ji, Shuiwang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] A survey of out-of-distribution generalization for graph machine learning from a causal view
    Ma, Jing
    AI MAGAZINE, 2024,
  • [9] Out-of-Distribution Generalization in Kernel Regression
    Canatar, Abdulkadir
    Bordelon, Blake
    Pehlevan, Cengiz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation
    Bai, Haoyue
    Sun, Rui
    Hong, Lanqing
    Zhou, Fengwei
    Ye, Nanyang
    Ye, Han-Jia
    Chan, S-H Gary
    Li, Zhenguo
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6705 - 6713