Out-of-Distribution Generalization With Causal Feature Separation

被引:3
|
作者
Wang, Haotian [1 ,2 ]
Kuang, Kun [3 ]
Lan, Long [1 ,2 ]
Wang, Zige [4 ]
Huang, Wanrong [1 ,2 ]
Wu, Fei [5 ]
Yang, Wenjing [1 ,2 ]
机构
[1] Natl Univ Def Technol, Inst Quantum Informat, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Comp Sci & Technol, State Key Lab High Performance Comp, Changsha 410073, Peoples R China
[3] Zhejiang Univ, Coll Comp Sci & Technol, Key Lab Corneal Dis Res Zhejiang Prov, Hangzhou 310027, Zhejiang, Peoples R China
[4] Peking Univ, Beijing 100871, Peoples R China
[5] Zhejiang Univ, Inst Artificial Intelligence, Shanghai Inst Adv Study, Shanghai AI Lab, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Causal features separation; OOD generalization; selection bias; stable prediction; PREDICTION; SELECTION;
D O I
10.1109/TKDE.2023.3312255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Driven by empirical risk minimization, machine learning algorithm tends to exploit subtle statistical correlations existing in the training environment for prediction, while the spurious correlations are unstable across environments, leading to poor generalization performance. Accordingly, the problem of the Out-of-distribution (OOD) generalization aims to exploit an invariant/stable relationship between features and outcomes that generalizes well on all possible environments. To address the spurious correlation induced by the selection bias, in this article, we propose a novel Clique-based Causal Feature Separation (CCFS) algorithm by explicitly incorporating the causal structure to identify causal features of outcome for OOD generalization. Specifically, the proposed CCFS algorithm identifies the largest clique in the learned causal skeleton. Theoretically, we guarantee that either the largest clique or the rest of the causal skeleton is exactly the set of all causal features of the outcome. Finally, we separate the causal features from the non-causal ones with a sample-reweighting decorrelator for OOD prediction. Extensive experiments validate the effectiveness of the proposed CCFS method on both causal feature identification and OOD generalization tasks.
引用
收藏
页码:1758 / 1772
页数:15
相关论文
共 50 条
  • [31] Tackling Domain Generalization for Out-of-Distribution Endoscopic Imaging
    Ali Teevno, Mansoor
    Ochoa-Ruiz, Gilberto
    Ali, Sharib
    MACHINE LEARNING IN MEDICAL IMAGING, PT II, MLMI 2024, 2025, 15242 : 43 - 52
  • [32] Probing out-of-distribution generalization in machine learning for materials
    Li, Kangming
    Rubungo, Andre Niyongabo
    Lei, Xiangyun
    Persaud, Daniel
    Choudhary, Kamal
    Decost, Brian
    Dieng, Adji Bousso
    Hattrick-Simpers, Jason
    COMMUNICATIONS MATERIALS, 2025, 6 (01)
  • [33] RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction
    Yu, Yemin
    Yuan, Luotian
    Wei, Ying
    Gao, Hanyu
    Wu, Fei
    Wang, Zhihua
    Ye, Xinhai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 374 - 382
  • [34] Can Subnetwork Structure be the Key to Out-of-Distribution Generalization?
    Zhang, Dinghuai
    Ahuja, Kartik
    Xu, Yilun
    Wang, Yisen
    Courville, Aaron
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [35] Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
    June, Yoon Gyo
    Park, Jaewoo
    Dong, Xingbo
    Park, Hojin
    Teoh, Andrew Beng Jin
    Camps, Octavia
    COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 396 - 413
  • [36] Learning Invariant Graph Representations for Out-of-Distribution Generalization
    Li, Haoyang
    Zhang, Ziwei
    Wang, Xin
    Zhu, Wenwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [37] Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization
    Rame, Alexandre
    Dancette, Corentin
    Cord, Matthieu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [38] Exploring feature sparsity for out-of-distribution detection
    Chen, Qichao
    Li, Kuan
    Chen, Zhiyuan
    Maul, Tomas
    Yin, Jianping
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] Hyperdimensional Feature Fusion for Out-of-Distribution Detection
    Wilson, Samuel
    Fischer, Tobias
    Sunderhauf, Niko
    Dayoub, Feras
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2643 - 2653
  • [40] Understanding the Feature Norm for Out-of-Distribution Detection
    Park, Jaewoo
    Chai, Jacky Chen Long
    Yoon, Jaeho
    Teoh, Andrew Beng Jin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1557 - 1567