Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization

被引:0
|
作者
Ahuja, Kartik [1 ]
Caballero, Ethan [1 ]
Zhang, Dinghuai [1 ]
Gagnon-Audet, Jean-Christophe [1 ]
Bengio, Yoshua [1 ]
Mitliagkas, Ioannis [1 ]
Rish, Irina [1 ]
机构
[1] Univ Montreal, Quebec AI Inst, Mila, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The invariance principle from causality is at the heart of notable approaches such as invariant risk minimization (IRM) that seek to address out-of-distribution (OOD) generalization failures. Despite the promising theory, invariance principle-based approaches fail in common classification tasks, where invariant (causal) features capture all the information about the label. Are these failures due to the methods failing to capture the invariance? Or is the invariance principle itself insufficient? To answer these questions, we revisit the fundamental assumptions in linear regression tasks, where invariance-based approaches were shown to provably generalize OOD. In contrast to the linear regression tasks, we show that for linear classification tasks we need much stronger restrictions on the distribution shifts, or otherwise OOD generalization is impossible. Furthermore, even with appropriate restrictions on distribution shifts in place, we show that the invariance principle alone is insufficient. We prove that a form of the information bottleneck constraint along with invariance helps address key failures when invariant features capture all the information about the label and also retains the existing success when they do not. We propose an approach that incorporates both of these principles and demonstrate its effectiveness in several experiments.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data
    Ramachandran, Sai Niranjan
    Mukhopadhyay, Rudrabha
    Agarwal, Madhav
    Jawahar, C. V.
    Namboodiri, Vinay
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14767 - 14775
  • [42] Out-of-Distribution Generalization by Neural-Symbolic Joint Training
    Liu, Anji
    Xu, Hongming
    Van den Broeck, Guy
    Liang, Yitao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 12252 - 12259
  • [43] An Out-of-Distribution Generalization Framework Based on Variational Backdoor Adjustment
    Su, Hang
    Wang, Wei
    MATHEMATICS, 2024, 12 (01)
  • [44] Targeted Data-driven Regularization for Out-of-Distribution Generalization
    Kamani, Mohammad Mahdi
    Farhang, Sadegh
    Mahdavi, Mehrdad
    Wang, James Z.
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 882 - 891
  • [45] The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
    Hendrycks, Dan
    Basart, Steven
    Mu, Norman
    Kadavath, Saurav
    Wang, Frank
    Dorundo, Evan
    Desai, Rahul
    Zhu, Tyler
    Parajuli, Samyak
    Guo, Mike
    Song, Dawn
    Steinhardt, Jacob
    Gilmer, Justin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8320 - 8329
  • [46] Improving Out-of-Distribution Generalization by Adversarial Training with Structured Priors
    Wang, Qixun
    Wang, Yifei
    Zhu, Hong
    Wang, Yisen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [47] Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
    Ada, Suzan Ece
    Oztop, Erhan
    Ugur, Emre
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3116 - 3123
  • [48] A Multimodal AI System for Out-of-Distribution Generalization of Seizure Identification
    Yang, Yikai
    Nhan Duy Truong
    Eshraghian, Jason K.
    Maher, Christina
    Nikpour, Armin
    Kavehei, Omid
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (07) : 3529 - 3538
  • [49] Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs
    Chen, Yongqiang
    Zhang, Yonggang
    Bian, Yatao
    Yang, Han
    Ma, Kaili
    Xie, Binghui
    Liu, Tongliang
    Han, Bo
    Cheng, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [50] SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
    Nguyen, Bac
    Uhlich, Stefan
    Cardinaux, Fabien
    Mauch, Lukas
    Edraki, Marzieh
    Courville, Aaron
    COMPUTER VISION - ECCV 2024, PT LXIX, 2025, 15127 : 138 - 154