Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization

被引:0
|
作者
Ahuja, Kartik [1 ]
Caballero, Ethan [1 ]
Zhang, Dinghuai [1 ]
Gagnon-Audet, Jean-Christophe [1 ]
Bengio, Yoshua [1 ]
Mitliagkas, Ioannis [1 ]
Rish, Irina [1 ]
机构
[1] Univ Montreal, Quebec AI Inst, Mila, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The invariance principle from causality is at the heart of notable approaches such as invariant risk minimization (IRM) that seek to address out-of-distribution (OOD) generalization failures. Despite the promising theory, invariance principle-based approaches fail in common classification tasks, where invariant (causal) features capture all the information about the label. Are these failures due to the methods failing to capture the invariance? Or is the invariance principle itself insufficient? To answer these questions, we revisit the fundamental assumptions in linear regression tasks, where invariance-based approaches were shown to provably generalize OOD. In contrast to the linear regression tasks, we show that for linear classification tasks we need much stronger restrictions on the distribution shifts, or otherwise OOD generalization is impossible. Furthermore, even with appropriate restrictions on distribution shifts in place, we show that the invariance principle alone is insufficient. We prove that a form of the information bottleneck constraint along with invariance helps address key failures when invariant features capture all the information about the label and also retains the existing success when they do not. We propose an approach that incorporates both of these principles and demonstrate its effectiveness in several experiments.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Tackling Domain Generalization for Out-of-Distribution Endoscopic Imaging
    Ali Teevno, Mansoor
    Ochoa-Ruiz, Gilberto
    Ali, Sharib
    MACHINE LEARNING IN MEDICAL IMAGING, PT II, MLMI 2024, 2025, 15242 : 43 - 52
  • [32] RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction
    Yu, Yemin
    Yuan, Luotian
    Wei, Ying
    Gao, Hanyu
    Wu, Fei
    Wang, Zhihua
    Ye, Xinhai
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 1, 2024, : 374 - 382
  • [33] Deep Relevant Feature Focusing for Out-of-Distribution Generalization
    Wang, Fawu
    Zhang, Kang
    Liu, Zhengyu
    Yuan, Xia
    Zhao, Chunxia
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 245 - 253
  • [34] Can Subnetwork Structure be the Key to Out-of-Distribution Generalization?
    Zhang, Dinghuai
    Ahuja, Kartik
    Xu, Yilun
    Wang, Yisen
    Courville, Aaron
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [35] Understanding and Improving Feature Learning for Out-of-Distribution Generalization
    Chen, Yongqiang
    Huang, Wei
    Zhou, Kaiwen
    Bian, Yatao
    Han, Bo
    Cheng, James
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [36] Face Reconstruction Transfer Attack as Out-of-Distribution Generalization
    June, Yoon Gyo
    Park, Jaewoo
    Dong, Xingbo
    Park, Hojin
    Teoh, Andrew Beng Jin
    Camps, Octavia
    COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 396 - 413
  • [37] Learning Invariant Graph Representations for Out-of-Distribution Generalization
    Li, Haoyang
    Zhang, Ziwei
    Wang, Xin
    Zhu, Wenwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [38] Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization
    Rame, Alexandre
    Dancette, Corentin
    Cord, Matthieu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [39] Deep representation learning for domain generalization with information bottleneck principle
    Zhang, Jiao
    Zhang, Xu-Yao
    Wang, Chuang
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2023, 143
  • [40] Supervision Adaptation Balancing In-Distribution Generalization and Out-of-Distribution Detection
    Zhao, Zhilin
    Cao, Longbing
    Lin, Kun-Yu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15743 - 15758