DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation

Cited by: 0
Authors
Bai, Haoyue [1, 2]
Sun, Rui [2]
Hong, Lanqing [2]
Zhou, Fengwei [2]
Ye, Nanyang [3]
Ye, Han-Jia [4]
Chan, S-H Gary [1]
Li, Zhenguo [2]
Affiliations
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Huawei Noah's Ark Lab, Hong Kong, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[4] Nanjing Univ, Nanjing, Jiangsu, Peoples R China
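Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract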
While deep learning handles independent and identically distributed (IID) data well, it often struggles with out-of-distribution (OoD) generalization, where the test data come from a distribution different from the training one. Designing a general OoD generalization framework for a wide range of applications is challenging, mainly because of the different kinds of distribution shift found in the real world, such as shift across domains or the extrapolation of correlation. Most previous approaches address only one specific type of distribution shift, leading to unsatisfactory performance when applied to various OoD benchmarks. In this work, we propose DecAug, a novel decomposed feature representation and semantic augmentation approach for OoD generalization. Specifically, DecAug disentangles category-related and context-related features by orthogonalizing the two gradients (with respect to intermediate features) of the losses for predicting category and context labels. Category-related features contain causal information about the target object, while context-related features cause distribution shifts between training and test data. Furthermore, we perform gradient-based augmentation on context-related features to improve the robustness of the learned representations. Experimental results show that DecAug outperforms other state-of-the-art methods on various OoD datasets and is among the very few methods that can deal with different types of OoD generalization challenges.
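The two ideas named in the abstract, orthogonalizing the category and context gradients and perturbing features along the context gradient, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the linear heads `W_cat` and `W_ctx`, the analytic cross-entropy gradient, the squared-cosine penalty, the hard projection in `remove_component`, and the step size are all assumptions chosen to keep the example self-contained.

```python
import numpy as np

def softmax(logits):
    shifted = logits - logits.max()      # subtract max for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum()

def ce_grad_wrt_feature(z, W, label):
    """Gradient of softmax cross-entropy for a linear head (W @ z) w.r.t. z."""
    probs = softmax(W @ z)
    probs[label] -= 1.0                  # dL/dlogits = probs - one_hot(label)
    return W.T @ probs                   # chain rule back to the feature z

def orthogonality_penalty(g_cat, g_ctx, eps=1e-12):
    """Squared cosine similarity; near zero when the gradients are orthogonal."""
    cos = g_cat @ g_ctx / (np.linalg.norm(g_cat) * np.linalg.norm(g_ctx) + eps)
    return cos ** 2

def remove_component(g, direction, eps=1e-12):
    """Project g onto the subspace orthogonal to `direction` (an assumption,
    used here in place of whatever training objective the paper employs)."""
    return g - (g @ direction) / (direction @ direction + eps) * direction

def augment_along_gradient(z, g_ctx, step=0.1, eps=1e-12):
    """Perturb the feature z by `step` along the normalized context gradient."""
    return z + step * g_ctx / (np.linalg.norm(g_ctx) + eps)

# Hypothetical toy setup: one 8-dim feature, 3 categories, 4 contexts.
rng = np.random.default_rng(0)
z = rng.normal(size=8)
W_cat = rng.normal(size=(3, 8))          # hypothetical category head
W_ctx = rng.normal(size=(4, 8))          # hypothetical context head

g_cat = ce_grad_wrt_feature(z, W_cat, label=1)
g_ctx = ce_grad_wrt_feature(z, W_ctx, label=2)

g_ctx_orth = remove_component(g_ctx, g_cat)   # context direction orthogonal to g_cat
z_aug = augment_along_gradient(z, g_ctx_orth) # augmented feature
```

In a trained model, a penalty such as `orthogonality_penalty` would typically be added to the loss so that the network learns features whose two gradients are orthogonal; the explicit projection here only makes the geometric idea visible on a single feature vector.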
Pages: 6705-6713
Number of pages: 9