Latent Feature Disentanglement for Visual Domain Generalization

被引:4
|
作者
Gholami, Behnam [1 ]
El-Khamy, Mostafa [1 ,2 ]
Song, Kee-Bong [1 ]
机构
[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA
[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
关键词
Domain generalization; latent feature; feature disentanglement; image to image translation; StarGAN; ADVERSARIAL NETWORKS;
D O I
10.1109/TIP.2023.3321511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite remarkable success in a variety of computer vision applications, it is well-known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalizes imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations like rotation, brightness change, etc. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking the advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kind of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variations, (2) generating perturbed images by changing such factors composing the representations of the images, (3) enforcing the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.
引用
收藏
页码:5751 / 5763
页数:13
相关论文
共 50 条
  • [31] Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement
    Liu, Dongnan
    Zhang, Chaoyi
    Song, Yang
    Huang, Heng
    Wang, Chenyu
    Barnett, Michael
    Cai, Weidong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1333 - 1344
  • [32] Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization
    Jeon, Seogkyu
    Hong, Kibeom
    Lee, Pilhyeon
    Lee, Jewook
    Byun, Hyeran
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 22 - 31
  • [33] DISENTANGLEMENT AND GENERALIZATION UNDER CORRELATION SHIFTS
    Funke, Christina M.
    Vicol, Paul
    Wang, Kuan-Chieh
    Kuemmerer, Matthias
    Zemel, Richard
    Bethge, Matthias
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [34] GMFAD: Towards Generalized Visual Recognition via Multilayer Feature Alignment and Disentanglement
    Li, Haoliang
    Wang, Shiqi
    Wan, Renjie
    Kot, Alex C.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1289 - 1303
  • [35] Universal Heterogeneous Face Analysis via Multi-Domain Feature Disentanglement
    Liu, Decheng
    Gao, Xinbo
    Peng, Chunlei
    Wang, Nannan
    Li, Jie
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 735 - 747
  • [36] TACIT: A Target -Agnostic Feature Disentanglement Framework for Cross -Domain Text Classification
    Song, Rui
    Giunchiglia, Fausto
    Li, Yingji
    Tian, Mingjie
    Xu, Hao
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18999 - 19007
  • [37] Feature-Based Style Randomization for Domain Generalization
    Wang, Yue
    Qi, Lei
    Shi, Yinghuan
    Gao, Yang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5495 - 5509
  • [38] Structure-aware feature stylization for domain generalization
    Cheraghalikhani, Milad
    Noori, Mehrdad
    Osowiechi, David
    Hakim, Gustavo A. Vargas
    Ben Ayed, Ismail
    Desrosiers, Christian
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 244
  • [39] Domain Generalization Via Encoding and Resampling in a Unified Latent Space
    Liu, Yajing
    Xiong, Zhiwei
    Li, Ya
    Tian, Xinmei
    Zha, Zheng-Jun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 126 - 139
  • [40] Deep Causal Disentanglement Network With Domain Generalization for Cross-Machine Bearing Fault Diagnosis
    Guo, Chaochao
    Sun, Youchao
    Yu, Rourou
    Ren, Xinxin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74