Latent Feature Disentanglement for Visual Domain Generalization

被引:4
|
作者
Gholami, Behnam [1 ]
El-Khamy, Mostafa [1 ,2 ]
Song, Kee-Bong [1 ]
机构
[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA
[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
关键词
Domain generalization; latent feature; feature disentanglement; image to image translation; StarGAN; ADVERSARIAL NETWORKS;
D O I
10.1109/TIP.2023.3321511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite remarkable success in a variety of computer vision applications, it is well-known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalizes imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations like rotation, brightness change, etc. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking the advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kind of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variations, (2) generating perturbed images by changing such factors composing the representations of the images, (3) enforcing the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.
引用
收藏
页码:5751 / 5763
页数:13
相关论文
共 50 条
  • [41] Enhancing Evolving Domain Generalization through Dynamic Latent Representations
    Xie, Binghui
    Chen, Yongqiang
    Wang, Jiaqi
    Zhou, Kaiwen
    Han, Bo
    Meng, Wei
    Cheng, James
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 16040 - 16048
  • [42] Latent Domains Modeling for Visual Domain Adaptation
    Xiong, Caiming
    McCloskey, Scott
    Hsieh, Shao-Hang
    Corso, Jason J.
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2860 - 2866
  • [43] Membership Feature Disentanglement Network
    Ha, Heonseok
    Jang, Jaehee
    Jeong, Yonghyun
    Yoon, Sungroh
    ASIA CCS'22: PROCEEDINGS OF THE 2022 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2022, : 364 - 376
  • [44] FSN: Feature Shift Network for Load-Domain (LD) Domain Generalization
    Chen, Heng
    Zhao, Erkang
    Jia, Yunpeng
    Shi, Lei
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [45] Multi-view Domain Generalization for Visual Recognition
    Niu, Li
    Li, Wen
    Xu, Dong
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4193 - 4201
  • [46] Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation
    Lin, Jianxin
    Chen, Zhibo
    Xia, Yingce
    Liu, Sen
    Qin, Tao
    Luo, Jiebo
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (04) : 1254 - 1266
  • [47] Visual representations with texts domain generalization for semantic segmentation
    Yue, Wanlin
    Zhou, Zhiheng
    Cao, Yinglie
    Wu, Weikang
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30069 - 30079
  • [48] Visual representations with texts domain generalization for semantic segmentation
    Wanlin Yue
    Zhiheng Zhou
    Yinglie Cao
    Weikang Wu
    Applied Intelligence, 2023, 53 : 30069 - 30079
  • [49] A novel domain feature disentanglement method for multi-target cross-domain mechanical fault diagnosis
    Liu, Zhenyu
    Zheng, Haowen
    Liu, Hui
    Duan, Guifang
    Tan, Jianrong
    ISA TRANSACTIONS, 2025, 158 : 512 - 524
  • [50] Towards prognostic generalization: a domain conditional invariance and specificity disentanglement network for remaining useful life prediction
    Xia, Pengcheng
    Huang, Yixiang
    Qin, Chengjin
    Liu, Chengliang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (07) : 3459 - 3477