Latent Feature Disentanglement for Visual Domain Generalization

被引:4
|
作者
Gholami, Behnam [1 ]
El-Khamy, Mostafa [1 ,2 ]
Song, Kee-Bong [1 ]
机构
[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA
[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
关键词
Domain generalization; latent feature; feature disentanglement; image to image translation; StarGAN; ADVERSARIAL NETWORKS;
D O I
10.1109/TIP.2023.3321511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite remarkable success in a variety of computer vision applications, it is well-known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalizes imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations like rotation, brightness change, etc. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking the advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kind of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variations, (2) generating perturbed images by changing such factors composing the representations of the images, (3) enforcing the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.
引用
收藏
页码:5751 / 5763
页数:13
相关论文
共 50 条
  • [21] Causal Disentanglement Domain Generalization for time-series signal fault diagnosis
    Jia, Linshan
    Chow, Tommy W. S.
    Yuan, Yixuan
    NEURAL NETWORKS, 2024, 172
  • [22] Feature filtering and feature decoupling based domain generalization model
    Liu K.
    Wang D.
    Wang J.
    Chen H.
    Liu W.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (03): : 459 - 467
  • [23] Coarse-to-fine domain adaptation object detection with feature disentanglement
    Li, Jiafeng
    Zhi, Mengxun
    Zheng, Yongyu
    Zhuo, Li
    Zhang, Jing
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,
  • [24] Feature Diversification and Adaptation for Federated Domain Generalization
    Yang, Seunghan
    Choi, Seokeon
    Park, Hyunsin
    Choi, Sungha
    Chang, Simyung
    Yuri, Sungrack
    COMPUTER VISION - ECCV 2024, PT LXXII, 2025, 15130 : 52 - 70
  • [25] Explicit feature disentanglement for visual place recognition across appearance changes
    Tang, Li
    Wang, Yue
    Tan, Qimeng
    Xiong, Rong
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2021, 18 (06)
  • [26] Domain Generalization via Feature Variation Decorrelation
    Liu, Chang
    Wang, Lichen
    Li, Kai
    Fu, Yun
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1683 - 1691
  • [27] Cross Contrasting Feature Perturbation for Domain Generalization
    Li, Chenming
    Zhang, Daoan
    Huang, Wenjian
    Zhang, Jianguo
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1327 - 1337
  • [28] Domain Generalization Using a Mixture of Multiple Latent Domains
    Matsuura, Toshihiko
    Harada, Tatsuya
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11749 - 11756
  • [29] Disentanglement via Latent Quantization
    Hsu, Kyle
    Dorrell, Will
    Whittington, James C. R.
    Wu, Jiajun
    Finn, Chelsea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [30] Grounding Visual Representations with Texts for Domain Generalization
    Min, Seonwoo
    Park, Nokyung
    Kim, Siwon
    Park, Seunghyun
    Kim, Jinkyu
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 37 - 53