Latent Feature Disentanglement for Visual Domain Generalization

被引：4

作者：

Gholami, Behnam ^{[1
]}

El-Khamy, Mostafa ^{[1
,2
]}

Song, Kee-Bong ^{[1
]}

机构：

[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA

[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

关键词：

Domain generalization; latent feature; feature disentanglement; image to image translation; StarGAN; ADVERSARIAL NETWORKS;

D O I：

10.1109/TIP.2023.3321511

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Despite remarkable success in a variety of computer vision applications, it is well-known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalizes imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations like rotation, brightness change, etc. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking the advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kind of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variations, (2) generating perturbed images by changing such factors composing the representations of the images, (3) enforcing the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.

引用

页码：5751 / 5763

页数：13

共 50 条

[31] Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement
Liu, Dongnan
Zhang, Chaoyi
Song, Yang
Huang, Heng
Wang, Chenyu
Barnett, Michael
Cai, Weidong
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1333 - 1344
[32] Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization
Jeon, Seogkyu
Hong, Kibeom
Lee, Pilhyeon
Lee, Jewook
Byun, Hyeran
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 22 - 31
[33] DISENTANGLEMENT AND GENERALIZATION UNDER CORRELATION SHIFTS
Funke, Christina M.
Vicol, Paul
Wang, Kuan-Chieh
Kuemmerer, Matthias
Zemel, Richard
Bethge, Matthias
CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
[34] GMFAD: Towards Generalized Visual Recognition via Multilayer Feature Alignment and Disentanglement
Li, Haoliang
Wang, Shiqi
Wan, Renjie
Kot, Alex C.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1289 - 1303
[35] Universal Heterogeneous Face Analysis via Multi-Domain Feature Disentanglement
Liu, Decheng
Gao, Xinbo
Peng, Chunlei
Wang, Nannan
Li, Jie
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 735 - 747
[36] TACIT: A Target -Agnostic Feature Disentanglement Framework for Cross -Domain Text Classification
Song, Rui
Giunchiglia, Fausto
Li, Yingji
Tian, Mingjie
Xu, Hao
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18999 - 19007
[37] Feature-Based Style Randomization for Domain Generalization
Wang, Yue
Qi, Lei
Shi, Yinghuan
Gao, Yang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) : 5495 - 5509
[38] Structure-aware feature stylization for domain generalization
Cheraghalikhani, Milad
Noori, Mehrdad
Osowiechi, David
Hakim, Gustavo A. Vargas
Ben Ayed, Ismail
Desrosiers, Christian
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 244
[39] Domain Generalization Via Encoding and Resampling in a Unified Latent Space
Liu, Yajing
Xiong, Zhiwei
Li, Ya
Tian, Xinmei
Zha, Zheng-Jun
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 126 - 139
[40] Deep Causal Disentanglement Network With Domain Generalization for Cross-Machine Bearing Fault Diagnosis
Guo, Chaochao
Sun, Youchao
Yu, Rourou
Ren, Xinxin
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74

← 1 2 3 4 5 →