Augmentation-based Domain Generalization for Semantic Segmentation

被引:1
|
作者
Schwonberg, Manuel [1 ,2 ,3 ]
El Bouazati, Fadoua [2 ,3 ]
Schmidt, Nico M. [1 ]
Gottschalk, Hanno [2 ,3 ]
机构
[1] CARIAD SE, Wolfsburg, Lower Saxony, Germany
[2] Univ Wuppertal, IZMD, Wuppertal, Germany
[3] Univ Wuppertal, Sch Math & Nat Sci, Wuppertal, Germany
关键词
D O I
10.1109/IV55152.2023.10186752
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised Domain Adaptation (UDA) and domain generalization (DG) are two research areas that aim to tackle the lack of generalization of Deep Neural Networks (DNNs) towards unseen domains. While UDA methods have access to unlabeled target images, domain generalization does not involve any target data and only learns generalized features from a source domain. Image-style randomization or augmentation is a popular approach to improve network generalization without access to the target domain. Complex methods are often proposed that disregard the potential of simple image augmentations for out-of-domain generalization. For this reason, we systematically study the in- and out-of-domain generalization capabilities of simple, rule-based image augmentations like blur, noise, color jitter and many more. Based on a full factorial design of experiment design we provide a systematic statistical evaluation of augmentations and their interactions. Our analysis provides both, expected and unexpected, outcomes. Expected, because our experiments confirm the common scientific standard that combination of multiple different augmentations outperforms single augmentations. Unexpected, because combined augmentations perform competitive to state-of-the-art domain generalization approaches, while being significantly simpler and without training overhead. On the challenging synthetic-to-real domain shift between Synthia and Cityscapes we reach 39.5% mIoU compared to 40.9% mIoU of the best previous work. When additionally employing the recent vision transformer architecture DAFormer we outperform these benchmarks with a performance of 44.2% mIoU.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] SINGLE-DOMAIN GENERALIZATION FOR SEMANTIC SEGMENTATION VIA DUAL-LEVEL DOMAIN AUGMENTATION
    Chang, Shu-Jung
    Lu, Chen-Yu
    Huang, Pei-Kai
    Hsu, Chiou-Ting
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 2335 - 2339
  • [2] Domain generalization for semantic segmentation: a survey
    Rafi, Taki Hasan
    Mahjabin, Ratul
    Ghosh, Emon
    Ko, Young-Woong
    Lee, Jeong-Gun
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
  • [3] ASM: Augmentation-based Semantic Mechanism on Abstractive Summarization
    Ren, Weidong
    Zhou, Hao
    Liu, Gongshen
    Huan, Fei
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Semantic Data Augmentation based Distance Metric Learning for Domain Generalization
    Wang, Mengzhu
    Yuan, Jianlong
    Qian, Qi
    Wang, Zhibin
    Li, Hao
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3214 - 3223
  • [5] Mind the Label Shift of Augmentation-based Graph OOD Generalization
    Yu, Junchi
    Liang, Jian
    He, Ran
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11620 - 11630
  • [6] Single Domain Generalization for LiDAR Semantic Segmentation
    Kim, Hyeonseong
    Kang, Yoonsu
    Oh, Changgyoon
    Yoon, Kuk-Jin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17587 - 17598
  • [7] Instant Domain Augmentation for LiDAR Semantic Segmentation
    Ryu, Kwonyoung
    Hwang, Soonmin
    Park, Jaesik
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9350 - 9360
  • [8] Visual representations with texts domain generalization for semantic segmentation
    Yue, Wanlin
    Zhou, Zhiheng
    Cao, Yinglie
    Wu, Weikang
    [J]. APPLIED INTELLIGENCE, 2023, 53 (24) : 30069 - 30079
  • [9] A Study of RobustNet, a Domain Generalization Method for Semantic Segmentation
    Bou, Xavier
    [J]. IMAGE PROCESSING ON LINE, 2022, 12 : 469 - 479
  • [10] Visual representations with texts domain generalization for semantic segmentation
    Wanlin Yue
    Zhiheng Zhou
    Yinglie Cao
    Weikang Wu
    [J]. Applied Intelligence, 2023, 53 : 30069 - 30079