Multi-modal unsupervised domain adaptation for semantic image segmentation

Cited by: 9
Authors
Hu, Sijie [1]
Bonardi, Fabien [1]
Bouchafa, Samia [1]
Sidibe, Desire [1]
Affiliations
[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France
Keywords
Unsupervised domain adaptation; Multi-modal learning; Self-supervised learning; Knowledge transfer; Semantic segmentation
DOI
10.1016/j.patcog.2022.109299
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevant property for providing geometric cues to enhance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Adversarial Training (DAT) to bridge the depth discrepancy between the source and target domains. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth information. We experimentally observed significant performance improvements in multiple synthetic-to-real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Additionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent performance boost. (c) 2023 Elsevier Ltd. All rights reserved.
Pages: 12
Related papers
50 in total
  • [21] Consistency Regularization for Unsupervised Domain Adaptation in Semantic Segmentation
    Scherer, Sebastian
    Brehm, Stephan
    Lienhart, Rainer
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 500 - 511
  • [22] Unsupervised Domain Adaptation for Semantic Segmentation of Urban Scenes
    Biasetton, Matteo
    Michieli, Umberto
    Agresti, Gianluca
    Zanuttigh, Pietro
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1211 - 1220
  • [23] Simplified unsupervised image translation for semantic segmentation adaptation
    Li, Rui
    Cao, Wenming
    Jiao, Qianfen
    Wu, Si
    Wong, Hau-San
    [J]. PATTERN RECOGNITION, 2020, 105
  • [24] CMT: Cross Mean Teacher Unsupervised Domain Adaptation for VHR Image Semantic Segmentation
    Yan, Liang
    Fan, Bin
    Xiang, Shiming
    Pan, Chunhong
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [25] Semantic Consistent Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
    Zeng, Guodong
    Lerch, Till D.
    Schmaranzer, Florian
    Zheng, Guoyan
    Burger, Juergen
    Gerber, Kate
    Tannast, Moritz
    Siebenrock, Klaus
    Gerber, Nicolas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 201 - 210
  • [26] Differentiated Learning for Multi-Modal Domain Adaptation
    Lv, Jianming
    Liu, Kaijie
    He, Shengfeng
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1322 - 1330
  • [27] MULTI-MODAL SEMANTIC MESH SEGMENTATION IN URBAN SCENES
    Laupheimer, Dominik
    Haala, Norbert
    [J]. XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 5-2 : 267 - 274
  • [28] A Multi-task Unsupervised Domain Adaptation Network for Medical Image Segmentation
    Shi, Yuejing
    Zhu, Fan
    Peng, Yan
    Ye, Zhen
    Zhou, Chaozheng
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND INTELLIGENT CONTROL (IPIC 2021), 2021, 11928
  • [29] Multi-modal brain tumor segmentation via conditional synthesis with Fourier domain adaptation
    Al Khalil, Yasmina
    Ayaz, Aymen
    Lorenz, Cristian
    Weese, Juergen
    Pluim, Josien
    Breeuwer, Marcel
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2024, 112
  • [30] Unpaired multi-modal tumor segmentation with structure adaptation
    Zhou, Pei
    Chen, Houjin
    Li, Yanfeng
    Peng, Yahui
    [J]. APPLIED INTELLIGENCE, 2023, 53 (04) : 3639 - 3651