Multi-modal unsupervised domain adaptation for semantic image segmentation

被引：9

作者：

Hu, Sijie ^{[1
]}

Bonardi, Fabien ^{[1
]}

Bouchafa, Samia ^{[1
]}

Sidibe, Desire ^{[1
]}

机构：

[1] Univ Paris Saclay, Univ Evry, IBISC, F-91020 Evry Courcouronnes, France

来源：

PATTERN RECOGNITION | 2023年 / 137卷

关键词：

Unsupervised domain adaptation; Multi -modal learning; Self -supervised learning; Knowledge transfer; Semantic segmentation;

D O I：

10.1016/j.patcog.2022.109299

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel multi-modal-based Unsupervised Domain Adaptation (UDA) method for semantic segmentation. Recently, depth has proven to be a relevent property for providing geometric cues to en-hance the RGB representation. However, existing UDA methods solely process RGB images or additionally cultivate depth-awareness with an auxiliary depth estimation task. We argue that geometric cues that are crucial to semantic segmentation, such as local shape and relative position, are challenging to recover from an auxiliary depth estimation task with mere color (RGB) information. In this paper, we propose a novel multi-modal UDA method named MMADT, which relies on both RGB and depth images as input. In particular, we design a Depth Fusion Block (DFB) to recalibrate depth information and leverage Depth Ad-versarial Training (DAT) to bridge the depth discrepancy between the source and target domain. Besides, we propose a self-supervised multi-modal depth estimation assistant network named Geo-Assistant (GA) to align the feature space of RGB and depth and shape the sensitivity of our MMADT to depth infor-mation. We experimentally observed significant performance improvement in multiple synthetic to real adaptation benchmarks, i.e., SYNTHIA-to-Cityscapes, GTA5-to-Cityscapes and SELMA-to-Cityscapes. Addi-tionally, our multi-modal UDA scheme is easy to port to other UDA methods with a consistent perfor-mance boost. (c) 2023 Elsevier Ltd. All rights reserved.

引用

页数：12

共 50 条

[1] Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning
Liu, Wei
Luo, Zhiming
Cai, Yuanzheng
Yu, Ying
Ke, Yang
Marcato Junior, Jose
Goncalves, Wesley Nunes
Li, Jonathan
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 211 - 221
[2] Unsupervised domain adaptation multi-level adversarial network for semantic segmentation based on multi-modal features
Wang, Zeyu
Bu, Shuhui
Huang, Wei
Zheng, Yuanpan
Wu, Qinggang
Chang, Huawen
Zhang, Xu
[J]. Tongxin Xuebao/Journal on Communications, 2022, 43 (12): : 157 - 171
[3] Multi-modal semantic image segmentation
Pemasiri, Akila
Kien Nguyen
Sridharan, Sridha
Fookes, Clinton
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
[4] Boosting Multi-Modal Unsupervised Domain Adaptation for LiDAR Semantic Segmentation by Self-Supervised Depth Completion
Cardace, Adriano
Conti, Andrea
Ramirez, Pierluigi Zama
Spezialetti, Riccardo
Salti, Samuele
Stefano, Luigi Di
[J]. IEEE ACCESS, 2023, 11 : 85155 - 85164
[5] An Unsupervised Domain Adaptation Method for Multi-Modal Remote Sensing Image Classification
Liu, Wei
Qin, Rongjun
Su, Fulin
Hu, Kun
[J]. 2018 26TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS 2018), 2018,
[6] Adversarial Cross-modal Domain Adaptation for Multi-modal Semantic Segmentation in Autonomous Driving
Shi, Mengqi
Cao, Haozhi
Xie, Lihua
Yang, Jianfei
[J]. 2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 850 - 855
[7] A multi-grained unsupervised domain adaptation approach for semantic segmentation
Li, Luyang
Ma, Tai
Lu, Yue
Li, Qingli
He, Lianghua
Wen, Ying
[J]. PATTERN RECOGNITION, 2023, 144
[8] Unsupervised Domain Adaptation in Semantic Segmentation: A Review
Toldo, Marco
Maracani, Andrea
Michieli, Umberto
Zanuttigh, Pietro
[J]. TECHNOLOGIES, 2020, 8 (02)
[9] Multichannel Semantic Segmentation with Unsupervised Domain Adaptation
Watanabe, Kohei
Saito, Kuniaki
Ushiku, Yoshitaka
Harada, Tatsuya
[J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT V, 2019, 11133 : 600 - 616
[10] Geometric Unsupervised Domain Adaptation for Semantic Segmentation
Guizilini, Vitor
Li, Jie
Ambrus, Rares
Gaidon, Adrien
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8517 - 8527

← 1 2 3 4 5 →