Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning

Cited by: 25
Authors
Liu, Wei [1]
Luo, Zhiming [2]
Cai, Yuanzheng [3]
Yu, Ying [1]
Ke, Yang [4]
Marcato Junior, Jose [5]
Goncalves, Wesley Nunes [5]
Li, Jonathan [4]
Affiliations
[1] East China Jiaotong Univ, Sch Software, Nanchang 330013, Jiangxi, Peoples R China
[2] Xiamen Univ, Artificial Intelligence Dept, Xiamen 361005, Peoples R China
[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Peoples R China
[4] Univ Waterloo, Waterloo, ON N2L 3G1, Canada
[5] Univ Fed Mato Grosso do Sul, BR-79070900 Campo Grande, MS, Brazil
Funding
National Natural Science Foundation of China
Keywords
Semantic segmentation; Point cloud; Domain adaptation; Adversarial learning; Multi-modal learning; CLASSIFICATION; INFORMATION; AERIAL;
DOI
10.1016/j.isprsjprs.2021.04.012
Chinese Library Classification number
P9 [Physical geography]
Discipline classification code
0705; 070501
Abstract
Semantic segmentation of 3D point clouds plays an essential role in various applications, such as autonomous driving, robot control, and mapping. In general, a segmentation model trained on one source domain suffers a severe decline in performance when applied to a different target domain because of the cross-domain discrepancy. Various Unsupervised Domain Adaptation (UDA) approaches have been proposed to tackle this issue. However, most are designed only for uni-modal data and do not explore how to learn from multi-modal data containing both 2D images and 3D point clouds. To address this gap, we propose an Adversarial Unsupervised Domain Adaptation (AUDA) based 3D semantic segmentation framework. The proposed AUDA leverages the complementary information between 2D images and 3D point clouds through cross-modal learning and adversarial learning. In addition, data distributions in real scenarios are highly imbalanced. We therefore develop a simple and effective threshold-moving technique, applied during the final inference stage, to mitigate this issue. Finally, we conduct experiments on three unsupervised domain adaptation scenarios, i.e., Country-to-Country (USA -> Singapore), Day-to-Night, and Dataset-to-Dataset (A2D2 -> SemanticKITTI). The experimental results demonstrate the effectiveness of the proposed method, which significantly improves segmentation performance for rare classes. Code and trained models are available at https://github.com/weiliu-ai/auda.
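The abstract mentions a threshold-moving step at inference time but does not give its formula. As a rough illustration only, the sketch below shows one common generic form of threshold moving for multi-class prediction: dividing each predicted class probability by an estimated class prior before taking the argmax, so that rare classes need less probability mass to win. The function name `threshold_moving`, the prior values, and the example probabilities are all hypothetical and are not taken from the paper or its repository; the authors' actual technique may differ.

```python
import numpy as np


def threshold_moving(probs, class_priors, eps=1e-8):
    """Generic threshold moving (assumed form, not the paper's exact method).

    probs:        (N, C) array of per-point softmax probabilities.
    class_priors: (C,) array of class frequencies, e.g. estimated from
                  the labelled source-domain training set.
    Returns an (N,) array of predicted class indices.
    """
    probs = np.asarray(probs, dtype=np.float64)
    priors = np.asarray(class_priors, dtype=np.float64)
    # Down-weight frequent classes and up-weight rare ones.
    adjusted = probs / (priors + eps)
    # Renormalise so each row is again a probability distribution.
    adjusted /= adjusted.sum(axis=1, keepdims=True)
    return adjusted.argmax(axis=1)


# Hypothetical example: three classes, class 2 is rare.
priors = np.array([0.70, 0.25, 0.05])
probs = np.array([
    [0.50, 0.20, 0.30],  # moderately confident in the rare class
    [0.80, 0.15, 0.05],  # clearly the frequent class
])

plain = probs.argmax(axis=1)             # standard argmax prediction
moved = threshold_moving(probs, priors)  # prior-adjusted prediction
print(plain, moved)
```

In this toy case the first point switches from the frequent class to the rare class after adjustment, while the second point keeps its original label. This is the kind of behaviour that can help rare classes, at the cost of more false positives for them.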
Pages: 211-221
Number of pages: 11