Self-Ensembling GAN for Cross-Domain Semantic Segmentation

被引:3
|
作者
Xu, Yonghao [1 ,2 ]
He, Fengxiang [3 ]
Du, Bo [4 ,5 ]
Tao, Dacheng [3 ]
Zhang, Liangpei [1 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Inst Adv Res Artificial Intelligence IARAI, A-1030 Vienna, Austria
[3] JDcom Inc, JD Explore Acad, Beijing, Peoples R China
[4] Wuhan Univ, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Wuhan 430079, Peoples R China
[5] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan 430079, Peoples R China
关键词
Deep learning; domain adaptation; semantic segmentation; adversarial learning; DEEP NEURAL-NETWORK;
D O I
10.1109/TMM.2022.3229976
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNNs) have greatly contributed to the performance gains in semantic segmentation. Nevertheless, training DNNs generally requires large amounts of pixel-level labeled data, which is expensive and time-consuming to collect in practice. To mitigate the annotation burden, this paper proposes a self-ensembling generative adversarial network (SE-GAN) exploiting cross-domain data for semantic segmentation. In SE-GAN, a teacher network and a student network constitute a self-ensembling model for generating semantic segmentation maps, which together with a discriminator, forms a GAN. Despite its simplicity, we find SE-GAN can significantly boost the performance of adversarial training and enhance the stability of the model, the latter of which is a common barrier shared by most adversarial training-based methods. We theoretically analyze SE-GAN and provide an O(1/root N) generalization bound (N is the training sample size), which suggests controlling the discriminator's hypothesis complexity to enhance the generalizability. Accordingly, we choose a simple network as the discriminator. Extensive and systematic experiments in two standard settings demonstrate that the proposed method significantly outperforms current state-of-the-art approaches.
引用
收藏
页码:7837 / 7850
页数:14
相关论文
共 50 条
  • [21] CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic Segmentation
    Wang, Kaihong
    Kim, Donghyun
    Feris, Rogerio
    Betke, Margrit
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11485 - 11495
  • [22] Ensembling Transformers for Cross-domain Automatic Term Extraction
    Hanh Thi Hong Tran
    Martinc, Matej
    Pelicon, Andraz
    Doucet, Antoine
    Pollak, Senja
    FROM BORN-PHYSICAL TO BORN-VIRTUAL: AUGMENTING INTELLIGENCE IN DIGITAL LIBRARIES, ICADL 2022, 2022, 13636 : 90 - 100
  • [23] Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation
    Chen, Yiyang
    Zhao, Shanshan
    Ding, Changxing
    Tang, Liyao
    Wang, Chaoyue
    Tao, Dacheng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3866 - 3875
  • [24] Transformation-Consistent Self-Ensembling Model for Semisupervised Medical Image Segmentation
    Li, Xiaomeng
    Yu, Lequan
    Chen, Hao
    Fu, Chi-Wing
    Xing, Lei
    Heng, Pheng-Ann
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 523 - 534
  • [25] Semantic-aware short path adversarial training for cross-domain semantic segmentation
    Shan, Yuhu
    Chew, Chee Meng
    Lu, Wen Feng
    NEUROCOMPUTING, 2020, 380 : 125 - 132
  • [26] Uncertainty-aware consistency regularization for cross-domain semantic segmentation
    Zhou, Qianyu
    Feng, Zhengyang
    Gu, Qiqi
    Cheng, Guangliang
    Lu, Xuequan
    Shi, Jianping
    Ma, Lizhuang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 221
  • [27] A Cross-Domain Coupling Network for Semantic Segmentation of Remote Sensing Images
    Li, Xin
    Xu, Feng
    Tao, Feifei
    Tong, Yao
    Gao, Hongmin
    Liu, Fan
    Chen, Ziqi
    Lyu, Xin
    IEEE Geoscience and Remote Sensing Letters, 2024, 21
  • [28] Uncertainty-aware consistency regularization for cross-domain semantic segmentation
    Zhou, Qianyu
    Feng, Zhengyang
    Gu, Qiqi
    Cheng, Guangliang
    Lu, Xuequan
    Shi, Jianping
    Ma, Lizhuang
    Computer Vision and Image Understanding, 2022, 221
  • [29] Confidence-and-Refinement Adaptation Model for Cross-Domain Semantic Segmentation
    Zhang, Xiaohong
    Chen, Yi
    Shen, Ziyi
    Shen, Yuming
    Zhang, Haofeng
    Zhang, Yudong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9529 - 9542
  • [30] Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing
    Cho, Kyusik
    Lee, Suhyeon
    Seong, Hongje
    Kim, Euntai
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 489 - 498