Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

被引:0
|
作者
Cho, Kyusik [1 ]
Lee, Suhyeon [1 ]
Seong, Hongje [1 ]
Kim, Euntai [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
关键词
D O I
10.1109/WACV56688.2023.00056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The network trained for domain adaptation is prone to bias toward the easy-to-transfer classes. Since the ground truth label on the target domain is unavailable during training, the bias problem leads to skewed predictions, forgetting to predict hard-to-transfer classes. To address this problem, we propose Cross-domain Moving Object Mixing (CMOM) that cuts several objects, including hard-to-transfer classes, in the source domain video clip and pastes them into the target domain video clip. Unlike image-level domain adaptation, the temporal context should be maintained to mix moving objects in two different videos. Therefore, we design CMOM to mix with consecutive video frames, so that unrealistic movements are not occurring. We additionally propose Feature Alignment with Temporal Context (FATC) to enhance target domain feature discriminability. FATC exploits the robust source domain features, which are trained with ground truth labels, to learn discriminative target domain features in an unsupervised manner by filtering unreliable predictions with temporal consensus. We demonstrate the effectiveness of the proposed approaches through extensive experiments. In particular, our model reaches mIoU of 53.81% on VIPER. Cityscapes-Seq benchmark and mIoU of 56.31% on SYNTHIA-Seq. Cityscapes-Seq benchmark, surpassing the state-of-the-art methods by large margins.
引用
下载
收藏
页码:489 / 498
页数:10
相关论文
共 50 条
  • [41] Compressed Domain Video Object Segmentation
    Porikli, Fatih
    Bashir, Faisal
    Sun, Huifang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (01) : 2 - 14
  • [42] Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation?
    Wu, Xinyi
    Wu, Zhenyao
    Wan, Jin
    Ju, Lili
    Wang, Song
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 357 - 373
  • [43] Cross-Domain Object Detection with Missing Classes in Target Domain
    Qiu, Benliu
    Qiu, Heqian
    Wen, Haitao
    Song, Zichen
    Xu, Linfeng
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [44] Source-Data-Free Cross-Domain Knowledge Transfer for Semantic Segmentation
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 92 - 100
  • [45] Cross-domain few-shot semantic segmentation for the astronaut work environment
    Sun, Qingwei
    Chao, Jiangang
    Lin, Wanhong
    Advances in Space Research, 2024, 74 (11) : 5934 - 5949
  • [46] Depth-Assisted ResiDualGAN for Cross-Domain Aerial Images Semantic Segmentation
    Yang, Zhao
    Guo, Peng
    Gao, Han
    Chen, Xiuwan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [47] Depth-Assisted ResiDualGAN for Cross-Domain Aerial Images Semantic Segmentation
    Yang, Zhao
    Guo, Peng
    Gao, Han
    Chen, Xiuwan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [48] Informative Data Mining for One-shot Cross-Domain Semantic Segmentation
    Wang, Yuxi
    Liang, Jian
    Xiao, Jun
    Mei, Shuqi
    Yang, Yuran
    Zhang, Zhaoxiang
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1064 - 1074
  • [49] Sequential Recommendation via an Adaptive Cross-domain Knowledge Decomposition
    Zhao, Chuang
    Li, Xinyu
    He, Ming
    Zhao, Hongke
    Fan, Jianping
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3453 - 3463
  • [50] Cross-domain Semantic Feature Learning via Adversarial Adaptation Networks
    Li, Rui
    Cao, Wenming
    Qian, Sheng
    Wong, Hau-San
    Wu, Si
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 37 - 42