Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

被引:0
|
作者
Cho, Kyusik [1 ]
Lee, Suhyeon [1 ]
Seong, Hongje [1 ]
Kim, Euntai [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
关键词
D O I
10.1109/WACV56688.2023.00056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The network trained for domain adaptation is prone to bias toward the easy-to-transfer classes. Since the ground truth label on the target domain is unavailable during training, the bias problem leads to skewed predictions, forgetting to predict hard-to-transfer classes. To address this problem, we propose Cross-domain Moving Object Mixing (CMOM) that cuts several objects, including hard-to-transfer classes, in the source domain video clip and pastes them into the target domain video clip. Unlike image-level domain adaptation, the temporal context should be maintained to mix moving objects in two different videos. Therefore, we design CMOM to mix with consecutive video frames, so that unrealistic movements are not occurring. We additionally propose Feature Alignment with Temporal Context (FATC) to enhance target domain feature discriminability. FATC exploits the robust source domain features, which are trained with ground truth labels, to learn discriminative target domain features in an unsupervised manner by filtering unreliable predictions with temporal consensus. We demonstrate the effectiveness of the proposed approaches through extensive experiments. In particular, our model reaches mIoU of 53.81% on VIPER. Cityscapes-Seq benchmark and mIoU of 56.31% on SYNTHIA-Seq. Cityscapes-Seq benchmark, surpassing the state-of-the-art methods by large margins.
引用
下载
收藏
页码:489 / 498
页数:10
相关论文
共 50 条
  • [1] Cross-Domain Transformer with Adaptive Thresholding for Domain Adaptive Semantic Segmentation
    Liu, Quansheng
    Wang, Lei
    Jun, Yu
    Gao, Fang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 147 - 159
  • [2] Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation
    Kim, Minsu
    Joung, Sunghun
    Kim, Seungryong
    Park, Jungin
    Kim, Ig-Jae
    Sohn, Kwanghoon
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1799 - 1807
  • [3] TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation
    Gong, Rui
    Danelljan, Martin
    Dai, Dengxin
    Paudel, Danda Pani
    Chhatkuli, Ajad
    Yu, Fisher
    Van Gool, Luc
    COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 19 - 35
  • [4] CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic Segmentation
    Wang, Kaihong
    Kim, Donghyun
    Feris, Rogerio
    Betke, Margrit
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11485 - 11495
  • [5] Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer
    Lv, Fengmao
    Liang, Tao
    Chen, Xiang
    Lin, Guosheng
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4333 - 4342
  • [6] MULTISCALE DOMAIN ADAPTIVE YOLO FOR CROSS-DOMAIN OBJECT DETECTION
    Hnewa, Mazin
    Radha, Hayder
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3323 - 3327
  • [7] Cross-Domain Adaptive Teacher for Object Detection
    Li, Yu-Jhe
    Dai, Xiaoliang
    Ma, Chih-Yao
    Liu, Yen-Cheng
    Chen, Kan
    Wu, Bichen
    He, Zijian
    Kitani, Kris
    Vajda, Peter
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7571 - 7580
  • [8] Crots: Cross-Domain Teacher–Student Learning for Source-Free Domain Adaptive Semantic Segmentation
    Xin Luo
    Wei Chen
    Zhengfa Liang
    Longqi Yang
    Siwei Wang
    Chen Li
    International Journal of Computer Vision, 2024, 132 : 20 - 39
  • [9] Cross-Domain Few-Shot Semantic Segmentation
    Lei, Shuo
    Zhang, Xuchao
    He, Jianfeng
    Chen, Fanglan
    Du, Bowen
    Lu, Chang-Tien
    COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 73 - 90
  • [10] A global reweighting approach for cross-domain semantic segmentation
    Zhang, Yuhang
    Tian, Shishun
    Liao, Muxin
    Hua, Guoguang
    Zou, Wenbin
    Xu, Chen
    Signal Processing: Image Communication, 2025, 130