Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing

被引：0

作者：

Cho, Kyusik ^{[1
]}

Lee, Suhyeon ^{[1
]}

Seong, Hongje ^{[1
]}

Kim, Euntai ^{[1
]}

机构：

[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea

来源：

2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) | 2023年

关键词：

D O I：

10.1109/WACV56688.2023.00056

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The network trained for domain adaptation is prone to bias toward the easy-to-transfer classes. Since the ground truth label on the target domain is unavailable during training, the bias problem leads to skewed predictions, forgetting to predict hard-to-transfer classes. To address this problem, we propose Cross-domain Moving Object Mixing (CMOM) that cuts several objects, including hard-to-transfer classes, in the source domain video clip and pastes them into the target domain video clip. Unlike image-level domain adaptation, the temporal context should be maintained to mix moving objects in two different videos. Therefore, we design CMOM to mix with consecutive video frames, so that unrealistic movements are not occurring. We additionally propose Feature Alignment with Temporal Context (FATC) to enhance target domain feature discriminability. FATC exploits the robust source domain features, which are trained with ground truth labels, to learn discriminative target domain features in an unsupervised manner by filtering unreliable predictions with temporal consensus. We demonstrate the effectiveness of the proposed approaches through extensive experiments. In particular, our model reaches mIoU of 53.81% on VIPER. Cityscapes-Seq benchmark and mIoU of 56.31% on SYNTHIA-Seq. Cityscapes-Seq benchmark, surpassing the state-of-the-art methods by large margins.

引用

下载

页码：489 / 498

页数：10

共 50 条

[1] Cross-Domain Transformer with Adaptive Thresholding for Domain Adaptive Semantic Segmentation
Liu, Quansheng
Wang, Lei
Jun, Yu
Gao, Fang
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 147 - 159
[2] Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation
Kim, Minsu
Joung, Sunghun
Kim, Seungryong
Park, Jungin
Kim, Ig-Jae
Sohn, Kwanghoon
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1799 - 1807
[3] TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation
Gong, Rui
Danelljan, Martin
Dai, Dengxin
Paudel, Danda Pani
Chhatkuli, Ajad
Yu, Fisher
Van Gool, Luc
COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 19 - 35
[4] CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic Segmentation
Wang, Kaihong
Kim, Donghyun
Feris, Rogerio
Betke, Margrit
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11485 - 11495
[5] Cross-Domain Semantic Segmentation via Domain-Invariant Interactive Relation Transfer
Lv, Fengmao
Liang, Tao
Chen, Xiang
Lin, Guosheng
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4333 - 4342
[6] MULTISCALE DOMAIN ADAPTIVE YOLO FOR CROSS-DOMAIN OBJECT DETECTION
Hnewa, Mazin
Radha, Hayder
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3323 - 3327
[7] Cross-Domain Adaptive Teacher for Object Detection
Li, Yu-Jhe
Dai, Xiaoliang
Ma, Chih-Yao
Liu, Yen-Cheng
Chen, Kan
Wu, Bichen
He, Zijian
Kitani, Kris
Vajda, Peter
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7571 - 7580
[8] Crots: Cross-Domain Teacher–Student Learning for Source-Free Domain Adaptive Semantic Segmentation
Xin Luo
Wei Chen
Zhengfa Liang
Longqi Yang
Siwei Wang
Chen Li
International Journal of Computer Vision, 2024, 132 : 20 - 39
[9] Cross-Domain Few-Shot Semantic Segmentation
Lei, Shuo
Zhang, Xuchao
He, Jianfeng
Chen, Fanglan
Du, Bowen
Lu, Chang-Tien
COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 73 - 90
[10] A global reweighting approach for cross-domain semantic segmentation
Zhang, Yuhang
Tian, Shishun
Liao, Muxin
Hua, Guoguang
Zou, Wenbin
Xu, Chen
Signal Processing: Image Communication, 2025, 130

← 1 2 3 4 5 →