Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation

Cited by: 2

Authors:

Bevandic, Petra [1]

Orsic, Marin [1]

Saric, Josip [1]

Grubisic, Ivan [1]

Segvic, Sinisa [1]

Affiliation:

[1] Univ Zagreb, Fac Elect Engn & Comp, Unska 3, Zagreb 10000, Croatia

Source:

INTERNATIONAL JOURNAL OF COMPUTER VISION | 2024 / Vol. 132 / Issue 07

Keywords:

Semantic segmentation; Multi-domain training; Universal taxonomy

DOI:

10.1007/s11263-024-01986-z

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Deep supervised models have an unprecedented capacity to absorb large quantities of training data. Hence, training on multiple datasets becomes a method of choice towards strong generalization in usual scenes and graceful performance degradation in edge cases. Unfortunately, popular datasets often have discrepant granularities. For instance, the Cityscapes road class subsumes all driving surfaces, while Vistas defines separate classes for road markings, manholes, etc. Furthermore, many datasets have overlapping labels. For instance, pickups are labeled as trucks in VIPER, cars in Vistas, and vans in ADE20k. We address this challenge by considering labels as unions of universal visual concepts. This allows seamless and principled learning on multi-domain dataset collections without requiring any relabeling effort. Our method improves within-dataset and cross-dataset generalization, and provides an opportunity to learn visual concepts which are not separately labeled in any of the training datasets. Experiments reveal competitive or state-of-the-art performance on two multi-domain dataset collections and on the WildDash 2 benchmark.
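The idea of treating a dataset label as a union of universal visual concepts can be illustrated with a small sketch. The abstract does not spell out the formulation, so everything below is an assumption made for illustration: the universal class names, the label-to-concept mapping, and the choice to score a dataset label by summing the softmax probabilities of its member concepts before taking the negative log.

```python
import math

# Hypothetical universal taxonomy (illustrative only, not taken from the paper).
UNIVERSAL = ["road", "road_marking", "manhole", "car", "pickup", "truck"]

# Hypothetical mapping from (dataset, label) to a set of universal concepts.
# It mirrors the abstract's examples: the Cityscapes road class covers all
# driving surfaces, and pickups fall under different labels per dataset.
LABEL_TO_UNIVERSAL = {
    ("cityscapes", "road"): {"road", "road_marking", "manhole"},
    ("vistas", "road_marking"): {"road_marking"},
    ("vistas", "car"): {"car", "pickup"},
    ("viper", "truck"): {"truck", "pickup"},
}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def label_probability(logits, dataset, label):
    """Probability of a dataset label as the sum over its universal concepts."""
    probs = softmax(logits)
    members = LABEL_TO_UNIVERSAL[(dataset, label)]
    return sum(p for name, p in zip(UNIVERSAL, probs) if name in members)


def partial_label_nll(logits, dataset, label):
    """Negative log-likelihood of a (possibly coarse) dataset label."""
    return -math.log(label_probability(logits, dataset, label))


# Universal-class logits for one pixel (made-up numbers).
logits = [2.0, 1.0, 0.0, -1.0, -1.0, -1.0]
loss_coarse = partial_label_nll(logits, "cityscapes", "road")
loss_fine = partial_label_nll(logits, "vistas", "road_marking")
```

Under this sketch, a coarse label such as the Cityscapes road class only constrains the total probability mass of its member concepts, so a single set of universal logits can be supervised by datasets with different granularities without relabeling.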

Citation

Pages: 2450-2472

Page count: 23

50 items in total

[1] Multi-Domain Incremental Learning for Semantic Segmentation
Garg, Prachi
Saluja, Rohit
Balasubramanian, Vineeth N.
Arora, Chetan
Subramanian, Anbumani
Jawahar, C. V.
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2080 - 2090
[2] Multi-domain semantic segmentation with overlapping labels
Bevandic, Petra
Orsic, Marin
Grubisic, Ivan
Saric, Josip
Segvic, Sinisa
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2422 - 2431
[3] Learning Visual Words for Weakly-Supervised Semantic Segmentation
Ru, Lixiang
Du, Bo
Wu, Chen
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 982 - 988
[4] Multi-Granular Semantic Mining for Weakly Supervised Semantic Segmentation
Zhang, Meijie
Li, Jianwu
Zhou, Tianfei
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6019 - 6028
[5] Semi-supervised single- and multi-domain regression with multi-domain training
Michaeli, Tomer
Eldar, Yonina C.
Sapiro, Guillermo
INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2012, 1 (01) : 68 - 97
[6] WEAKLY SUPERVISED USER INTENT DETECTION FOR MULTI-DOMAIN DIALOGUES
Sun, Ming
Pappu, Aasish
Chen, Yun-Nung
Rudnicky, Alexander I.
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 91 - 97
[7] MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
Lambert, John
Liu, Zhuang
Sener, Ozan
Hays, James
Koltun, Vladlen
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2876 - 2885
[8] MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation
Lambert, John
Liu, Zhuang
Sener, Ozan
Hays, James
Koltun, Vladlen
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 796 - 810
[9] An Empirical Study on Multi-domain Robust Semantic Segmentation
Liu, Yajie
Ge, Pu
Liu, Qingjie
Fan, Shichao
Wang, Yunhong
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4289 - 4304
[10] Weakly Supervised Semantic Segmentation with a Multi-Image Model
Vezhnevets, Alexander
Ferrari, Vittorio
Buhmann, Joachim M.
2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 643 - 650
