Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation

被引:2
|
作者
Bevandic, Petra [1 ]
Orsic, Marin [1 ]
Saric, Josip [1 ]
Grubisic, Ivan [1 ]
Segvic, Sinisa [1 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Unska 3, Zagreb 10000, Croatia
关键词
Semantic segmentation; Multi-domain training; Universal taxonomy;
D O I
10.1007/s11263-024-01986-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep supervised models have an unprecedented capacity to absorb large quantities of training data. Hence, training on multiple datasets becomes a method of choice towards strong generalization in usual scenes and graceful performance degradation in edge cases. Unfortunately, popular datasets often have discrepant granularities. For instance, the Cityscapes road class subsumes all driving surfaces, while Vistas defines separate classes for road markings, manholes etc. Furthermore, many datasets have overlapping labels. For instance, pickups are labeled as trucks in VIPER, cars in Vistas, and vans in ADE20k. We address this challenge by considering labels as unions of universal visual concepts. This allows seamless and principled learning on multi-domain dataset collections without requiring any relabeling effort. Our method improves within-dataset and cross-dataset generalization, and provides opportunity to learn visual concepts which are not separately labeled in any of the training datasets. Experiments reveal competitive or state-of-the-art performance on two multi-domain dataset collections and on the WildDash 2 benchmark.
引用
收藏
页码:2450 / 2472
页数:23
相关论文
共 50 条
  • [1] Multi-Domain Incremental Learning for Semantic Segmentation
    Garg, Prachi
    Saluja, Rohit
    Balasubramanian, Vineeth N.
    Arora, Chetan
    Subramanian, Anbumani
    Jawahar, C., V
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2080 - 2090
  • [2] Multi-domain semantic segmentation with overlapping labels
    Bevandic, Petra
    Orsic, Marin
    Grubisic, Ivan
    Saric, Josip
    Segvic, Sinisa
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2422 - 2431
  • [3] Learning Visual Words for Weakly-Supervised Semantic Segmentation
    Ru, Lixiang
    Du, Bo
    Wu, Chen
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 982 - 988
  • [4] Multi-Granular Semantic Mining for Weakly Supervised Semantic Segmentation
    Zhang, Meijie
    Li, Jianwu
    Zhou, Tianfei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6019 - 6028
  • [5] Semi-supervised single- and multi-domain regression with multi-domain training
    Michaeli, Tomer
    Eldar, Yonina C.
    Sapiro, Guillermo
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2012, 1 (01) : 68 - 97
  • [6] WEAKLY SUPERVISED USER INTENT DETECTION FOR MULTI-DOMAIN DIALOGUES
    Sun, Ming
    Pappu, Aasish
    Chen, Yun-Nung
    Rudnicky, Alexander I.
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 91 - 97
  • [7] MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
    Lambert, John
    Liu, Zhuang
    Sener, Ozan
    Hays, James
    Koltun, Vladlen
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2876 - 2885
  • [8] MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation
    Lambert, John
    Liu, Zhuang
    Sener, Ozan
    Hays, James
    Koltun, Vladlen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 796 - 810
  • [9] An Empirical Study on Multi-domain Robust Semantic Segmentation
    Liu, Yajie
    Ge, Pu
    Liu, Qingjie
    Fan, Shichao
    Wang, Yunhong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4289 - 4304
  • [10] Weakly Supervised Semantic Segmentation with a Multi-Image Model
    Vezhnevets, Alexander
    Ferrari, Vittorio
    Buhmann, Joachim M.
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 643 - 650