MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation

被引:5
|
作者
Lambert, John [1 ]
Liu, Zhuang [2 ]
Sener, Ozan [3 ]
Hays, James [1 ]
Koltun, Vladlen [3 ]
机构
[1] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA
[2] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA
[3] Intel Labs, Santa Clara, CA 95054 USA
关键词
Training; Semantics; Computational modeling; Annotations; Taxonomy; Image segmentation; Benchmark testing; Robust vision; semantic segmentation; instance segmentation; panoptic segmentation; domain generalization;
D O I
10.1109/TPAMI.2022.3151200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images, requiring more than 1.34 years of collective annotator effort. The resulting composite dataset enables training a single semantic segmentation model that functions effectively across domains and generalizes to datasets that were not seen during training. We adopt zero-shot cross-dataset transfer as a benchmark to systematically evaluate a model's robustness and show that MSeg training yields substantially more robust models in comparison to training on individual datasets or naive mixing of datasets without the presented contributions. A model trained on MSeg ranks first on the WildDash-v1 leaderboard for robust semantic segmentation, with no exposure to WildDash data during training. We evaluate our models in the 2020 Robust Vision Challenge (RVC) as an extreme generalization experiment. MSeg training sets include only three of the seven datasets in the RVC; more importantly, the evaluation taxonomy of RVC is different and more detailed. Surprisingly, our model shows competitive performance and ranks second. To evaluate how close we are to the grand aim of robust, efficient, and complete scene understanding, we go beyond semantic segmentation by training instance segmentation and panoptic segmentation models using our dataset. Moreover, we also evaluate various engineering design decisions and metrics, including resolution and computational efficiency. Although our models are far from this grand aim, our comprehensive evaluation is crucial for progress. We share all the models and code with the community.
引用
收藏
页码:796 / 810
页数:15
相关论文
共 50 条
  • [1] MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
    Lambert, John
    Liu, Zhuang
    Sener, Ozan
    Hays, James
    Koltun, Vladlen
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2876 - 2885
  • [2] Multi-Domain Incremental Learning for Semantic Segmentation
    Garg, Prachi
    Saluja, Rohit
    Balasubramanian, Vineeth N.
    Arora, Chetan
    Subramanian, Anbumani
    Jawahar, C., V
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2080 - 2090
  • [3] Multi-domain semantic segmentation with overlapping labels
    Bevandic, Petra
    Orsic, Marin
    Grubisic, Ivan
    Saric, Josip
    Segvic, Sinisa
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2422 - 2431
  • [4] An Empirical Study on Multi-domain Robust Semantic Segmentation
    Liu, Yajie
    Ge, Pu
    Liu, Qingjie
    Fan, Shichao
    Wang, Yunhong
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4289 - 4304
  • [5] Semantic Segmentation via Multi-task, Multi-domain Learning
    Fourure, Damien
    Emonet, Remi
    Fromont, Elisa
    Muselet, Damien
    Tremeau, Alain
    Wolf, Christian
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 333 - 343
  • [6] Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings
    Yin, Wei
    Liu, Yifan
    Shen, Chunhua
    Sun, Baichuan
    van den Hengel, Anton
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4036 - 4051
  • [7] Multi-Domain Semantic-Segmentation using Multi-Head Model
    Masaki, Shota
    Hirakawa, Tsubasa
    Yamashita, Takayoshi
    Fujiyoshi, Hironobu
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2802 - 2808
  • [8] Multi-task, multi-domain learning: Application to semantic segmentation and pose regression
    Fourure, Damien
    Emonet, Remi
    Fromont, Elisa
    Muselet, Damien
    Neverova, Natalia
    Tremeau, Alain
    Wolf, Christian
    NEUROCOMPUTING, 2017, 251 : 68 - 80
  • [9] Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation
    Bevandic, Petra
    Orsic, Marin
    Saric, Josip
    Grubisic, Ivan
    Segvic, Sinisa
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2450 - 2472
  • [10] What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation
    Blumenstiel, Benedikt
    Jakubik, Johannes
    Kuehne, Hilde
    Voessing, Michael
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,