MSeg: A Composite Dataset for Multi-Domain Semantic Segmentation

被引：5

作者：

Lambert, John ^{[1
]}

Liu, Zhuang ^{[2
]}

Sener, Ozan ^{[3
]}

Hays, James ^{[1
]}

Koltun, Vladlen ^{[3
]}

机构：

[1] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA

[2] Univ Calif Berkeley, Dept EECS, Berkeley, CA 94720 USA

[3] Intel Labs, Santa Clara, CA 95054 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 01期

关键词：

Training; Semantics; Computational modeling; Annotations; Taxonomy; Image segmentation; Benchmark testing; Robust vision; semantic segmentation; instance segmentation; panoptic segmentation; domain generalization;

D O I：

10.1109/TPAMI.2022.3151200

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images, requiring more than 1.34 years of collective annotator effort. The resulting composite dataset enables training a single semantic segmentation model that functions effectively across domains and generalizes to datasets that were not seen during training. We adopt zero-shot cross-dataset transfer as a benchmark to systematically evaluate a model's robustness and show that MSeg training yields substantially more robust models in comparison to training on individual datasets or naive mixing of datasets without the presented contributions. A model trained on MSeg ranks first on the WildDash-v1 leaderboard for robust semantic segmentation, with no exposure to WildDash data during training. We evaluate our models in the 2020 Robust Vision Challenge (RVC) as an extreme generalization experiment. MSeg training sets include only three of the seven datasets in the RVC; more importantly, the evaluation taxonomy of RVC is different and more detailed. Surprisingly, our model shows competitive performance and ranks second. To evaluate how close we are to the grand aim of robust, efficient, and complete scene understanding, we go beyond semantic segmentation by training instance segmentation and panoptic segmentation models using our dataset. Moreover, we also evaluate various engineering design decisions and metrics, including resolution and computational efficiency. Although our models are far from this grand aim, our comprehensive evaluation is crucial for progress. We share all the models and code with the community.

引用

页码：796 / 810

页数：15

共 50 条

[1] MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
Lambert, John
Liu, Zhuang
Sener, Ozan
Hays, James
Koltun, Vladlen
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2876 - 2885
[2] Multi-Domain Incremental Learning for Semantic Segmentation
Garg, Prachi
Saluja, Rohit
Balasubramanian, Vineeth N.
Arora, Chetan
Subramanian, Anbumani
Jawahar, C., V
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2080 - 2090
[3] Multi-domain semantic segmentation with overlapping labels
Bevandic, Petra
Orsic, Marin
Grubisic, Ivan
Saric, Josip
Segvic, Sinisa
2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 2422 - 2431
[4] An Empirical Study on Multi-domain Robust Semantic Segmentation
Liu, Yajie
Ge, Pu
Liu, Qingjie
Fan, Shichao
Wang, Yunhong
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (10) : 4289 - 4304
[5] Semantic Segmentation via Multi-task, Multi-domain Learning
Fourure, Damien
Emonet, Remi
Fromont, Elisa
Muselet, Damien
Tremeau, Alain
Wolf, Christian
STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 333 - 343
[6] Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings
Yin, Wei
Liu, Yifan
Shen, Chunhua
Sun, Baichuan
van den Hengel, Anton
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4036 - 4051
[7] Multi-Domain Semantic-Segmentation using Multi-Head Model
Masaki, Shota
Hirakawa, Tsubasa
Yamashita, Takayoshi
Fujiyoshi, Hironobu
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2802 - 2808
[8] Multi-task, multi-domain learning: Application to semantic segmentation and pose regression
Fourure, Damien
Emonet, Remi
Fromont, Elisa
Muselet, Damien
Neverova, Natalia
Tremeau, Alain
Wolf, Christian
NEUROCOMPUTING, 2017, 251 : 68 - 80
[9] Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation
Bevandic, Petra
Orsic, Marin
Saric, Josip
Grubisic, Ivan
Segvic, Sinisa
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2450 - 2472
[10] What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation
Blumenstiel, Benedikt
Jakubik, Johannes
Kuehne, Hilde
Voessing, Michael
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →