HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation

被引:15
|
作者
Ding, Jian [1 ,2 ,3 ]
Xue, Nan [1 ]
Xia, Gui-Song [1 ,2 ]
Schiele, Bernt [3 ]
Dai, Dengxin [3 ]
机构
[1] Wuhan Univ, Sch Comp Sci, NERCMS, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China
[3] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
关键词
D O I
10.1109/CVPR52729.2023.01479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current semantic segmentation models have achieved great success under the independent and identically distributed (i.i.d.) condition. However, in real-world applications, test data might come from a different domain than training data. Therefore, it is important to improve model robustness against domain differences. This work studies semantic segmentation under the domain generalization setting, where a model is trained only on the source domain and tested on the unseen target domain. Existing works show that Vision Transformers are more robust than CNNs and show that this is related to the visual grouping property of self-attention. In this work, we propose a novel hierarchical grouping transformer (HGFormer) to explicitly group pixels to form part-level masks and then whole-level masks. The masks at different scales aim to segment out both parts and a whole of classes. HGFormer combines mask classification results at both scales for class label prediction. We assemble multiple interesting cross-domain settings by using seven public semantic segmentation datasets. Experiments show that HGFormer yields more robust semantic segmentation results than per-pixel classification methods and flat-grouping transformers, and outperforms previous methods significantly. Code will be available at https: //github.com/dingjiansw101/HGFormer.
引用
收藏
页码:15413 / 15423
页数:11
相关论文
共 50 条
  • [1] A Patch Diversity Transformer for Domain Generalized Semantic Segmentation
    He, Pei
    Jiao, Licheng
    Shang, Ronghua
    Liu, Xu
    Liu, Fang
    Yang, Shuyuan
    Zhang, Xiangrong
    Wang, Shuang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14138 - 14150
  • [2] Scene sketch semantic segmentation with hierarchical Transformer
    Yang, Jie
    Ke, Aihua
    Yu, Yaoxiang
    Cai, Bo
    KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [3] CoT: Contourlet Transformer for Hierarchical Semantic Segmentation
    Shao, Yilin
    Sun, Long
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 132 - 146
  • [4] CoT: Contourlet Transformer for Hierarchical Semantic Segmentation
    Shao, Yilin
    Sun, Long
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 132 - 146
  • [5] A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation
    Termoehlen, Jan-Aike
    Bartels, Timo
    Fingscheidt, Tim
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4378 - 4387
  • [6] Adversarial Semantic Hallucination for Domain Generalized Semantic Segmentation
    Tjio, Gabriel
    Liu, Ping
    Zhou, Joey Tianyi
    Goh, Rick Siow Mong
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3849 - 3858
  • [7] Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation
    Kim, Minsu
    Joung, Sunghun
    Kim, Seungryong
    Park, Jungin
    Kim, Ig-Jae
    Sohn, Kwanghoon
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1799 - 1807
  • [8] HSPFormer: Hierarchical Spatial Perception Transformer for Semantic Segmentation
    Chen, Siyu
    Han, Ting
    Zhang, Changshe
    Su, Jinhe
    Wang, Ruisheng
    Chen, Yiping
    Wang, Zongyue
    Cai, Guorong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [9] DAT: DOMAIN ADAPTIVE TRANSFORMER FOR DOMAIN ADAPTIVE SEMANTIC SEGMENTATION
    Park, Jinyoung
    Son, Minseok
    Lee, Sumin
    Kim, Changick
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 4183 - 4187
  • [10] Semantic-Aware Domain Generalized Segmentation
    Peng, Duo
    Lei, Yinjie
    Hayat, Munawar
    Guo, Yulan
    Li, Wen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2584 - 2595