HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation

被引:15
|
作者
Ding, Jian [1 ,2 ,3 ]
Xue, Nan [1 ]
Xia, Gui-Song [1 ,2 ]
Schiele, Bernt [3 ]
Dai, Dengxin [3 ]
机构
[1] Wuhan Univ, Sch Comp Sci, NERCMS, Wuhan, Peoples R China
[2] Wuhan Univ, State Key Lab LIESMARS, Wuhan, Peoples R China
[3] Max Planck Inst Informat, Saarland Informat Campus, Saarbrucken, Germany
关键词
D O I
10.1109/CVPR52729.2023.01479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current semantic segmentation models have achieved great success under the independent and identically distributed (i.i.d.) condition. However, in real-world applications, test data might come from a different domain than training data. Therefore, it is important to improve model robustness against domain differences. This work studies semantic segmentation under the domain generalization setting, where a model is trained only on the source domain and tested on the unseen target domain. Existing works show that Vision Transformers are more robust than CNNs and show that this is related to the visual grouping property of self-attention. In this work, we propose a novel hierarchical grouping transformer (HGFormer) to explicitly group pixels to form part-level masks and then whole-level masks. The masks at different scales aim to segment out both parts and a whole of classes. HGFormer combines mask classification results at both scales for class label prediction. We assemble multiple interesting cross-domain settings by using seven public semantic segmentation datasets. Experiments show that HGFormer yields more robust semantic segmentation results than per-pixel classification methods and flat-grouping transformers, and outperforms previous methods significantly. Code will be available at https: //github.com/dingjiansw101/HGFormer.
引用
收藏
页码:15413 / 15423
页数:11
相关论文
共 50 条
  • [21] Hierarchical Image Segmentation by Polygon Grouping
    Prasad, Lakshman
    Swaminarayan, Sriram
    2008 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, VOLS 1-3, 2008, : 221 - 228
  • [22] Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
    Pak, Byeonghyun
    Woo, Byeongju
    Kim, Sunghwan
    Kim, Dae-hwan
    Kim, Hoseong
    COMPUTER VISION-ECCV 2024, PT LVII, 2025, 15115 : 37 - 54
  • [23] Contrastive Grouping with Transformer for Referring Image Segmentation
    Tang, Jiajin
    Zheng, Ge
    Shi, Cheng
    Yang, Sibei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23570 - 23580
  • [24] META-LEARNED FEATURE CRITICS FOR DOMAIN GENERALIZED SEMANTIC SEGMENTATION
    Shiau, Zu-Yun
    Lin, Wei-Wei
    Lin, Ci-Siang
    Wang, Yu-Chiang Frank
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2244 - 2248
  • [25] ELiFormer: A hierarchical Transformer based Model with Efficient Encoder and Lightweight Decoder for Semantic Segmentation
    Wu, Zixuan
    Zhou, Yue
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [26] A Hierarchical Loss for Semantic Segmentation
    Muller, Bruce
    Smith, William
    VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP, 2020, : 260 - 267
  • [27] Deep Hierarchical Semantic Segmentation
    Li, Liulei
    Zhou, Tianfei
    Wang, Wenguan
    Li, Jianwu
    Yang, Yi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1236 - 1247
  • [28] Efficient Semantic Segmentation using Gradual Grouping
    Vallurupalli, Nikitha
    Annamaneni, Sriharsha
    Varma, Girish
    Jawahar, C., V
    Mathew, Manu
    Nagori, Soyeb
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 711 - 719
  • [29] A segmentation network for generalized lesion extraction with semantic fusion of transformer with value vector enhancement
    Wang, Yuefei
    Wei, Yuanhong
    Yu, Xi
    Wang, Jin
    Zhang, Yutong
    Zhang, Li
    Wan, Yuxuan
    Chen, Zhixuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 266
  • [30] Transformer Scale Gate for Semantic Segmentation
    Shi, Hengcan
    Hayat, Munawar
    Cai, Jianfei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 3051 - 3060