CARD: Semantic Segmentation With Efficient Class-Aware Regularized Decoder

被引:0
|
作者
Huang, Ye [1 ]
Kang, Di [2 ]
Chen, Liang [3 ]
Jia, Wenjing [4 ]
He, Xiangjian [5 ]
Duan, Lixin [1 ]
Zhe, Xuefei [2 ]
Bao, Linchao [6 ]
机构
[1] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen 518000, Peoples R China
[2] Tencent AI Lab, Shenzhen 518000, Peoples R China
[3] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350007, Peoples R China
[4] Univ Technol Sydney, Global Big Data & Technol Ctr, Sydney, NSW 2007, Australia
[5] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315000, Peoples R China
[6] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
关键词
Automobiles; Feature extraction; Semantic segmentation; Task analysis; Decoding; Training; Cows; representation learning; cityscapes; Pascal context; COCOStuff;
D O I
10.1109/TCSVT.2024.3395132
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation has recently achieved notable advances by exploiting "class-level" contextual information during learning, e.g., the Object Contextual Representation (OCR) and Context Prior (CPNet) approaches. However, these approaches simply concatenate class-level information to pixel features to boost pixel representation learning, which cannot fully utilize intra-class and inter-class contextual information. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. To better exploit class-level information, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Moreover, we design a dedicated decoder for CAR (named CARD), which consists of a novel spatial token mixer and an upsampling module, to maximize its gain for existing baselines while being highly efficient in terms of computational cost. Specifically, CAR consists of three novel loss functions. The first loss function encourages more compact class representations within each class, the second directly maximizes the distance between different class centers, and the third further pushes the distance between inter-class centers and pixels. Furthermore, the class center in our approach is directly generated from ground truth instead of from the error-prone coarse prediction. CAR can be directly applied to most existing segmentation models during training, including OCR and CPNet, and can largely improve their accuracy at no additional inference overhead. Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2.23% mIOU with superior generalization ability. CARD outperforms state-of-the-art approaches on multiple benchmarks with a highly efficient architecture. The code will be available at https://github.com/edwardyehuang/CAR.
引用
收藏
页码:9024 / 9038
页数:15
相关论文
共 50 条
  • [1] CAR: Class-Aware Regularizations for Semantic Segmentation
    Huang, Ye
    Kang, Di
    Chen, Liang
    Zhe, Xuefei
    Jia, Wenjing
    Bao, Linchao
    He, Xiangjian
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 518 - 534
  • [2] CAA: Class-Aware Affinity calculation add-on for semantic segmentation
    Tang, Huadong
    Zhao, Youpeng
    Du, Chaofan
    Xu, Min
    Wu, Qiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [3] Lightweight Semantic Segmentation Network Leveraging Class-Aware Contextual Information
    Xu, Xuetian
    Huang, Shaorong
    Lai, Helang
    IEEE ACCESS, 2023, 11 : 144722 - 144734
  • [4] Class-Aware Domain Adaptation for Semantic Segmentation of Remote Sensing Images
    Xu, Qingsong
    Yuan, Xin
    Ouyang, Chaojun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [5] Image synthesis with class-aware semantic diffusion models for surgical scene segmentation
    Zhou, Yihang
    Towning, Rebecca
    Awad, Zaid
    Giannarou, Stamatia
    HEALTHCARE TECHNOLOGY LETTERS, 2025, 12 (01)
  • [6] Class-aware feature attention-based semantic segmentation on hyperspectral images
    Sevugan, Prabu
    Rudhrakoti, Venkatesan
    Kim, Tai-hoon
    Gunasekaran, Megala
    Purushotham, Swarnalatha
    Chinthaginjala, Ravikumar
    Ahmad, Irfan
    Kumar, A.
    PLOS ONE, 2025, 20 (02):
  • [7] Consistency Learning based on Class-Aware Style Variation for Domain Generalizable Semantic Segmentation
    Su, Siwei
    Wang, Haijian
    Yang, Meng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6029 - 6038
  • [8] Class-Aware Pruning for Efficient Neural Networks
    Jiang, Mengnan
    Wang, Jingcun
    Eldebiky, Amro
    Yin, Xunzhao
    Zhuo, Cheng
    Lin, Ing-Chao
    Li Zhang, Grace
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [9] Class-Aware Adversarial Transformers for Medical Image Segmentation
    You, Chenyu
    Zhao, Ruihan
    Liu, Fenglin
    Dong, Siyuan
    Chinchali, Sandeep
    Topcu, Ufuk
    Staib, Lawrence
    Duncan, James S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] Class-aware edge-assisted lightweight semantic segmentation network for power transmission line inspection
    Qingkai Zhou
    Qingwu Li
    Chang Xu
    Qiuyu Lu
    Yaqin Zhou
    Applied Intelligence, 2023, 53 : 6826 - 6843