CARD: Semantic Segmentation With Efficient Class-Aware Regularized Decoder

被引:0
|
作者
Huang, Ye [1 ]
Kang, Di [2 ]
Chen, Liang [3 ]
Jia, Wenjing [4 ]
He, Xiangjian [5 ]
Duan, Lixin [1 ]
Zhe, Xuefei [2 ]
Bao, Linchao [6 ]
机构
[1] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen 518000, Peoples R China
[2] Tencent AI Lab, Shenzhen 518000, Peoples R China
[3] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350007, Peoples R China
[4] Univ Technol Sydney, Global Big Data & Technol Ctr, Sydney, NSW 2007, Australia
[5] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315000, Peoples R China
[6] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
关键词
Automobiles; Feature extraction; Semantic segmentation; Task analysis; Decoding; Training; Cows; representation learning; cityscapes; Pascal context; COCOStuff;
D O I
10.1109/TCSVT.2024.3395132
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation has recently achieved notable advances by exploiting "class-level" contextual information during learning, e.g., the Object Contextual Representation (OCR) and Context Prior (CPNet) approaches. However, these approaches simply concatenate class-level information to pixel features to boost pixel representation learning, which cannot fully utilize intra-class and inter-class contextual information. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. To better exploit class-level information, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Moreover, we design a dedicated decoder for CAR (named CARD), which consists of a novel spatial token mixer and an upsampling module, to maximize its gain for existing baselines while being highly efficient in terms of computational cost. Specifically, CAR consists of three novel loss functions. The first loss function encourages more compact class representations within each class, the second directly maximizes the distance between different class centers, and the third further pushes the distance between inter-class centers and pixels. Furthermore, the class center in our approach is directly generated from ground truth instead of from the error-prone coarse prediction. CAR can be directly applied to most existing segmentation models during training, including OCR and CPNet, and can largely improve their accuracy at no additional inference overhead. Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2.23% mIOU with superior generalization ability. CARD outperforms state-of-the-art approaches on multiple benchmarks with a highly efficient architecture. The code will be available at https://github.com/edwardyehuang/CAR.
引用
收藏
页码:9024 / 9038
页数:15
相关论文
共 50 条
  • [31] Prompt-and-Transfer: Dynamic Class-Aware Enhancement for Few-Shot Segmentation
    Bi, Hanbo
    Feng, Yingchao
    Diao, Wenhui
    Wang, Peijin
    Mao, Yongqiang
    Fu, Kun
    Wang, Hongqi
    Sun, Xian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (01) : 131 - 148
  • [32] CAWM: Class-Aware Weight Map for Improved Semi-Supervised Nuclei Segmentation
    Lim, Seohoon
    Xu, Zhixin
    Chong, Yosep
    Jung, Seung-Won
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 81 - 85
  • [33] Class-aware similarity hashing for data classification
    Roussev, Vassil
    Richard, Golden
    Marziale, Lodovico
    IFIP Advances in Information and Communication Technology, 2008, 285 : 291 - 298
  • [34] Large-Scale Land Cover Mapping with Fine-Grained Classes via Class-Aware Semi-Supervised Semantic Segmentation
    Dong, Runmin
    Mou, Lichao
    Chen, Mengxuan
    Li, Weijia
    Tong, Xin-Yi
    Yuan, Shuai
    Zhang, Lixian
    Zheng, Juepeng
    Zhu, Xiao Xiang
    Fu, Haohuan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16737 - 16747
  • [35] Class-aware similarity hashing for data classification
    Roussev, Vassil
    Richard III, Golden
    Marziale, Lodovico
    IFIP Advances in Information and Communication Technology, 2008, 285 : 101 - 113
  • [36] Class-Aware Based KNN Classification Method
    Bian, Zekang
    Zhang, Jin
    Wang, Shitong
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2021, 34 (10): : 873 - 884
  • [37] Class-aware similarity hashing for data classification
    Roussev, Vassil
    Richard, Golden, III
    Marziale, Lodovico
    ADVANCES IN DIGITAL FORENSICS IV, 2008, 285 : 101 - 113
  • [38] Contrastive Class-aware Adaptation for Domain Generalization
    Chen, Tianle
    Baktashmotlagh, Mahsa
    Salzmann, Mathieu
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4871 - 4876
  • [39] Class-aware graph Siamese representation learning
    Xu, Chengcheng
    Wang, Tianfeng
    Chen, Man
    Chen, Jun
    Pan, Zhisong
    NEUROCOMPUTING, 2025, 620
  • [40] Layerwise Class-Aware Convolutional Neural Network
    Cui, Zhen
    Niu, Zhiheng
    Liu, Luoqi
    Yan, Shuicheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12) : 2601 - 2612