CARD: Semantic Segmentation With Efficient Class-Aware Regularized Decoder

被引:0
|
作者
Huang, Ye [1 ]
Kang, Di [2 ]
Chen, Liang [3 ]
Jia, Wenjing [4 ]
He, Xiangjian [5 ]
Duan, Lixin [1 ]
Zhe, Xuefei [2 ]
Bao, Linchao [6 ]
机构
[1] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen 518000, Peoples R China
[2] Tencent AI Lab, Shenzhen 518000, Peoples R China
[3] Fujian Normal Univ, Coll Photon & Elect Engn, Fuzhou 350007, Peoples R China
[4] Univ Technol Sydney, Global Big Data & Technol Ctr, Sydney, NSW 2007, Australia
[5] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo 315000, Peoples R China
[6] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, England
关键词
Automobiles; Feature extraction; Semantic segmentation; Task analysis; Decoding; Training; Cows; representation learning; cityscapes; Pascal context; COCOStuff;
D O I
10.1109/TCSVT.2024.3395132
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation has recently achieved notable advances by exploiting "class-level" contextual information during learning, e.g., the Object Contextual Representation (OCR) and Context Prior (CPNet) approaches. However, these approaches simply concatenate class-level information to pixel features to boost pixel representation learning, which cannot fully utilize intra-class and inter-class contextual information. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. To better exploit class-level information, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Moreover, we design a dedicated decoder for CAR (named CARD), which consists of a novel spatial token mixer and an upsampling module, to maximize its gain for existing baselines while being highly efficient in terms of computational cost. Specifically, CAR consists of three novel loss functions. The first loss function encourages more compact class representations within each class, the second directly maximizes the distance between different class centers, and the third further pushes the distance between inter-class centers and pixels. Furthermore, the class center in our approach is directly generated from ground truth instead of from the error-prone coarse prediction. CAR can be directly applied to most existing segmentation models during training, including OCR and CPNet, and can largely improve their accuracy at no additional inference overhead. Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2.23% mIOU with superior generalization ability. CARD outperforms state-of-the-art approaches on multiple benchmarks with a highly efficient architecture. The code will be available at https://github.com/edwardyehuang/CAR.
引用
收藏
页码:9024 / 9038
页数:15
相关论文
共 50 条
  • [41] DOCNet: Dual-Domain Optimized Class-Aware Network for Remote Sensing Image Segmentation
    Ma, Xiaowen
    Che, Rui
    Wang, Xinyu
    Ma, Mengting
    Wu, Sensen
    Feng, Tian
    Zhang, Wei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [42] A difference enhancement and class-aware rebalancing semi-supervised network for cropland semantic change detection
    Dai, Anjin
    Yang, Jianyu
    Zhang, Yuxuan
    Zhang, Tingting
    Tang, Kaixuan
    Xiao, Xiangyi
    Zhang, Shuoji
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2025, 137
  • [43] Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging
    Jiang, Zhongliang
    Kang, Yunfeng
    Bi, Yuan
    Li, Xuesong
    Li, Chenyang
    Navab, Nassir
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 13
  • [44] Unsupervised Domain Adaptation With Class-Aware Memory Alignment
    Wang, Hui
    Zheng, Liangli
    Zhao, Hanbin
    Li, Shijian
    Li, Xi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9930 - 9942
  • [45] Class-aware domain adaptation for improving adversarial robustness
    Hou, Xianxu
    Liu, Jingxin
    Xu, Bolei
    Wang, Xiaolong
    Liu, Bozhi
    Qiu, Guoping
    IMAGE AND VISION COMPUTING, 2020, 99 (99)
  • [46] Class-Aware Mutual Mixup with Triple Alignments for Semi-supervised Cross-Domain Segmentation
    Cai, Zhuotong
    Xin, Jingmin
    Zeng, Tianyi
    Dong, Siyuan
    Zheng, Nanning
    Duncan, James S.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 68 - 79
  • [47] A class-aware representation refinement framework for graph classification
    Xu, Jiaxing
    Ni, Jinjie
    Ke, Yiping
    INFORMATION SCIENCES, 2024, 679
  • [48] Class-Aware Robust Adversarial Training for Object Detection
    Chen, Pin-Chun
    Kung, Bo-Han
    Chen, Jun-Cheng
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10415 - 10424
  • [49] Class-Aware Fully Convolutional Gaussian and Poisson Denoising
    Remez, Tal
    Litany, Or
    Giryes, Raja
    Bronstein, Alex M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5707 - 5722
  • [50] Class-Aware Image Search for Interpretable Cancer Identification
    Ebrahimian, Arash
    Mohammadi, Hossein
    Babaie, Morteza
    Maftoon, Nima
    Tizhoosh, H. R.
    IEEE ACCESS, 2020, 8 : 197352 - 197362