Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers

被引:32
|
作者
Huang, Zhou [1 ,2 ]
Dai, Hang [3 ]
Xiang, Tian-Zhu [4 ]
Wang, Shuo [5 ]
Chen, Huai-Xin [2 ]
Qin, Jie [6 ]
Xiong, Huan [7 ]
机构
[1] Sichuan Changhong Elect Co Ltd, Mianyang, Sichuan, Peoples R China
[2] UESTC, Chengdu, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
[4] G42, Shanghai, Peoples R China
[5] Swiss Fed Inst Technol, Zurich, Switzerland
[6] NUAA, CCST, Nanjing, Peoples R China
[7] MBZUAI, Abu Dhabi, U Arab Emirates
关键词
D O I
10.1109/CVPR52729.2023.00538
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, which are not conducive to camouflaged object detection that explores subtle cues from indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camouflaged object detection. Specifically, we propose a nonlocal token enhancement module (NL-TEM) that employs the non-local mechanism to interact neighboring tokens and explore graph-based high-order relations within tokens to enhance local representations of transformers. Moreover, we design a feature shrinkage decoder (FSD) with adjacent interaction modules (AIM), which progressively aggregates adjacent transformer features through a layer-by-layer shrinkage pyramid to accumulate imperceptible but effective cues as much as possible for object information decoding. Extensive quantitative and qualitative experiments demonstrate that the proposed model significantly outperforms the existing 24 competitors on three challenging COD benchmark datasets under six widely-used evaluation metrics. Our code is publicly available at https: //github.com/ZhouHuang23/FSPNet.
引用
收藏
页码:5557 / 5566
页数:10
相关论文
共 50 条
  • [21] Contextual feature fusion and refinement network for camouflaged object detection
    Yang, Jinyu
    Shi, Yanjiao
    Jiang, Ying
    Lu, Zixuan
    Yi, Yugen
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [22] Boundary Guided Feature Fusion Network for Camouflaged Object Detection
    Qiu, Tianchi
    Li, Xiuhong
    Liu, Kangwei
    Li, Songlin
    Chen, Fan
    Zhou, Chenyu
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 433 - 444
  • [23] Boundary Feature Fusion and Foreground Guidance for Camouflaged Object Detection
    Liu, Wen-Xi
    Zhang, Jia-Bang
    Li, Yue-Zhou
    Lai, Yu
    Niu, Yu-Zhen
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (07): : 2279 - 2290
  • [24] Camouflaged Object Detection
    Fan, Deng-Ping
    Ji, Ge-Peng
    Sun, Guolei
    Cheng, Ming-Ming
    Shen, Jianbing
    Shao, Ling
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2774 - 2784
  • [25] MuTrans: Multiple Transformers for Fusing Feature Pyramid on 2D and 3D Object Detection
    Xie, Bangquan
    Yang, Liang
    Wei, Ailin
    Weng, Xiaoxiong
    Li, Bing
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4407 - 4415
  • [26] Small Object Detection Based on Lightweight Feature Pyramid
    Li, Ziyang
    Guo, Chenwei
    Han, Guang
    [J]. IEEE Transactions on Consumer Electronics, 2024, 70 (03) : 6064 - 6074
  • [27] SFPN: Semantic Feature Pyramid Network for Object Detection
    Gan, Yi
    Xu, Wei
    Su, Jianbo
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 795 - 802
  • [28] Bidirectional Matrix Feature Pyramid Network for Object Detection
    Xu, Wei
    Gan, Yi
    Su, Jianbo
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8000 - 8007
  • [29] Attentional feature pyramid network for small object detection
    Min, Kyungseo
    Lee, Gun-Hee
    Lee, Seong-Whan
    [J]. NEURAL NETWORKS, 2022, 155 : 439 - 450
  • [30] Bidirectional Parallel Feature Pyramid Network for Object Detection
    Zhang, Zhengning
    Zhang, Lin
    Wang, Yue
    Feng, Pengming
    Sun, Baochen
    [J]. IEEE ACCESS, 2022, 10 : 49422 - 49432