Feature Shrinkage Pyramid for Camouflaged Object Detection with Transformers

Cited by: 32
Authors
Huang, Zhou [1, 2]
Dai, Hang [3]
Xiang, Tian-Zhu [4]
Wang, Shuo [5]
Chen, Huai-Xin [2]
Qin, Jie [6]
Xiong, Huan [7]
Affiliations
[1] Sichuan Changhong Elect Co Ltd, Mianyang, Sichuan, Peoples R China
[2] UESTC, Chengdu, Peoples R China
[3] Univ Glasgow, Glasgow, Lanark, Scotland
[4] G42, Shanghai, Peoples R China
[5] Swiss Fed Inst Technol, Zurich, Switzerland
[6] NUAA, CCST, Nanjing, Peoples R China
[7] MBZUAI, Abu Dhabi, U Arab Emirates
DOI
10.1109/CVPR52729.2023.00538
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Vision transformers have recently shown strong global context modeling capabilities in camouflaged object detection. However, they suffer from two major limitations: less effective locality modeling and insufficient feature aggregation in decoders, both of which hinder camouflaged object detection, a task that relies on subtle cues in indistinguishable backgrounds. To address these issues, in this paper, we propose a novel transformer-based Feature Shrinkage Pyramid Network (FSPNet), which aims to hierarchically decode locality-enhanced neighboring transformer features through progressive shrinking for camouflaged object detection. Specifically, we propose a non-local token enhancement module (NL-TEM) that employs the non-local mechanism to let neighboring tokens interact and explores graph-based high-order relations within tokens to enhance local representations of transformers. Moreover, we design a feature shrinkage decoder (FSD) with adjacent interaction modules (AIM), which progressively aggregates adjacent transformer features through a layer-by-layer shrinkage pyramid to accumulate imperceptible but effective cues as much as possible for object information decoding. Extensive quantitative and qualitative experiments demonstrate that the proposed model significantly outperforms 24 existing competitors on three challenging COD benchmark datasets under six widely used evaluation metrics. Our code is publicly available at https://github.com/ZhouHuang23/FSPNet.
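The "layer-by-layer shrinkage pyramid" described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `merge_pair`, `shrink_once`, and `shrinkage_pyramid` are hypothetical, features are plain lists of floats rather than transformer tokens, and an element-wise mean stands in for the adjacent interaction module (AIM), whose actual design is not given in this record. The sketch also assumes non-overlapping pairs and carries an unpaired last feature forward unchanged. It only shows how merging adjacent features repeatedly reduces a stack of features to a single one.

```python
from typing import Callable, List, Sequence

Feature = List[float]


def merge_pair(a: Feature, b: Feature) -> Feature:
    """Hypothetical stand-in for an adjacent interaction module.

    Assumption: an element-wise mean, chosen only to keep the sketch simple.
    """
    if len(a) != len(b):
        raise ValueError("adjacent features must have the same length")
    return [(x + y) / 2.0 for x, y in zip(a, b)]


def shrink_once(
    features: Sequence[Feature],
    merge: Callable[[Feature, Feature], Feature] = merge_pair,
) -> List[Feature]:
    """Merge non-overlapping adjacent pairs, roughly halving the count."""
    merged = [
        merge(features[i], features[i + 1])
        for i in range(0, len(features) - 1, 2)
    ]
    # Assumption: with an odd count, the last feature passes through as-is.
    if len(features) % 2 == 1:
        merged.append(list(features[-1]))
    return merged


def shrinkage_pyramid(
    features: Sequence[Feature],
    merge: Callable[[Feature, Feature], Feature] = merge_pair,
) -> List[List[Feature]]:
    """Return every pyramid level, from the input features down to one."""
    if not features:
        raise ValueError("at least one feature is required")
    levels: List[List[Feature]] = [[list(f) for f in features]]
    while len(levels[-1]) > 1:
        levels.append(shrink_once(levels[-1], merge))
    return levels


if __name__ == "__main__":
    # Four toy "layer features", each a 2-element vector.
    toy = [[float(i), float(i)] for i in range(4)]
    for depth, level in enumerate(shrinkage_pyramid(toy)):
        print(depth, len(level), level)
```

Passing a different `merge` callable swaps in another interaction rule without changing the pyramid loop, which keeps the shrinking schedule separate from how two neighbors are fused.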
Pages: 5557-5566
Number of pages: 10