Feature Correlation-Steered Capsule Network for object detection

被引:18
|
作者
Lin, Zhongqi [1 ,2 ]
Jia, Jingdun [2 ]
Huang, Feng [3 ]
Gao, Wanlin [1 ,2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr & Rural Affairs, Key Lab Agr Informatizat Standardizat, Beijing 100083, Peoples R China
[3] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Capsule Network (CapsNet); Feature correlation; Part-object association; Expectation-maximum routing agreement; Object detection;
D O I
10.1016/j.neunet.2021.12.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite Convolutional Neural Networks (CNNs) based approaches have been successful in objects detection, they predominantly focus on positioning discriminative regions while overlooking the internal holistic part-whole associations within objects. This would ultimately lead to the neglect of feature relationships between object and its parts as well as among those parts, both of which are significantly helpful for detecting discriminative parts. In this paper, we propose to "look insider the objects "by digging into part-whole feature correlations and take the attempts to leverage those correlations endowed by the Capsule Network (CapsNet) for robust object detection. Actually, highly correlated capsules across adjacent layers share high familiarity, which will be more likely to be routed together. In light of this, we take such correlations between different capsules of the preceding training samples as an awareness to constrain the subsequent candidate voting scope during the routing procedure, and a Feature Correlation-Steered CapsNet (FCS-CapsNet) with Locally-Constrained Expectation-Maximum (EM) Routing Agreement (LCEMRA) is proposed. Different from conventional EM routing, LCEMRA stipulates that only those relevant low-level capsules (parts) meeting the requirement of quantified intra-object cohesiveness can be clustered to make up high-level capsules (objects). In doing so, part-object associations can be dug by transformation weighting matrixes between capsules layers during such "part backtracking'' procedure. LCEMRA enables low-level capsules to selectively gather projections from a non-spatially-fixed set of high-level capsules. Experiments on VOC2007, VOC2012, HKU-IS, DUTS, and COCO show that FCS-CapsNet can achieve promising object detection effects across multiple evaluation metrics, which are on-par with state-of-the-arts. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页码:25 / 41
页数:17
相关论文
共 50 条
  • [1] Feature Transform Correlation Network for Object Detection
    Wan, Shouhong
    Li, Xiaoting
    Jin, Peiquan
    Xie, Jia
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1312 - 1319
  • [2] Noise Is Also Useful: Negative Correlation-Steered Latent Contrastive Learning
    Yan, Jiexi
    Luo, Lei
    Xu, Chenghao
    Deng, Cheng
    Huang, Heng
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 31 - 40
  • [3] Selective Feature Network for Object Detection
    Cui, Yuning
    Shi, Dianxi
    Zhang, Yongjun
    Sung, Qianchong
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] EFGNet: Encoder steered multi-modality feature guidance network for RGB-D salient object detection
    Xia, Chenxing
    Duan, Songsong
    Fang, Xianjin
    Gao, Xiuju
    Sun, Yanguang
    Ge, Bin
    Zhang, Hanling
    Li, Kuan-Ching
    DIGITAL SIGNAL PROCESSING, 2022, 131
  • [5] The variational correlation network for object detection
    Takahashi, Haruhisa
    International Conference on Computational Intelligence for Modelling, Control & Automation Jointly with International Conference on Intelligent Agents, Web Technologies & Internet Commerce, Vol 1, Proceedings, 2006, : 314 - 319
  • [6] Feature Refine Network for Salient Object Detection
    Yang, Jiejun
    Wang, Liejun
    Li, Yongming
    SENSORS, 2022, 22 (12)
  • [7] Parallel Feature Pyramid Network for Object Detection
    Kim, Seung-Wook
    Kook, Hyong-Keun
    Sun, Jee-Young
    Kang, Mun-Cheon
    Ko, Sung-Jea
    COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 239 - 256
  • [8] Feature aggregation network for small object detection
    Jing, Rudong
    Zhang, Wei
    Li, Yuzhuo
    Li, Wenlin
    Liu, Yanyan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [9] An improved feature pyramid network for object detection
    Zhu, Linxiang
    Lee, Feifei
    Cai, Jiawei
    Yu, Hongliu
    Chen, Qiu
    NEUROCOMPUTING, 2022, 483 : 127 - 139
  • [10] Enhanced feature pyramidal network for object detection
    Shao, Mingwen
    Zhang, Wei
    Li, Yunhao
    Fan, Bingbing
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (01)