Feature Correlation-Steered Capsule Network for object detection

Times cited: 18

Authors:

Lin, Zhongqi ^{[1,2]}

Jia, Jingdun ^{[2]}

Huang, Feng ^{[3]}

Gao, Wanlin ^{[1,2]}

Affiliations:

[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China

[2] Minist Agr & Rural Affairs, Key Lab Agr Informatizat Standardizat, Beijing 100083, Peoples R China

[3] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China

Source:

NEURAL NETWORKS | 2022 / Vol. 147

Funding:

National Natural Science Foundation of China;

Keywords:

Capsule Network (CapsNet); Feature correlation; Part-object association; Expectation-maximum routing agreement; Object detection;

DOI:

10.1016/j.neunet.2021.12.003

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Although approaches based on Convolutional Neural Networks (CNNs) have been successful in object detection, they predominantly focus on locating discriminative regions while overlooking the holistic part-whole associations within objects. This ultimately leads to the neglect of feature relationships between an object and its parts, as well as among those parts, both of which are significantly helpful for detecting discriminative parts. In this paper, we propose to "look inside the objects" by digging into part-whole feature correlations, and we attempt to leverage the correlations endowed by the Capsule Network (CapsNet) for robust object detection. Highly correlated capsules across adjacent layers share high familiarity and are more likely to be routed together. In light of this, we use the correlations between different capsules from preceding training samples as prior knowledge to constrain the subsequent candidate voting scope during the routing procedure, and we propose a Feature Correlation-Steered CapsNet (FCS-CapsNet) with Locally-Constrained Expectation-Maximum (EM) Routing Agreement (LCEMRA). Unlike conventional EM routing, LCEMRA stipulates that only relevant low-level capsules (parts) that meet the requirement of quantified intra-object cohesiveness can be clustered to make up high-level capsules (objects). In doing so, part-object associations can be uncovered through the transformation weighting matrices between capsule layers during this "part backtracking" procedure. LCEMRA enables low-level capsules to selectively gather projections from a non-spatially-fixed set of high-level capsules. Experiments on VOC2007, VOC2012, HKU-IS, DUTS, and COCO show that FCS-CapsNet achieves promising object detection results across multiple evaluation metrics, on par with state-of-the-art methods. (c) 2021 Elsevier Ltd. All rights reserved.
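
The abstract's idea of constraining the candidate voting scope can be illustrated with a much simpler stand-in. The sketch below is not the paper's LCEMRA and not full EM routing: it is a plain masked softmax, where each low-level capsule (part) distributes its assignment only over high-level capsules (objects) whose correlation with it reaches a threshold. The names `constrained_assignments`, `logits`, `corr`, and `tau`, and all numeric values, are hypothetical and chosen for illustration only.

```python
import math


def constrained_assignments(logits, corr, tau):
    """Masked softmax over high-level capsules for each low-level capsule.

    logits[i][j]: agreement score between part i and object j (hypothetical).
    corr[i][j]:   feature correlation assumed to be gathered beforehand.
    tau:          cohesiveness threshold (hypothetical).
    """
    assignments = []
    for row_logits, row_corr in zip(logits, corr):
        # Only objects whose correlation reaches the threshold may receive votes.
        allowed = [c >= tau for c in row_corr]
        if not any(allowed):
            # No sufficiently correlated object: this part votes for nothing.
            assignments.append([0.0] * len(row_logits))
            continue
        # Subtract the largest allowed logit for numerical stability.
        peak = max(l for l, ok in zip(row_logits, allowed) if ok)
        exps = [math.exp(l - peak) if ok else 0.0
                for l, ok in zip(row_logits, allowed)]
        total = sum(exps)
        assignments.append([e / total for e in exps])
    return assignments


# Two parts, three candidate objects (illustrative numbers only).
logits = [[2.0, 1.0, 0.0], [0.5, 0.5, 3.0]]
corr = [[0.9, 0.8, 0.1], [0.2, 0.7, 0.6]]
r = constrained_assignments(logits, corr, tau=0.5)
```

In this toy setup, part 0 cannot vote for object 2 and part 1 cannot vote for object 0, because those correlations fall below `tau`; the remaining assignment mass is renormalized over the permitted objects.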


Pages: 25-41

Number of pages: 17
