An enhanced SSD with feature fusion and visual reasoning for object detection

Cited by: 55

Authors:

Leng, Jiaxu [1]

Liu, Ying [1]

Affiliations:

[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Campus Yanqi, Beijing 101400, Peoples R China

Source:

NEURAL COMPUTING & APPLICATIONS | 2019 / Vol. 31 / Issue 10

Keywords:

Object detection; Feature fusion; Visual reasoning; Feature maps; CONVOLUTIONAL NETWORKS;

DOI:

10.1007/s00521-018-3486-1

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Single Shot MultiBox Detector (SSD) is one of the top-performing object detection algorithms in terms of both accuracy and speed. SSD achieves impressive performance on various datasets by using different output layers for object detection. However, each layer in the feature pyramid is used independently, and SSD considers only the fine-grained details of objects while ignoring the context surrounding them. In this paper, we propose an enhanced SSD, called ESSD, that improves on the conventional SSD by fusing the feature maps of different output layers instead of growing layers close to the input data. Our method uses two-way transfer of feature information and feature fusion to enhance the network. To further assist object detection, we propose a visual reasoning method that fully exploits the relationships between objects instead of using only the features of the objects themselves. The addition of visual reasoning proves very effective for detecting objects that are too small or have small features. To evaluate the proposed ESSD, we trained the model on the VOC2007 and VOC2012 training sets and evaluated it on the Pascal VOC2007 test set. For 300 × 300 input, ESSD achieved 79.2% mean average precision (mAP) at 52.0 frames per second (FPS); for 512 × 512 input, it achieved 82.4% mAP at 18.6 FPS. These results demonstrate that the proposed method achieves state-of-the-art mAP, outperforming the conventional SSD and other advanced detectors.


Pages: 6549 - 6558

Page count: 10

50 items in total

[31] Small Object Recognition Algorithm of Grain Pests Based on SSD Feature Fusion
Lyu, Zongwang
Jin, Huifang
Zhen, Tong
Sun, Fuyan
Xu, Hui
IEEE ACCESS, 2021, 9 : 43202 - 43213
[32] Visual Relationship Detection with Multimodal Fusion and Reasoning
Xiao, Shouguan
Fu, Weiping
SENSORS, 2022, 22 (20)
[33] Feature Rescaling and Fusion for Tiny Object Detection
Liu, Jingwei
Gu, Yi
Han, Shumin
Zhang, Zhibin
Guo, Jiafeng
Cheng, Xueqi
IEEE ACCESS, 2021, 9 : 62946 - 62955
[34] Research on Target Detection Algorithm Based on Improved SSD Feature Fusion
Ge, Haibo
Li, Qiang
Zhou, Ting
Huang, Chaofeng
Computer Engineering and Applications, 2023, 59 (22) : 193 - 201
[35] Genetic Feature Fusion for Object Skeleton Detection
Qiao, Yang
Tian, Yunjie
Liu, Yue
Jiao, Jianbin
SECURITY AND COMMUNICATION NETWORKS, 2021, 2021 (2021)
[36] Improving Object Detection with Feature Fusion Methods
Cui, Yuning
Shi, Dianxi
Zhang, Yongjun
Sun, Qianchong
Xu, Huachi
Jing, Luoxi
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (07)
[37] Feature fusion for object detection at one map
Xi, Xing
Wu, Yuanqing
Xia, Canming
He, Shenghuang
IMAGE AND VISION COMPUTING, 2022, 123
[38] Adaptive Feature Fusion for Small Object Detection
Zhang, Qi
Zhang, Hongying
Lu, Xiuwen
APPLIED SCIENCES-BASEL, 2022, 12 (22)
[39] Visual Perception based Adaptive Feature Fusion for Visual Object Tracking
Krieger, Evan
Asari, Vijayan K.
2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017 : 1345 - 1350
[40] AEFFNet: Attention Enhanced Feature Fusion Network for Small Object Detection in UAV Imagery
Nian, Zhaoyu
Yang, Wenzhu
Chen, Hao
IEEE ACCESS, 2025, 13 : 26494 - 26505
