An enhanced SSD with feature fusion and visual reasoning for object detection

Cited by: 55

Authors:

Leng, Jiaxu [1]

Liu, Ying [1]

Affiliations:

[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Campus Yanqi, Beijing 101400, Peoples R China

Source:

NEURAL COMPUTING & APPLICATIONS | 2019 / Vol. 31 / Issue 10

Keywords:

Object detection; Feature fusion; Visual reasoning; Feature maps; Convolutional networks

DOI:

10.1007/s00521-018-3486-1

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Single Shot Multibox Detector (SSD) is one of the top performing object detection algorithms in terms of both accuracy and speed. SSD achieves impressive performance on various datasets by using different output layers for object detection. However, each layer in the feature pyramid is used independently, and SSD considers only the fine-grained details of the objects but ignores the context surrounding objects. In this paper, we proposed an enhanced SSD, called ESSD, that improved the performance of the conventional SSD by fusing feature maps of different output layers, instead of growing layers close to the input data. Our method used two-way transfer of feature information and feature fusion to enhance the network. To assist further with object detection, we proposed a visual reasoning method that utilized fully the relationships between objects instead of using only the features of the objects themselves. This addition of visual reasoning proved very effective for detecting objects that are too small or have small features. To evaluate the proposed ESSD, we trained the model with VOC2007 and VOC2012 training sets and evaluated the performance on the Pascal VOC2007 test set. For 300 x 300 input, ESSD achieved 79.2% mean average precision (mAP) at 52.0 frames per second (FPS), and for 512 x 512 input, this approach achieved 82.4% mAP at 18.6 FPS. These results demonstrated that our proposed method can achieve state-of-the-art mAP, which is a better result than provided by the conventional SSD and other advanced detectors.

Pages: 6549-6558

Number of pages: 10

[21] FESSD:SSD target detection based on feature fusion and feature enhancement
Qian, Huaming
Wang, Huilin
Feng, Shuai
Yan, Shuya
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (01)
[23] A small underwater object detection model with enhanced feature extraction and fusion
Li, Tao
Gang, Yijin
Li, Sumin
Shang, Yizi
SCIENTIFIC REPORTS, 2025, 15 (01)
[24] SAFF-SSD: Self-Attention Combined Feature Fusion-Based SSD for Small Object Detection in Remote Sensing
Huo, Bihan
Li, Chenglong
Zhang, Jianwei
Xue, Yingjian
Lin, Zhoujin
REMOTE SENSING, 2023, 15 (12)
[25] A Marine Object Detection Algorithm Based on SSD and Feature Enhancement
Hu, Kai
Lu, Feiyu
Lu, Meixia
Deng, Zhiliang
Liu, Yunping
COMPLEXITY, 2020, 2020
[26] Adaptive feature fusion for visual object tracking
Zhao, Shaochuan
Xu, Tianyang
Wu, Xiao-Jun
Zhu, Xue-Feng
PATTERN RECOGNITION, 2021, 111
[27] Sequential Feature Fusion for Object Detection
Wang, Qiang
Han, Yahong
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 689 - 699
[28] FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection
Cheng, Xu
Wang, Zhixiang
Song, Chen
Yu, Zitong
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 3145 - 3153
[30] A Lightweight Fusion Strategy With Enhanced Interlayer Feature Correlation for Small Object Detection
Xiao, Yao
Xu, Tingfa
Yu, Xin
Fang, Yuqiang
Li, Jianan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62