An enhanced SSD with feature fusion and visual reasoning for object detection

被引:55
|
作者
Leng, Jiaxu [1 ]
Liu, Ying [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Campus Yanqi, Beijing 101400, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2019年 / 31卷 / 10期
关键词
Object detection; Feature fusion; Visual reasoning; Feature maps; CONVOLUTIONAL NETWORKS;
D O I
10.1007/s00521-018-3486-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single Shot Multibox Detector (SSD) is one of the top performing object detection algorithms in terms of both accuracy and speed. SSD achieves impressive performance on various datasets by using different output layers for object detection. However, each layer in the feature pyramid is used independently, and SSD considers only the fine-grained details of the objects but ignores the context surrounding objects. In this paper, we proposed an enhanced SSD, called ESSD, that improved the performance of the conventional SSD by fusing feature maps of different output layers, instead of growing layers close to the input data. Our method used two-way transfer of feature information and feature fusion to enhance the network. To assist further with object detection, we proposed a visual reasoning method that utilized fully the relationships between objects instead of using only the features of the objects themselves. This addition of visual reasoning proved very effective for detecting objects that are too small or have small features. To evaluate the proposed ESSD, we trained the model with VOC2007 and VOC2012 training sets and evaluated the performance on the Pascal VOC2007 test set. For 300 x 300 input, ESSD achieved 79.2% mean average precision (mAP) at 52.0 frames per second (FPS), and for 512 x 512 input, this approach achieved 82.4% mAP at 18.6 FPS. These results demonstrated that our proposed method can achieve state-of-the-art mAP, which is a better result than provided by the conventional SSD and other advanced detectors.
引用
收藏
页码:6549 / 6558
页数:10
相关论文
共 50 条
  • [21] FESSD:SSD target detection based on feature fusion and feature enhancement
    Qian, Huaming
    Wang, Huilin
    Feng, Shuai
    Yan, Shuya
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (01)
  • [22] FESSD:SSD target detection based on feature fusion and feature enhancement
    Huaming Qian
    Huilin Wang
    Shuai Feng
    Shuya Yan
    Journal of Real-Time Image Processing, 2023, 20
  • [23] A small underwater object detection model with enhanced feature extraction and fusion
    Li, Tao
    Gang, Yijin
    Li, Sumin
    Shang, Yizi
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [24] SAFF-SSD: Self-Attention Combined Feature Fusion-Based SSD for Small Object Detection in Remote Sensing
    Huo, Bihan
    Li, Chenglong
    Zhang, Jianwei
    Xue, Yingjian
    Lin, Zhoujin
    REMOTE SENSING, 2023, 15 (12)
  • [25] A Marine Object Detection Algorithm Based on SSD and Feature Enhancement
    Hu, Kai
    Lu, Feiyu
    Lu, Meixia
    Deng, Zhiliang
    Liu, Yunping
    COMPLEXITY, 2020, 2020
  • [26] Adaptive feature fusion for visual object tracking
    Zhao, Shaochuan
    Xu, Tianyang
    Wu, Xiao-Jun
    Zhu, Xue-Feng
    PATTERN RECOGNITION, 2021, 111
  • [27] Sequential Feature Fusion for Object Detection
    Wang, Qiang
    Han, Yahong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 689 - 699
  • [28] FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection
    Cheng, Xu
    Wang, Zhixiang
    Song, Chen
    Yu, Zitong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 3145 - 3153
  • [29] FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection
    Xu Cheng
    Zhixiang Wang
    Chen Song
    Zitong Yu
    Signal, Image and Video Processing, 2023, 17 : 3145 - 3153
  • [30] A Lightweight Fusion Strategy With Enhanced Interlayer Feature Correlation for Small Object Detection
    Xiao, Yao
    Xu, Tingfa
    Yu, Xin
    Fang, Yuqiang
    Li, Jianan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62