An enhanced SSD with feature fusion and visual reasoning for object detection

被引:55
|
作者
Leng, Jiaxu [1 ]
Liu, Ying [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Campus Yanqi, Beijing 101400, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2019年 / 31卷 / 10期
关键词
Object detection; Feature fusion; Visual reasoning; Feature maps; CONVOLUTIONAL NETWORKS;
D O I
10.1007/s00521-018-3486-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single Shot Multibox Detector (SSD) is one of the top performing object detection algorithms in terms of both accuracy and speed. SSD achieves impressive performance on various datasets by using different output layers for object detection. However, each layer in the feature pyramid is used independently, and SSD considers only the fine-grained details of the objects but ignores the context surrounding objects. In this paper, we proposed an enhanced SSD, called ESSD, that improved the performance of the conventional SSD by fusing feature maps of different output layers, instead of growing layers close to the input data. Our method used two-way transfer of feature information and feature fusion to enhance the network. To assist further with object detection, we proposed a visual reasoning method that utilized fully the relationships between objects instead of using only the features of the objects themselves. This addition of visual reasoning proved very effective for detecting objects that are too small or have small features. To evaluate the proposed ESSD, we trained the model with VOC2007 and VOC2012 training sets and evaluated the performance on the Pascal VOC2007 test set. For 300 x 300 input, ESSD achieved 79.2% mean average precision (mAP) at 52.0 frames per second (FPS), and for 512 x 512 input, this approach achieved 82.4% mAP at 18.6 FPS. These results demonstrated that our proposed method can achieve state-of-the-art mAP, which is a better result than provided by the conventional SSD and other advanced detectors.
引用
收藏
页码:6549 / 6558
页数:10
相关论文
共 50 条
  • [31] Small Object Recognition Algorithm of Grain Pests Based on SSD Feature Fusion
    Lyu, Zongwang
    Jin, Huifang
    Zhen, Tong
    Sun, Fuyan
    Xu, Hui
    IEEE ACCESS, 2021, 9 : 43202 - 43213
  • [32] Visual Relationship Detection with Multimodal Fusion and Reasoning
    Xiao, Shouguan
    Fu, Weiping
    SENSORS, 2022, 22 (20)
  • [33] Feature Rescaling and Fusion for Tiny Object Detection
    Liu, Jingwei
    Gu, Yi
    Han, Shumin
    Zhang, Zhibin
    Guo, Jiafeng
    Cheng, Xueqi
    IEEE ACCESS, 2021, 9 : 62946 - 62955
  • [34] Research on Target Detection Algorithm Based on Improved SSD Feature Fusion
    Ge, Haibo
    Li, Qiang
    Zhou, Ting
    Huang, Chaofeng
    Computer Engineering and Applications, 2023, 59 (22) : 193 - 201
  • [35] Genetic Feature Fusion for Object Skeleton Detection
    Qiao, Yang
    Tian, Yunjie
    Liu, Yue
    Jiao, Jianbin
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021 (2021)
  • [36] Improving Object Detection with Feature Fusion Methods
    Cui, Yuning
    Shi, Dianxi
    Zhang, Yongjun
    Sun, Qianchong
    Xu, Huachi
    Jing, Luoxi
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (07)
  • [37] Feature fusion for object detection at one map
    Xi, Xing
    Wu, Yuanqing
    Xia, Canming
    He, Shenghuang
    IMAGE AND VISION COMPUTING, 2022, 123
  • [38] Adaptive Feature Fusion for Small Object Detection
    Zhang, Qi
    Zhang, Hongying
    Lu, Xiuwen
    APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [39] Visual Perception based Adaptive Feature Fusion for Visual Object Tracking
    Krieger, Evan
    Asari, Vijayan K.
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1345 - 1350
  • [40] AEFFNet: Attention Enhanced Feature Fusion Network for Small Object Detection in UAV Imagery
    Nian, Zhaoyu
    Yang, Wenzhu
    Chen, Hao
    IEEE ACCESS, 2025, 13 : 26494 - 26505