An enhanced SSD with feature fusion and visual reasoning for object detection

Cited by: 55
Authors
Leng, Jiaxu [1]
Liu, Ying [1]
Affiliations
[1] Univ Chinese Acad Sci, Sch Comp & Control Engn, Campus Yanqi, Beijing 101400, Peoples R China
Source
NEURAL COMPUTING & APPLICATIONS | 2019 / Vol. 31 / Issue 10
Keywords
Object detection; Feature fusion; Visual reasoning; Feature maps; Convolutional networks
DOI
10.1007/s00521-018-3486-1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Single Shot MultiBox Detector (SSD) is one of the top-performing object detection algorithms in terms of both accuracy and speed. SSD achieves impressive performance on various datasets by using different output layers for object detection. However, each layer in the feature pyramid is used independently, and SSD considers only the fine-grained details of objects while ignoring the context surrounding them. In this paper, we propose an enhanced SSD, called ESSD, that improves on the conventional SSD by fusing the feature maps of different output layers rather than by adding layers close to the input. Our method uses two-way transfer of feature information together with feature fusion to enhance the network. To further assist object detection, we propose a visual reasoning method that fully exploits the relationships between objects instead of relying only on the features of the objects themselves. Adding visual reasoning proved very effective for detecting objects that are too small or have small features. To evaluate ESSD, we trained the model on the VOC2007 and VOC2012 training sets and evaluated it on the Pascal VOC2007 test set. For 300 × 300 input, ESSD achieved 79.2% mean average precision (mAP) at 52.0 frames per second (FPS); for 512 × 512 input, it achieved 82.4% mAP at 18.6 FPS. These results show that the proposed method achieves state-of-the-art mAP, outperforming the conventional SSD and other advanced detectors.
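The idea of two-way feature transfer between output layers can be illustrated with a minimal sketch. This is not the ESSD architecture: the abstract does not specify the fusion operators, so the nearest-neighbour upsampling, 2x2 max pooling, and element-wise addition used here are assumptions, and the names `upsample2x`, `maxpool2x`, and `fuse_two_way` are hypothetical. The sketch uses single-channel maps stored as plain Python lists.

```python
from typing import List, Tuple

FeatureMap = List[List[float]]  # single-channel map, rows x columns


def upsample2x(fm: FeatureMap) -> FeatureMap:
    """Nearest-neighbour upsampling: every cell becomes a 2x2 block."""
    out: FeatureMap = []
    for row in fm:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def maxpool2x(fm: FeatureMap) -> FeatureMap:
    """2x2 max pooling with stride 2 (assumes even height and width)."""
    return [
        [max(fm[r][c], fm[r][c + 1], fm[r + 1][c], fm[r + 1][c + 1])
         for c in range(0, len(fm[0]), 2)]
        for r in range(0, len(fm), 2)
    ]


def add(a: FeatureMap, b: FeatureMap) -> FeatureMap:
    """Element-wise sum of two maps with identical shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def fuse_two_way(fine: FeatureMap, coarse: FeatureMap) -> Tuple[FeatureMap, FeatureMap]:
    """Assumed fusion scheme: pass information in both directions.

    The coarse map is upsampled and added to the fine map (context flows
    down), and the fine map is pooled and added to the coarse map (detail
    flows up).
    """
    fused_fine = add(fine, upsample2x(coarse))
    fused_coarse = add(coarse, maxpool2x(fine))
    return fused_fine, fused_coarse


fine = [[1.0, 2.0, 3.0, 4.0],
        [5.0, 6.0, 7.0, 8.0],
        [9.0, 10.0, 11.0, 12.0],
        [13.0, 14.0, 15.0, 16.0]]
coarse = [[0.5, 1.5],
          [2.5, 3.5]]

fused_fine, fused_coarse = fuse_two_way(fine, coarse)
print(fused_fine[0])   # [1.5, 2.5, 4.5, 5.5]
print(fused_coarse)    # [[6.5, 9.5], [16.5, 19.5]]
```

After fusion, each map keeps its own resolution but carries information from the other scale, which is the general property the abstract attributes to fusing the output layers.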
Pages: 6549 - 6558
Number of pages: 10