An improved object detection algorithm based on multi-scaled and deformable convolutional neural networks

被引:55
|
作者
Cao, Danyang [1 ,2 ]
Chen, Zhixin [1 ]
Gao, Lei [1 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
[2] Beijing Key Lab Integrat & Anal Large Scale Strea, Beijing 100144, Peoples R China
基金
北京市自然科学基金;
关键词
Object detection; Machine learning; AI; Deformable convolution; Computer vision; FUSION;
D O I
10.1186/s13673-020-00219-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection methods aim to identify all target objects in the target image and determine the categories and position information in order to achieve machine vision understanding. Numerous approaches have been proposed to solve this problem, mainly inspired by methods of computer vision and deep learning. However, existing approaches always perform poorly for the detection of small, dense objects, and even fail to detect objects with random geometric transformations. In this study, we compare and analyse mainstream object detection algorithms and propose a multi-scaled deformable convolutional object detection network to deal with the challenges faced by current methods. Our analysis demonstrates a strong performance on par, or even better, than state of the art methods. We use deep convolutional networks to obtain multi-scaled features, and add deformable convolutional structures to overcome geometric transformations. We then fuse the multi-scaled features by up sampling, in order to implement the final object recognition and region regress. Experiments prove that our suggested framework improves the accuracy of detecting small target objects with geometric deformation, showing significant improvements in the trade-off between accuracy and speed.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] DeepID-Net: Object Detection with Deformable Part Based Convolutional Neural Networks
    Ouyang, Wanli
    Zeng, Xingyu
    Wang, Xiaogang
    Qiu, Shi
    Luo, Ping
    Tian, Yonglong
    Li, Hongsheng
    Yang, Shuo
    Wang, Zhe
    Li, Hongyang
    Wang, Kun
    Yan, Junjie
    Loy, Chen-Change
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (07) : 1320 - 1334
  • [2] DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection
    Ouyang, Wanli
    Wang, Xiaogang
    Zeng, Xingyu
    Qiu, Shi
    Luo, Ping
    Tian, Yonglong
    Li, Hongsheng
    Yang, Shuo
    Wang, Zhe
    Loy, Chen-Change
    Tang, Xiaoou
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 2403 - 2412
  • [3] Object Recognition Algorithm Based on an Improved Convolutional Neural Network
    Zheyi Fan
    Yu Song
    Wei Li
    [J]. Journal of Beijing Institute of Technology, 2020, 29 (02) : 139 - 145
  • [4] Towards a fast and accurate road object detection algorithm based on convolutional neural networks
    Zhang, Qinghui
    Wan, Chenxia
    Han, Weiliang
    Bian, Shanfeng
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [5] Object Recognition Algorithm Based on an Improved Convolutional Neural Network
    Fan, Zheyi
    Song, Yu
    Li, Wei
    [J]. Journal of Beijing Institute of Technology (English Edition), 2020, 29 (02): : 139 - 145
  • [6] Improved Object Detection With Iterative Localization Refinement in Convolutional Neural Networks
    Cheng, Kai-Wen
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (09) : 2261 - 2275
  • [7] ITERATIVE LOCALIZATION REFINEMENT IN CONVOLUTIONAL NEURAL NETWORKS FOR IMPROVED OBJECT DETECTION
    Cheng, Kai-Wen
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3643 - 3647
  • [8] Cloud Detection and Tracking Based on Object Detection with Convolutional Neural Networks
    Carballo, Jose Antonio
    Bonilla, Javier
    Fernandez-Reche, Jesus
    Nouri, Bijan
    Avila-Marin, Antonio
    Fabel, Yann
    Alarcon-Padilla, Diego-Cesar
    [J]. ALGORITHMS, 2023, 16 (10)
  • [9] Convolutional Neural Networks-Based Object Detection Algorithm by Jointing Semantic Segmentation for Images
    Qiang, Baohua
    Chen, Ruidong
    Zhou, Mingliang
    Pang, Yuanchao
    Zhai, Yijie
    Yang, Minghao
    [J]. SENSORS, 2020, 20 (18) : 1 - 14
  • [10] Face Detection Based on Improved Multi-task Cascaded Convolutional Neural Networks
    Jia, Siyu
    Tian, Ying
    [J]. IAENG International Journal of Computer Science, 2024, 51 (02) : 67 - 74