A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

被引:1074
|
作者
Cai, Zhaowei [1 ]
Fan, Quanfu [2 ]
Feris, Rogerio S. [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
基金
美国国家科学基金会;
关键词
Object detection; Multi-scale; Unified neural network;
D O I
10.1007/978-3-319-46493-0_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
引用
收藏
页码:354 / 370
页数:17
相关论文
共 50 条
  • [1] Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection
    Song, Kaiyou
    Yang, Hua
    Yin, Zhouping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2972 - 2985
  • [2] Multi-scale deep neural network for salient object detection
    Xiao, Fen
    Deng, Wenzheng
    Peng, Liangchan
    Cao, Chunhong
    Hu, Kai
    Gao, Xieping
    IET IMAGE PROCESSING, 2018, 12 (11) : 2036 - 2041
  • [3] Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images
    Zhang R.
    Shao Z.
    Aleksei P.
    Wang J.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2020, 45 (06): : 895 - 903
  • [4] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Wang, Xiaowei
    Yin, Shoulin
    Li, Hang
    Wang, Jiachi
    Teng, Lin
    INTERNATIONAL JOURNAL OF WIRELESS INFORMATION NETWORKS, 2020, 27 (04) : 503 - 517
  • [5] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Xiaowei Wang
    Shoulin Yin
    Hang Li
    Jiachi Wang
    Lin Teng
    International Journal of Wireless Information Networks, 2020, 27 : 503 - 517
  • [6] MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection
    Ma, Wenchi
    Wu, Yuanwei
    Wang, Zongbo
    Wang, Guanghui
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2510 - 2515
  • [7] A Deep Multi-scale Convolutional Neural Network for Classifying Heartbeats
    Bai, Mengyao
    Xu, Yongjun
    Wang, Lianyan
    Wei, Zhihui
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [8] A Multi-Scale Fusion Convolutional Neural Network for Face Detection
    Chen, Qiaosong
    Meng, Xiaomin
    Li, Wen
    Fu, Xingyu
    Deng, Xin
    Wang, Jin
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1013 - 1018
  • [9] Transferring scale-independent features to support multi-scale object recognition with deep convolutional neural network
    Zhou, Xiran
    26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 614 - 615
  • [10] Multi-scale face detection based on convolutional neural network
    Luo, Mingzhu
    Xiao, Yewei
    Zhou, Yan
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1752 - 1757