A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

Cited by: 1047
Authors
Cai, Zhaowei [1]
Fan, Quanfu [2]
Feris, Rogerio S. [2]
Vasconcelos, Nuno [1]
Affiliations
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
Source
Funding
U.S. National Science Foundation
Keywords
Object detection; Multi-scale; Unified neural network;
DOI
10.1007/978-3-319-46493-0_22
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
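The two mechanisms named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. The branch names, strides, and anchor sizes are hypothetical values chosen for illustration, and the fixed 2x2 kernel stands in for deconvolution weights that would be learned in a real network. `assign_branch` picks the output layer whose nominal scale is closest to an object's size, and `transposed_conv2d` upsamples a feature map instead of the input image.

```python
import math

# Hypothetical detection branches: (name, feature stride, nominal anchor size in px).
# Illustrative assumptions only, not the configuration reported in the paper.
BRANCHES = [
    ("branch_s8", 8, 40),
    ("branch_s16", 16, 80),
    ("branch_s32", 32, 160),
    ("branch_s64", 64, 320),
]


def assign_branch(object_size):
    """Return the name of the branch whose anchor size is closest in log scale."""
    best = min(BRANCHES, key=lambda b: abs(math.log2(object_size / b[2])))
    return best[0]


def transposed_conv2d(x, kernel, stride):
    """Plain 2-D transposed convolution (deconvolution) on nested lists.

    Each input value is multiplied by the kernel and accumulated into the
    output at an offset of `stride` times its position.
    """
    h, w = len(x), len(x[0])
    k = len(kernel)
    out_h = (h - 1) * stride + k
    out_w = (w - 1) * stride + k
    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(h):
        for j in range(w):
            for a in range(k):
                for b in range(k):
                    out[i * stride + a][j * stride + b] += x[i][j] * kernel[a][b]
    return out


if __name__ == "__main__":
    for size in (50, 100, 300):
        print(size, "->", assign_branch(size))
    # A 2x2 kernel of ones with stride 2 doubles the feature-map resolution.
    features = [[1.0, 2.0], [3.0, 4.0]]
    for row in transposed_conv2d(features, [[1.0, 1.0], [1.0, 1.0]], 2):
        print(row)
```

With a kernel of ones and stride 2, the transposed convolution reduces to nearest-neighbour upsampling. A learned kernel generalises this, which is why upsampling features can substitute for upsampling the much larger input image.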
Pages: 354-370
Page count: 17
Related papers
50 in total
  • [1] Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection
    Song, Kaiyou
    Yang, Hua
    Yin, Zhouping
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2972 - 2985
  • [2] Multi-scale deep neural network for salient object detection
    Xiao, Fen
    Deng, Wenzheng
    Peng, Liangchan
    Cao, Chunhong
    Hu, Kai
    Gao, Xieping
    [J]. IET IMAGE PROCESSING, 2018, 12 (11) : 2036 - 2041
  • [3] Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images
    Zhang, Ruiqian
    Shao, Zhenfeng
    Aleksei, Portnov
    Wang, Jiaming
    [J]. Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2020, 45 (06) : 895 - 903
  • [4] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Wang, Xiaowei
    Yin, Shoulin
    Li, Hang
    Wang, Jiachi
    Teng, Lin
    [J]. INTERNATIONAL JOURNAL OF WIRELESS INFORMATION NETWORKS, 2020, 27 (04) : 503 - 517
  • [5] A Network Intrusion Detection Method Based on Deep Multi-scale Convolutional Neural Network
    Xiaowei Wang
    Shoulin Yin
    Hang Li
    Jiachi Wang
    Lin Teng
    [J]. International Journal of Wireless Information Networks, 2020, 27 : 503 - 517
  • [6] MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection
    Ma, Wenchi
    Wu, Yuanwei
    Wang, Zongbo
    Wang, Guanghui
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2510 - 2515
  • [7] A Deep Multi-scale Convolutional Neural Network for Classifying Heartbeats
    Bai, Mengyao
    Xu, Yongjun
    Wang, Lianyan
    Wei, Zhihui
    [J]. 2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [8] A Multi-Scale Fusion Convolutional Neural Network for Face Detection
    Chen, Qiaosong
    Meng, Xiaomin
    Li, Wen
    Fu, Xingyu
    Deng, Xin
    Wang, Jin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1013 - 1018
  • [9] Transferring scale-independent features to support multi-scale object recognition with deep convolutional neural network
    Zhou, Xiran
    [J]. 26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 614 - 615
  • [10] Multi-scale face detection based on convolutional neural network
    Luo, Mingzhu
    Xiao, Yewei
    Zhou, Yan
    [J]. 2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1752 - 1757