A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

被引:1047
|
作者
Cai, Zhaowei [1 ]
Fan, Quanfu [2 ]
Feris, Rogerio S. [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
来源
基金
美国国家科学基金会;
关键词
Object detection; Multi-scale; Unified neural network;
D O I
10.1007/978-3-319-46493-0_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
引用
收藏
页码:354 / 370
页数:17
相关论文
共 50 条
  • [31] PulseID: Multi-scale photoplethysmographic identification using a deep convolutional neural network
    Wei, Riling
    Xu, Xiaogang
    Li, Yue
    Zhang, Yiyi
    Wang, Jun
    Chen, Hanjie
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [32] Object Recognition, Localization and Grasp Detection Using a Unified Deep Convolutional Neural Network with Multi-task Loss
    Jia, Qun
    Cao, Zhiqiang
    Zhao, Xionglei
    Pang, Lei
    Yu, Yingying
    Yu, Junzhi
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1557 - 1562
  • [33] Novel Approach in Vegetation Detection Using Multi-Scale Convolutional Neural Network
    Albalooshi, Fatema A.
    [J]. Applied Sciences (Switzerland), 2024, 14 (22):
  • [34] AN IMPROVED MULTI-SCALE FIRE DETECTION METHOD BASED ON CONVOLUTIONAL NEURAL NETWORK
    Huang Hongyu
    Kuang Ping
    Li Fan
    Shi Huaxin
    [J]. 2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 109 - 112
  • [35] Pedestrian Detection via Multi-scale Feature Fusion Convolutional Neural Network
    Guo, Aixin
    Yin, Baoqun
    Zhang, Jing
    Yao, Jinfa
    [J]. 2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 1364 - 1368
  • [36] An efficient fire detection algorithm based on multi-scale convolutional neural network
    Cheng, Yanying
    Chen, Ke
    Bai, Hui
    Mou, Chunjie
    Zhang, Yuchun
    Yang, Kai
    Gao, Yunji
    Liu, Yu
    [J]. FIRE AND MATERIALS, 2022, 46 (07) : 981 - 992
  • [37] Multi-level and multi-scale deep saliency network for salient object detection
    Zhang, Qing
    Lin, Jiajun
    Zhuge, Jingling
    Yuan, Wenhao
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 415 - 424
  • [38] MRMNet: Multi-scale residual multi-branch neural network for object detection
    Dong, Yongsheng
    Liu, Yafeng
    Li, Xuelong
    [J]. NEUROCOMPUTING, 2024, 596
  • [39] Multi-scale convolutional neural network for texture recognition
    Wei, Xile
    Hu, Benyong
    Gao, Tianshi
    Wang, Jiang
    Deng, Bin
    [J]. DISPLAYS, 2022, 75
  • [40] A multi-scale convolutional neural network for heartbeat classification
    Zheng, Lesong
    Zhang, Miao
    Qiu, Lishen
    Ma, Gang
    Zhu, Wenliang
    Wang, Lirong
    [J]. 2021 IEEE 20TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2021), 2021, : 1488 - 1492