CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

被引:995
|
作者
Li, Yuhong [1 ,2 ]
Zhang, Xiaofan [1 ]
Chen, Deming [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR.2018.00120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a network for Congested Scene Recognition called CSRNet to provide a data-driven and deep learning method that can understand highly congested scenes and perform accurate count estimation as well as present high-quality density maps. The proposed CSRNet is composed of two major components: a convolutional neural network (CNN) as the front-end for 2D feature extraction and a dilated CNN for the back-end, which uses dilated kernels to deliver larger reception fields and to replace pooling operations. CSRNet is an easy-trained model because of its pure convolutional structure. We demonstrate CSRNet on four datasets (ShanghaiTech dataset, the UCF CC 50 dataset, the WorldEXPO' 10 dataset, and the UCSD dataset) and we deliver the state-of-the-art performance. In the ShanghaiTech Part B dataset, CSRNet achieves 47.3% lower Mean Absolute Error (MAE) than the previous state-of-theart method. We extend the targeted applications for counting other objects, such as the vehicle in TRANCOS dataset. Results show that CSRNet significantly improves the output quality with 15.4% lower MAE than the previous state-of-the- art approach.
引用
收藏
页码:1091 / 1100
页数:10
相关论文
共 50 条
  • [21] A Dilated Convolutional Neural Network for Cross-Layers of Contextual Information for Congested Crowd Counting
    Zhao, Zhiqiang
    Ma, Peihong
    Jia, Meng
    Wang, Xiaofan
    Hei, Xinhong
    [J]. SENSORS, 2024, 24 (06)
  • [22] Intelligent Fault Detection via Dilated Convolutional Neural Networks
    Khan, Mohammad Azam
    Kim, Yong-Hwa
    Choo, Jaegul
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 729 - 731
  • [23] DILATED CONVOLUTIONAL NEURAL NETWORKS FOR PANORAMIC IMAGE SALIENCY PREDICTION
    Dai, Feng
    Zhang, Youqiang
    Ma, Yike
    Li, Hongliang
    Zhao, Qiang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2558 - 2562
  • [24] Self-paced hybrid dilated convolutional neural networks
    Wenzhen Zhang
    Guangquan Lu
    Shichao Zhang
    Yonggang Li
    [J]. Multimedia Tools and Applications, 2022, 81 : 34169 - 34181
  • [25] Self-paced hybrid dilated convolutional neural networks
    Zhang, Wenzhen
    Lu, Guangquan
    Zhang, Shichao
    Li, Yonggang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (24) : 34169 - 34181
  • [26] Multiscale Dilated Convolutional Neural Networks for Transient Electromagnetic Inversion
    Cheng, Kai
    Yang, Xiaodong
    Wu, Xiaoping
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [27] Towards Understanding the Invertibility of Convolutional Neural Networks
    Gilbert, Anna C.
    Zhang, Yi
    Lee, Kibok
    Zhang, Yuting
    Lee, Honglak
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1703 - 1710
  • [28] Understanding Convolutional Neural Networks From Excitations
    Ying, Zijian
    Li, Qianmu
    Lian, Zhichao
    Hou, Jun
    Lin, Tong
    Wang, Tao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [29] Understanding convolutional neural networks with a mathematical model
    Kuo, C. -C. Jay
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2016, 41 : 406 - 413
  • [30] Congested crowd instance localization with dilated convolutional swin transformer
    Gao, Junyu
    Gong, Maoguo
    Li, Xuelong
    [J]. NEUROCOMPUTING, 2022, 513 : 94 - 103