CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

被引:995
|
作者
Li, Yuhong [1 ,2 ]
Zhang, Xiaofan [1 ]
Chen, Deming [1 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR.2018.00120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a network for Congested Scene Recognition called CSRNet to provide a data-driven and deep learning method that can understand highly congested scenes and perform accurate count estimation as well as present high-quality density maps. The proposed CSRNet is composed of two major components: a convolutional neural network (CNN) as the front-end for 2D feature extraction and a dilated CNN for the back-end, which uses dilated kernels to deliver larger reception fields and to replace pooling operations. CSRNet is an easy-trained model because of its pure convolutional structure. We demonstrate CSRNet on four datasets (ShanghaiTech dataset, the UCF CC 50 dataset, the WorldEXPO' 10 dataset, and the UCSD dataset) and we deliver the state-of-the-art performance. In the ShanghaiTech Part B dataset, CSRNet achieves 47.3% lower Mean Absolute Error (MAE) than the previous state-of-theart method. We extend the targeted applications for counting other objects, such as the vehicle in TRANCOS dataset. Results show that CSRNet significantly improves the output quality with 15.4% lower MAE than the previous state-of-the- art approach.
引用
收藏
页码:1091 / 1100
页数:10
相关论文
共 50 条
  • [1] LCNNet: Light-weight convolutional neural networks for understanding the highly congested scenes
    Wang, Renjie
    Tan, Fei
    Yang, Kunlong
    Hao, Yuwen
    Li, Fengguo
    Yu, Xiaoyuan
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 1991 - 2004
  • [2] Fully Convolutional Crowd Counting on Highly Congested Scenes
    Marsden, Mark
    McGuinness, Kevin
    Little, Suzanne
    O'Connor, Noel E.
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, : 27 - 33
  • [3] Quantum Dilated Convolutional Neural Networks
    Chen, Yixiong
    [J]. IEEE ACCESS, 2022, 10 : 20240 - 20246
  • [4] Deeper multi-column dilated convolutional network for congested crowd understanding
    Yan, Leilei
    Zhang, Li
    Zheng, Xiaohan
    Li, Fanzhang
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 1407 - 1422
  • [5] Deeper multi-column dilated convolutional network for congested crowd understanding
    Yan, Leilei
    Zhang, Li
    Zheng, Xiaohan
    Li, Fanzhang
    [J]. Neural Computing and Applications, 2022, 34 (02) : 1407 - 1422
  • [6] Deeper multi-column dilated convolutional network for congested crowd understanding
    Leilei Yan
    Li Zhang
    Xiaohan Zheng
    Fanzhang Li
    [J]. Neural Computing and Applications, 2022, 34 : 1407 - 1422
  • [7] Towards understanding residual and dilated dense neural networks via convolutional sparse coding
    Zhang, Zhiyang
    Zhang, Shihua
    [J]. NATIONAL SCIENCE REVIEW, 2021, 8 (03)
  • [8] Towards understanding residual and dilated dense neural networks via convolutional sparse coding
    Zhiyang Zhang
    Shihua Zhang
    [J]. National Science Review, 2021, 8 (03) : 127 - 139
  • [9] SCFFNet: Spatial Context Feature Fusion Network for Understanding the Highly Congested Scenes
    Xiong, Liyan
    Yi, Hu
    Huang, Xiaohui
    Huang, Weichun
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [10] ATTENTION-BASED ATROUS CONVOLUTIONAL NEURAL NETWORKS: VISUALISATION AND UNDERSTANDING PERSPECTIVES OF ACOUSTIC SCENES
    Ren, Zhao
    Kong, Qiuqiang
    Han, Jing
    Plumbley, Mark D.
    Schuller, Bjoern W.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 56 - 60