Crowd density estimation for fisheye images

被引:0
|
作者
Yang J. [1 ]
Lin C. [1 ]
Nie L. [1 ]
Liu M. [1 ]
Zhao Y. [1 ]
机构
[1] Institute of Information Science, Beijing Jiaotong University, Beijing
基金
中国国家自然科学基金;
关键词
crowd estimation; distortion processing; distribution matching; fisheye image; fisheye image dataset;
D O I
10.13700/j.bh.1001-5965.2021.0520
中图分类号
O212 [数理统计];
学科分类号
摘要
Aiming at the problem that the traditional crowd density estimation methods are not applicable under the distortion of fisheye images, this paper presents a crowd density estimation method for fisheye images, which realizes the monitoring of human traffic in scene of using fisheye lens. For model structure, we introduced deformable convolution to improve the adaptability of the model to fisheye distortion. For generating the training targets, we used Gaussian transform to perform a distribution match on the density maps of annotations, which depends on the features of fisheye distortion. For training, we optimized the loss function to avoid the model from falling into local optimal solutions. In addition, we collected and labeled the corresponding dataset due to the lack of dataset for fisheye crowd estimation. At last, by comparing the subjective and objective experiments with classical algorithms, we proved the superiority of the crowd estimation method for fisheye images in this paper with the mean absolute error of 3.78 in the test dataset, which is lower than others. © 2022 Beijing University of Aeronautics and Astronautics (BUAA). All rights reserved.
引用
收藏
页码:1455 / 1463
页数:8
相关论文
共 16 条
  • [1] DAI J F, QI H Z, XIONG Y W, Et al., Deformable convolutional networks [C], 2017 IEEE International Conference on Computer Vision, pp. 164-113, (2017)
  • [2] XUE Z C, XUE N, XIA G S, Et al., Learning to calibrate straight lines for fisheye image rectification, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1643-1651, (2019)
  • [3] LIAO K, LIN C, ZHAO Y, A deep ordinal distortion estimation approach for distortion rectification [J], IEEE Transactions on Image Processing, 30, pp. 3362-3375, (2021)
  • [4] XU J, YANG H B, SONG Y, Et al., A face recognition technology based on fisheye camera [J], Information & Communications, 31, 1, pp. 131-132, (2018)
  • [5] ZHANG Y Y, ZHOU D S, CHEN S Q, Et al., Single-image crowd counting via multi-column convolutional neural network, 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 589-597, (2016)
  • [6] LI Y H, ZHANG X F, CHEN D M., CSRNet: Dilated convolutional neural networks for understanding the highly congested scenes, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1091-1100, (2018)
  • [7] SIMONYAN K, ZISSERMAN A., Very deep convolutional networks for large-scale image recognition
  • [8] GUO D, LI K, ZHA Z J, Et al., DADNet: Dilated-attention-deformable ConvNet for crowd counting [C], Proceedings of the 27th ACM International Conference on Multimedia, pp. 1823-1832, (2019)
  • [9] ZHANG A R, SHEN J Y, XIAO Z H, Et al., Relational attention network for crowd counting, 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 6787-6796, (2019)
  • [10] WANG Q, BRECKON T P., Crowd counting via segmentation guided attention networks and curriculum loss