Multi-scale dilated convolution of feature fusion network for crowd counting

Cited by: 0
Authors
Donghua Liu
Guodong Wang
Guangtao Zhai
Affiliations
[1] Qingdao University, College of Computer Science and Technology
[2] Shanghai Jiao Tong University, Institute of Image Communication and Network Engineering
Keywords
Crowd counting; Convolutional neural network; Dilated convolution; Feature fusion
DOI
Not available
Abstract
Crowd counting has long been a challenging task because of perspective distortion and variability in head size. Previous methods either ignore the multi-scale information in images or simply use convolutions with different kernel sizes to extract multi-scale features, so the extracted multi-scale features are incomplete. In this paper, we propose a crowd counting model based on a convolutional neural network (CNN), called the Multi-scale Dilated Convolution of Feature Fusion Network (MsDFNet). MsDFNet follows the density map regression approach: the density map is predicted from parameters learned by the CNN to obtain better prediction results. The proposed network has three main components: a CNN that extracts low-level features, a multi-scale dilated convolution module with multi-column feature fusion blocks, and a density map regression module. Multi-scale dilated convolutions are employed to extract multi-scale high-level features, and the features extracted from different columns are fused. Combining the multi-scale dilated convolution module with the multi-column feature fusion blocks extracts more complete multi-scale features and improves the counting of small targets. Experiments show that fusing multi-scale context feature information effectively addresses the problem of varying head sizes in images. We demonstrate the effectiveness of our method on two public datasets (the ShanghaiTech dataset and the UCF_CC_50 dataset). Compared with previous state-of-the-art crowd counting algorithms in terms of MAE (Mean Absolute Error) and MSE (Mean Square Error), our method significantly improves performance, especially when head sizes vary. On the UCF_CC_50 dataset, our method reduces the MAE by 28.6 compared with the previous state-of-the-art method (the lower the MAE, the better the performance).
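As a rough illustration of the evaluation described in the abstract, the sketch below shows how a count can be read off a predicted density map (by summing its values) and how MAE and MSE compare predicted counts with ground-truth counts. It is not the authors' code: the function names, the toy density maps, and the `root` option are assumptions made for this example. MSE is computed here as the mean of squared errors, as the abstract names it; many crowd counting papers report the square root of that value instead, so the flag is provided for that case.

```python
from math import sqrt


def count_from_density(density_map):
    """Estimated crowd count: the sum of all values in a 2D density map."""
    return sum(sum(row) for row in density_map)


def mae(pred_counts, true_counts):
    """Mean Absolute Error between predicted and ground-truth counts."""
    n = len(true_counts)
    return sum(abs(p - t) for p, t in zip(pred_counts, true_counts)) / n


def mse(pred_counts, true_counts, root=False):
    """Mean Square Error between counts.

    With root=True the square root is returned, which is the variant
    many crowd counting papers report under the name "MSE" (assumption:
    the abstract does not say which variant it uses).
    """
    n = len(true_counts)
    mean_sq = sum((p - t) ** 2 for p, t in zip(pred_counts, true_counts)) / n
    return sqrt(mean_sq) if root else mean_sq


# Toy data (hypothetical): two tiny "density maps" and their true counts.
density_maps = [
    [[0.5, 0.5], [1.0, 0.0]],  # sums to 2.0
    [[1.0, 1.0], [1.0, 2.0]],  # sums to 5.0
]
true_counts = [3, 4]

pred_counts = [count_from_density(d) for d in density_maps]
print("predicted counts:", pred_counts)
print("MAE:", mae(pred_counts, true_counts))
print("MSE:", mse(pred_counts, true_counts))
```

Lower values of both metrics indicate better counting accuracy, which is the sense in which the abstract reports a reduction in MAE on UCF_CC_50.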
Pages: 37939 - 37952
Number of pages: 13
Related papers
50 in total
  • [31] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
    Ke Li
    ZhanDong Liu
    Multimedia Tools and Applications, 2023, 82 : 34679 - 34702
  • [32] Multi-level feature fusion network for crowd counting
    Wang, Luyang
    Li, Yun
    Peng, Sifan
    Tang, Xiao
    Yin, Baoqun
    IET COMPUTER VISION, 2021, 15 (01) : 60 - 72
  • [33] A Multi-scale Dilated Residual Convolution Network for Image Denoising
    Jia, Xinlei
    Peng, Yali
    Ge, Bao
    Li, Jun
    Liu, Shigang
    Wang, Wenan
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1231 - 1246
  • [34] A Multi-scale Dilated Residual Convolution Network for Image Denoising
    Xinlei Jia
    Yali Peng
    Bao Ge
    Jun Li
    Shigang Liu
    Wenan Wang
    Neural Processing Letters, 2023, 55 : 1231 - 1246
  • [35] MULTI-STEP QUANTIZATION OF A MULTI-SCALE NETWORK FOR CROWD COUNTING
    Shim, Kyujin
    Byun, Junyoung
    Kim, Changick
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020 : 683 - 687
  • [36] MSR-FAN: Multi-scale residual feature-aware network for crowd counting
    Zhao, Haoyu
    Min, Weidong
    Wei, Xin
    Wang, Qi
    Fu, Qiyan
    Wei, Zitai
    IET IMAGE PROCESSING, 2021, 15 (14) : 3512 - 3521
  • [37] Crowd Counting Method Based on Multi-Scale Enhanced Network
    Xu Tao
    Duan Yinong
    Du Jiahao
    Liu Caihua
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1764 - 1771
  • [38] Dense Crowd Counting Network Based on Multi-scale Perception
    Li, Hengchao
    Liu, Xianglian
    Liu, Peng
    Feng, Bin
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2024, 59 (05) : 1176 - 1183
  • [39] MSIANet: Multi-scale Interactive Attention Crowd Counting Network
    Zhang, Shihui
    Zhao, Weibo
    Wang, Lei
    Wang, Wei
    Li, Qunpeng
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 2236 - 2245
  • [40] MHANet: Multi-scale hybrid attention network for crowd counting
    Yu, Ying
    Yu, Jiamao
    Qian, Jin
    Zhu, Zhiliang
    Han, Xing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9445 - 9455