Multi-scale Dilated Convolution of Feature Fusion Network for Crowd Counting

Authors
Donghua Liu
Guodong Wang
Guangtao Zhai
Affiliations
[1] Qingdao University, College of Computer Science and Technology
[2] Shanghai Jiao Tong University, Institute of Image Communication and Network Engineering
Keywords
Crowd counting; Convolutional neural network; Dilated convolution; Feature fusion
Abstract
Crowd counting has long been a challenging task because of perspective distortion and variability in head size. Previous methods either ignore the multi-scale information in images or simply use convolutions with different kernel sizes to extract multi-scale features, so the extracted multi-scale features are incomplete. In this paper, we propose a crowd counting model based on a convolutional neural network (CNN), called the Multi-scale Dilated Convolution of Feature Fusion Network (MsDFNet). MsDFNet follows the density-map regression approach: the density map is predicted from parameters learned by the CNN. The proposed network has three main components: a CNN that extracts low-level features, a multi-scale dilated convolution module with multi-column feature fusion blocks, and a density map regression module. Multi-scale dilated convolutions extract multi-scale high-level features, and the features extracted from the different columns are then fused. Combining the multi-scale dilated convolution module with the multi-column feature fusion blocks extracts more complete multi-scale features and improves the counting of small targets. Experiments show that fusing multi-scale context feature information effectively addresses the problem of varying head sizes in images. We demonstrate the effectiveness of our method on two public datasets, the ShanghaiTech dataset and the UCF_CC_50 dataset. Compared with previous state-of-the-art crowd counting algorithms in terms of MAE (Mean Absolute Error) and MSE (Mean Square Error), our method significantly improves performance, especially when head sizes vary. On the UCF_CC_50 dataset, our method reduces the MAE by 28.6 compared with the previous state-of-the-art method (the lower the MAE, the better the performance).
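The idea of multi-column dilated convolution followed by fusion can be illustrated with a minimal one-dimensional sketch in plain Python. This is not the MsDFNet implementation: the function names, the dilation rates (1, 2, 3), the all-ones kernel, and the averaging fusion are all assumptions chosen for illustration, since the abstract does not specify them. The sketch only shows that a kernel of size k with dilation d covers k + (k - 1)(d - 1) input positions, so columns with different dilation rates see context at different scales without adding weights.

```python
def effective_kernel_size(kernel_size, dilation):
    """Number of input positions spanned by a dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation):
    """1D dilated cross-correlation with zero padding (output length = input length)."""
    k = len(kernel)
    span = (k - 1) * dilation          # distance between first and last kernel tap
    pad_left = span // 2
    padded = [0.0] * pad_left + list(signal) + [0.0] * (span - pad_left)
    return [
        sum(kernel[j] * padded[i + j * dilation] for j in range(k))
        for i in range(len(signal))
    ]


def multi_scale_fuse(signal, kernel, dilations=(1, 2, 3)):
    """Run one column per dilation rate, then fuse the columns.

    Assumption: fusion is an element-wise average here; the paper's
    fusion block is not described in the abstract and may differ.
    """
    columns = [dilated_conv1d(signal, kernel, d) for d in dilations]
    return [sum(values) / len(values) for values in zip(*columns)]


if __name__ == "__main__":
    feature_row = [1, 2, 3, 4, 5]      # hypothetical low-level feature values
    kernel = [1, 1, 1]                 # hypothetical kernel weights
    for d in (1, 2, 3):
        print(d, effective_kernel_size(len(kernel), d),
              dilated_conv1d(feature_row, kernel, d))
    print(multi_scale_fuse(feature_row, kernel))
```

With a size-3 kernel, the three columns span 3, 5, and 7 input positions respectively, which is the sense in which dilation provides multi-scale context at a fixed parameter count.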
Pages: 37939-37952
Page count: 13