Multi-scale dilated convolution of feature Fusion Network for Crowd counting

被引:0
|
作者
Donghua Liu
Guodong Wang
Guangtao Zhai
机构
[1] Qingdao University,College of Computer Science and Technology
[2] Shanghai Jiao Tong University,Institute of Image Communication and Network Engineering
来源
关键词
Crowd counting; Convolution neural network; Dilated convolution; Feature fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Crowd counting has long been a challenging task due to the perspective distortion and variability in head size. The previous methods ignore the multi-scale information in images or simply use convolutions with different kernel sizes to extract multi-scale features, resulting in incomplete multi-scale features extracted. In this paper, we propose a crowd counting model called Multi-scale Dilated Convolution of Feature Fusion Network (MsDFNet) based on a CNN (convolutional neural network). Our MsDFNet is based on the regression method of the density map. The density map is predicted by the parameters learned by CNN to obtain better prediction results. The proposed network mainly includes three components, a CNN to extract low-level features, a multi-scale dilated convolution module and multi-column feature fusion blocks, a density map regression module. Multi-scale dilated convolutions are employed to extract multi-scale high-level features, and the features extracted from different columns are fused. The combination of the multi-scale dilated convolution module and the multi-column feature fusion block can effectively extract more complete multi-scale features and boost the performance of counting small-sized targets. Experiments show that the problem of various head sizes in images can be effectively solved by fusing multi-scale context feature information. We prove the effectiveness of our method on two public datasets (The ShanghaiTech dataset and the UCF_CC_50 dataset). We compare our method with the previous state-of-the-art crowd counting algorithms in terms of MAE (Mean Absolute Error) and MSE (Mean Square Error) and significantly improves the performance, especially in case of various head sizes. On the UCF_CC_50 dataset, our method reduces the MAE index by 28.6 compared with the previous state-of-the-art method. (The lower the MAE, the better the performance).
引用
收藏
页码:37939 / 37952
页数:13
相关论文
共 50 条
  • [41] COMAL: compositional multi-scale feature enhanced learning for crowd counting
    Zhou, Fangbo
    Zhao, Huailin
    Zhang, Yani
    Zhang, Qing
    Liang, Lanjun
    Li, Yaoyao
    Duan, Zuodong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (15) : 20541 - 20560
  • [42] Multi-Scale Network with Integrated Attention Unit for Crowd Counting
    Hafeezallah, Adel
    Al-Dhamari, Ahlam
    Abu-Bakar, Syed Abd Rahman
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3879 - 3903
  • [43] COMAL: compositional multi-scale feature enhanced learning for crowd counting
    Fangbo Zhou
    Huailin Zhao
    Yani Zhang
    Qing Zhang
    Lanjun Liang
    Yaoyao Li
    Zuodong Duan
    Multimedia Tools and Applications, 2022, 81 : 20541 - 20560
  • [44] Rolling bearing fault diagnosis based on dilated convolution and enhanced multi-scale feature adaptive fusion
    Han K.
    Zhan H.
    Yu J.
    Wang R.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1285 - 1295
  • [45] JMFEEL-Net: a joint multi-scale feature enhancement and lightweight transformer network for crowd counting
    Mingtao Wang
    Xin Zhou
    Yuanyuan Chen
    Knowledge and Information Systems, 2024, 66 : 3033 - 3053
  • [46] JMFEEL-Net: a joint multi-scale feature enhancement and lightweight transformer network for crowd counting
    Wang, Mingtao
    Zhou, Xin
    Chen, Yuanyuan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (05) : 3033 - 3053
  • [47] Deep multi-scale dilated convolution network for coronary artery segmentation
    Qiu, Yue
    Chai, Senchun
    Zhu, Enjun
    Zhang, Nan
    Zhang, Gaochang
    Zhao, Xin
    Cui, Lingguo
    Farhan, Ishrak Md
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [48] Multi-scale dilated convolution of convolutional neural network for image denoising
    Yanjie Wang
    Guodong Wang
    Chenglizhao Chen
    Zhenkuan Pan
    Multimedia Tools and Applications, 2019, 78 : 19945 - 19960
  • [49] Multi-scale dilated convolution of convolutional neural network for image denoising
    Wang, Yanjie
    Wang, Guodong
    Chen, Chenglizhao
    Pan, Zhenkuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (14) : 19945 - 19960
  • [50] Cascade-guided multi-scale attention network for crowd counting
    Shufang Li
    Zhengping Hu
    Mengyao Zhao
    Zhe Sun
    Signal, Image and Video Processing, 2021, 15 : 1663 - 1670