Deep feature network with multi-scale fusion for highly congested crowd counting

被引:1
|
作者
Yan, Leilei [1 ]
Zhang, Li [1 ]
Zheng, Xiaohan [1 ]
Li, Fanzhang [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Jiangsu, Peoples R China
关键词
Crowd counting; Multi-scale; Dilated convolution framework; Gridding; Low-level spatial information; High-level semantic information; PARTIALLY OCCLUDED HUMANS; BAYESIAN COMBINATION; MULTIPLE; IMAGE;
D O I
10.1007/s13042-023-01941-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a deep feature network with multi-scale fusion (DFNet) for addressing the problem of crowd counting in highly congested noisy scenes. DFNet contains three modules: feature encoder, feature decoder and feature fusion. The feature encoder uses a VGG-16-based convolutional neural network (CNN) that encodes features from images and forms a kind of low-level spatial information. The feature decoder is a multi-column dilated convolutional neural network (McDCNN) with different dilation rates that can capture a multi-scale contextual information, decode the low-level spatial information and generate a kind of high-level semantic information. Furthermore, the multi-column architecture in McDCNN can effectively relieve the "gridding" issue presented in the dilated convolution framework. The feature fusion block uses a simple and effective network architecture to sufficiently incorporate the low-level spatial and the high-level semantic information for facilitating high-quality density map estimation and performing accurate crowd counting. Extensive experiments on several highly challenging crowd counting datasets are conducted. Experimental results show that DFNet is comparable with recent state-of-the-art approaches.
引用
收藏
页码:819 / 835
页数:17
相关论文
共 50 条
  • [21] A Crowd Counting and Localization Network Based on Adaptive Feature Fusion and Multi-Scale Global Attention Up Sampling
    Wang, Min
    Huang, Li
    Yan, Jingke
    Huang, Jin
    Yang, Tao
    IEEE ACCESS, 2024, 12 : 12919 - 12939
  • [22] Redesigning Multi-Scale Neural Network for Crowd Counting
    Du, Zhipeng
    Shi, Miaojing
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3664 - 3678
  • [23] Crowd Counting based on Multi-level Multi-scale Feature
    Di Wu
    Zheyi Fan
    Shuhan Yi
    Applied Intelligence, 2023, 53 : 21891 - 21901
  • [24] Multi-Scale Guided Attention Network for Crowd Counting
    Li, Pengfei
    Zhang, Min
    Wan, Jian
    Jiang, Ming
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [25] Multi-scale Attention Recalibration Network for crowd counting
    Xie, Jinyang
    Pang, Chen
    Zheng, Yanjun
    Li, Liang
    Lyu, Chen
    Lyu, Lei
    Liu, Hong
    APPLIED SOFT COMPUTING, 2022, 117
  • [26] STOCHASTIC MULTI-SCALE AGGREGATION NETWORK FOR CROWD COUNTING
    Wang, Mingjie
    Cai, Hao
    Zhou, Jun
    Gong, Minglun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2008 - 2012
  • [27] Crowd Counting based on Multi-level Multi-scale Feature
    Wu, Di
    Fan, Zheyi
    Yi, Shuhan
    APPLIED INTELLIGENCE, 2023, 53 (19) : 21891 - 21901
  • [28] Multi-level feature fusion network for crowd counting
    Wang, Luyang
    Li, Yun
    Peng, Sifan
    Tang, Xiao
    Yin, Baoqun
    IET COMPUTER VISION, 2021, 15 (01) : 60 - 72
  • [29] MULTI-STEP QUANTIZATION OF A MULTI-SCALE NETWORK FOR CROWD COUNTING
    Shim, Kyujin
    Byun, Junyoung
    Kim, Changick
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 683 - 687
  • [30] MSR-FAN: Multi-scale residual feature-aware network for crowd counting
    Zhao, Haoyu
    Min, Weidong
    Wei, Xin
    Wang, Qi
    Fu, Qiyan
    Wei, Zitai
    IET IMAGE PROCESSING, 2021, 15 (14) : 3512 - 3521