MLANet: multi-level attention network with multi-scale feature fusion for crowd counting

被引:0
|
作者
Xiong, Liyan [1 ]
Zeng, Yijuan [1 ]
Huang, Xiaohui [1 ]
Li, Zhida [1 ]
Huang, Peng [2 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] Rd Network Operat Management Co, Jiangxi Prov Commun Investment Grp Co Ltd, Nanchang 330013, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-level structure; Multi-scale feature; Attention mechanism; Crowd counting;
D O I
10.1007/s10586-024-04326-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Estimating the population in a given scene is a process known as crowd counting. The field has recently garnered significant attention, and many innovative methods have emerged. However, intense scale variations and background interference make crowd counting in realistic scenes always challenging. To address these in this paper, a multi-level attention network with multi-scale feature fusion named MLANet is proposed. The network consists of three sections: a multi-level base feature extraction front-end network, a centralized dilated multi-scale feature fusion mid-end network with a global attention module, and a back-end network for the generation of density maps. By incorporating a flexible attention module and multi-scale features, the method can accurately capture crowd information at different scales and achieve accurate counting results. We evaluated the method on four public datasets (UCF_CC_50, ShanghaiTech, WorldExpo'10, and Beijing BRT), and the experimental results demonstrate a significant reduction in counting error when compared with existing methods.
引用
收藏
页码:6591 / 6608
页数:18
相关论文
共 50 条
  • [31] Crowd counting based on attention-guided multi-scale fusion networks
    Zhang, Bo
    Wang, Naiyao
    Zhao, Zheng
    Abraham, Ajith
    Liu, Hongbo
    [J]. NEUROCOMPUTING, 2021, 451 : 12 - 24
  • [32] STNet: Scale Tree Network With Multi-Level Auxiliator for Crowd Counting
    Wang, Mingjie
    Cai, Hao
    Han, Xian-Feng
    Zhou, Jun
    Gong, Minglun
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2074 - 2084
  • [33] Artifact-Assisted multi-level and multi-scale feature fusion attention network for low-dose CT denoising
    Cui, Xueying
    Guo, Yingting
    Zhang, Xiong
    Hong Shangguan
    Liu, Bin
    Wang, Anhong
    [J]. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2022, 30 (05) : 875 - 889
  • [34] MFANet: Multi-scale feature fusion network with attention mechanism
    Wang, Gaihua
    Gan, Xin
    Cao, Qingcheng
    Zhai, Qianyu
    [J]. VISUAL COMPUTER, 2023, 39 (07): : 2969 - 2980
  • [35] MFANet: Multi-scale feature fusion network with attention mechanism
    Gaihua Wang
    Xin Gan
    Qingcheng Cao
    Qianyu Zhai
    [J]. The Visual Computer, 2023, 39 : 2969 - 2980
  • [36] A Multi-Scale and Multi-Level Spectral-Spatial Feature Fusion Network for Hyperspectral Image Classification
    Mu, Caihong
    Guo, Zhen
    Liu, Yi
    [J]. REMOTE SENSING, 2020, 12 (01)
  • [37] Crowd counting using a self-attention multi-scale cascaded network
    Li, He
    Zhang, Shihui
    Kong, Weihang
    [J]. IET COMPUTER VISION, 2019, 13 (06) : 556 - 561
  • [38] Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting
    Wang, Xin
    Lv, Rongrong
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    [J]. PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 240 - 245
  • [39] MFP-Net: Multi-scale feature pyramid network for crowd counting
    Lei, Tao
    Zhang, Dong
    Wang, Risheng
    Li, Shuying
    Zhang, Weijiang
    Nandi, Asoke K.
    [J]. IET IMAGE PROCESSING, 2021, 15 (14) : 3522 - 3533
  • [40] MSGSA: Multi-Scale Guided Self-Attention Network for Crowd Counting
    Sun, Yange
    Li, Meng
    Guo, Huaping
    Zhang, Li
    [J]. ELECTRONICS, 2023, 12 (12)