HTNet: A Hybrid Model Boosted by Triple Self-attention for Crowd Counting

Cited by: 0
Authors
Li, Yang [1]
Yin, Baoqun [1]
Affiliations
[1] Univ Sci & Technol China, Hefei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Crowd Counting; Deep Learning; Self-Attention; Hybrid Model;
DOI
10.1007/978-981-99-8555-5_23
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The rapid development of convolutional neural networks (CNNs) has enabled significant progress in crowd counting research. However, the fixed-size convolutional kernels of traditional methods make it difficult to handle problems such as drastic scale change and complex background interference. To address these challenges, we propose a hybrid crowd counting model. First, we apply a global self-attention module (GAM) after the CNN backbone to capture wider contextual information. Second, because the feature map size is gradually recovered in the decoding stage, a local self-attention module (LAM) is employed there to reduce computational complexity. With this design, the model can fuse features from global and local perspectives to better cope with scale change. Additionally, to establish the interdependence between spatial and channel dimensions, we design a novel channel self-attention module (CAM) and combine it with the LAM. Finally, we construct a simple yet useful double head module that outputs a foreground segmentation map in addition to the intermediate density map; the two maps are then multiplied pixel-wise to suppress background interference. Experimental results on several benchmark datasets demonstrate that our method achieves remarkable improvement.
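The pixel-wise combination performed by the double head module can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the segmentation map is produced or whether it is applied as a soft or hard mask, so the sigmoid activation, the optional `threshold` parameter, and the function name `double_head_output` are all assumptions made for illustration.

```python
import numpy as np


def sigmoid(x):
    """Map raw scores to the range (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def double_head_output(density_map, seg_logits, threshold=None):
    """Suppress background by multiplying a density map with a foreground mask.

    Hypothetical helper: the sigmoid and the optional hard threshold are
    assumptions, not details taken from the paper.
    """
    if density_map.shape != seg_logits.shape:
        raise ValueError("density map and segmentation logits must share a shape")
    mask = sigmoid(seg_logits)  # soft foreground probability per pixel
    if threshold is not None:
        # Optionally binarize the mask (assumed variant).
        mask = (mask >= threshold).astype(density_map.dtype)
    return density_map * mask  # pixel-wise product


# Toy 2x2 example: the right-hand column is treated as background.
density = np.array([[0.2, 0.1], [0.4, 0.3]])
logits = np.array([[10.0, -10.0], [10.0, -10.0]])
refined = double_head_output(density, logits, threshold=0.5)
count = refined.sum()  # the predicted count is the sum over the density map
```

In this toy case the density predicted in the background column is zeroed out, so only the foreground pixels contribute to the estimated count.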
Pages: 290-301
Number of pages: 12
Related papers
50 in total
  • [1] Crowd Counting Network with Self-attention Distillation
    Li, Yaoyao
    Wang, Li
    Zhao, Huailin
    Nie, Zhen
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2020, 7 (02): : 116 - 120
  • [2] Crowd Counting Network with Self-attention Distillation
    Wang, Li
    Zhao, Huailin
    Nie, Zhen
    Li, Yaoyao
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB2020), 2020, : 587 - 591
  • [3] Self-attention Guidance Based Crowd Localization and Counting
    Ma, Zhouzhou
    Gu, Guanghua
    Zhao, Wenrui
    MACHINE INTELLIGENCE RESEARCH, 2024, 21 (05) : 966 - 982
  • [4] Crowd counting method based on the self-attention residual network
    Liu, Yan-Bo
    Jia, Rui-Sheng
    Liu, Qing-Ming
    Zhang, Xing-Li
    Sun, Hong-Mei
    APPLIED INTELLIGENCE, 2021, 51 (01) : 427 - 440
  • [5] Crowd counting method based on the self-attention residual network
    Yan-Bo Liu
    Rui-Sheng Jia
    Qing-Ming Liu
    Xing-Li Zhang
    Hong-Mei Sun
    Applied Intelligence, 2021, 51 : 427 - 440
  • [6] MSGSA: Multi-Scale Guided Self-Attention Network for Crowd Counting
    Sun, Yange
    Li, Meng
    Guo, Huaping
    Zhang, Li
    ELECTRONICS, 2023, 12 (12)
  • [7] Dual-branch crowd counting algorithm based on self-attention mechanism
    Yang T.-L.
    Li L.-X.
    Zhang W.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (10): : 1955 - 1965
  • [8] Double Recursive Sparse Self-attention Based Crowd Counting in the Cluttered Background
    Zhou, Boxiang
    Wang, Suyu
    Xiao, Sai
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 722 - 734
  • [9] Crowd counting using a self-attention multi-scale cascaded network
    Li, He
    Zhang, Shihui
    Kong, Weihang
    IET COMPUTER VISION, 2019, 13 (06) : 556 - 561
  • [10] TRIPLE ATTENTION FOR ROBUST VIDEO CROWD COUNTING
    Wu, Qiyao
    Zhang, Chongyang
    Kong, Xiyu
    Zhao, Muming
    Chen, Yanjun
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1966 - 1970