Hybrid attention network based on progressive embedding scale-context for crowd counting

Cited by: 21
Authors
Wang, Fusen [1, 2]
Sang, Jun [1, 2]
Wu, Zhongyuan [1, 2]
Liu, Qi [1, 2]
Sang, Nong [3]
Affiliations
[1] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 400044, Peoples R China
[2] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Crowd counting; Hybrid attention; Progressive embedding scale-context; Density map estimation
DOI
10.1016/j.ins.2022.01.046
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Existing crowd counting methods usually adopt attention mechanisms to tackle background noise, or fuse multilevel features or multiscale context to tackle scale variation. However, these approaches address the two problems separately. In this paper, we propose a hybrid attention network (HANet) that employs progressive embedding scale-context (PES) information, enabling the network to simultaneously suppress noise and adapt to head scale variation. We build the hybrid attention mechanism from parallel spatial attention and channel attention modules, which makes the network focus more on head regions and reduces interference from background objects. In addition, we embed scale context into the hybrid attention along the spatial and channel dimensions to alleviate counting errors caused by variations in perspective and head scale. Finally, we propose a progressive learning strategy that cascades multiple hybrid attention modules, each embedding a different scale context, to gradually integrate scale-context information into the current feature map from global to local. Ablation experiments show that the network architecture gradually learns multiscale features and suppresses background noise. Extensive experiments demonstrate that HANet obtains state-of-the-art counting performance on five mainstream datasets. (c) 2022 Elsevier Inc. All rights reserved.
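The structure described in the abstract (parallel spatial and channel attention, a scale context embedded into both branches, and a cascade that moves from global to local context) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the pooling grid sizes `(1, 2, 4)`, the averaging of the two branches, the max-pooled channel descriptor, and the random weights standing in for learned parameters are all assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def scale_context(feat, bins):
    """Average-pool a (C, H, W) map over a bins x bins grid and
    broadcast each cell mean back to the original resolution.
    bins=1 gives a global context; larger bins give more local context."""
    _, h, w = feat.shape
    out = np.empty_like(feat)
    ys = np.linspace(0, h, bins + 1).astype(int)
    xs = np.linspace(0, w, bins + 1).astype(int)
    for i in range(bins):
        for j in range(bins):
            cell = feat[:, ys[i]:ys[i + 1], xs[j]:xs[j + 1]]
            out[:, ys[i]:ys[i + 1], xs[j]:xs[j + 1]] = cell.mean(
                axis=(1, 2), keepdims=True
            )
    return out


def hybrid_attention(feat, bins, w_spatial, w_channel):
    """One hybrid attention step with an embedded scale context (assumed design)."""
    ctx = scale_context(feat, bins)
    # Spatial branch: a 1x1 projection across channels gives an (H, W) mask.
    spatial = sigmoid(np.einsum("c,chw->hw", w_spatial, ctx))[None, :, :]
    # Channel branch: per-channel max over the pooled cells, then a linear layer.
    channel = sigmoid(w_channel @ ctx.max(axis=(1, 2)))[:, None, None]
    # The two branches run in parallel; averaging them is an assumption.
    return feat * 0.5 * (spatial + channel)


def progressive_han(feat, bins_schedule=(1, 2, 4), seed=0):
    """Cascade hybrid attention steps from global (bins=1) to local context."""
    rng = np.random.default_rng(seed)
    c = feat.shape[0]
    for bins in bins_schedule:
        # Random weights are placeholders for parameters a real model would learn.
        w_spatial = rng.normal(scale=0.1, size=c)
        w_channel = rng.normal(scale=0.1, size=(c, c))
        feat = hybrid_attention(feat, bins, w_spatial, w_channel)
    return feat


features = np.random.default_rng(1).random((4, 8, 8))  # toy (C, H, W) feature map
refined = progressive_han(features)
print(refined.shape)
```

Because each gate lies strictly between 0 and 1, every stage can only attenuate features, which mirrors the noise-suppression role the abstract assigns to attention; a trained network would typically add residual connections and learned convolutions that this sketch omits.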
Pages: 306-318
Number of pages: 13
Related papers
  • [1] Scale-Context Perceptive Network for Crowd Counting and Localization in Smart City System
    Zhai, Wenzhe
    Gao, Mingliang
    Guo, Xiangyu
    Li, Qilei
    Jeon, Gwanggil
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (21) : 18930 - 18940
  • [2] Context Attention Fusion Network for crowd counting
    Wang, Tao
    Zhang, Ting
    Zhang, Kaibing
    Wang, Huake
    Li, Minqi
    Lu, Jian
    KNOWLEDGE-BASED SYSTEMS, 2023, 271
  • [3] MHANet: Multi-scale hybrid attention network for crowd counting
    Yu, Ying
    Yu, Jiamao
    Qian, Jin
    Zhu, Zhiliang
    Han, Xing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9445 - 9455
  • [4] Multi-branch progressive embedding network for crowd counting
    Zhou, Lifang
    Rao, Songlin
    Li, Weisheng
    Hu, Bo
    Sun, Bo
    IMAGE AND VISION COMPUTING, 2024, 148
  • [5] Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting
    Wang, Xin
    Lv, Rongrong
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 240 - 245
  • [6] Context-aware pyramid attention network for crowd counting
    Gu, Lingyu
    Pang, Chen
    Zheng, Yanjun
    Lyu, Chen
    Lyu, Lei
    APPLIED INTELLIGENCE, 2022, 52 (06) : 6164 - 6180
  • [8] Attention-injective scale aggregation network for crowd counting
    Zou, Haojie
    Kuang, Yingchun
    Luo, Jianqiang
    Yao, Mingwei
    Zhou, Haoyu
    Yang, Sha
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (05)
  • [9] HANet: Hybrid Attention-aware Network for Crowd Counting
    Su, Xinxing
    Yuan, Yuchen
    Su, Xiangbo
    Zou, Zhikang
    Wen, Shilei
    Zhou, Pan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7707 - 7714
  • [10] Multi-Scale Guided Attention Network for Crowd Counting
    Li, Pengfei
    Zhang, Min
    Wan, Jian
    Jiang, Ming
    SCIENTIFIC PROGRAMMING, 2021, 2021