Attention-Guided Collaborative Counting

被引:10
|
作者
Mo, Hong [1 ]
Ren, Wenqi [2 ]
Zhang, Xiong [3 ]
Yan, Feihu [4 ]
Zhou, Zhong [1 ]
Cao, Xiaochun [2 ]
Wu, Wei [1 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[3] Neolix Autonomous Vehicle, Beijing 100016, Peoples R China
[4] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Collaboration; Transformers; Task analysis; Head; Computational modeling; Computer vision; Crowd counting; attention-guided collaborative counting model; bi-directional transformer;
D O I
10.1109/TIP.2022.3207584
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing crowd counting designs usually exploit multi-branch structures to address the scale diversity problem. However, branches in these structures work in a competitive rather than collaborative way. In this paper, we focus on promoting collaboration between branches. Specifically, we propose an attention-guided collaborative counting module (AGCCM) comprising an attention-guided module (AGM) and a collaborative counting module (CCM). The CCM promotes collaboration among branches by recombining each branch's output into an independent count and joint counts with other branches. The AGM capturing the global attention map through a transformer structure with a pair of foreground-background related loss functions can distinguish the advantages of different branches. The loss functions do not require additional labels and crowd division. In addition, we design two kinds of bidirectional transformers (Bi-Transformers) to decouple the global attention to row attention and column attention. The proposed Bi-Transformers are able to reduce the computational complexity and handle images in any resolution without cropping the image into small patches. Extensive experiments on several public datasets demonstrate that the proposed algorithm performs favorably against the state-of-the-art crowd counting methods.
引用
收藏
页码:6306 / 6319
页数:14
相关论文
共 50 条
  • [1] Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution
    Tse, Tze Ho Elden
    Kim, Kwang In
    Leonardis, Ales
    Chang, Hyung Jin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1654 - 1664
  • [2] ACDNet: Attention-guided Collaborative Decision Network for effective medication recommendation
    Mi, Jiacong
    Zu, Yi
    Wang, Zhuoyuan
    He, Jieyue
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 149
  • [3] Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting
    Wang, Xin
    Lv, Rongrong
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 240 - 245
  • [4] Crowd counting based on attention-guided multi-scale fusion networks
    Zhang, Bo
    Wang, Naiyao
    Zhao, Zheng
    Abraham, Ajith
    Liu, Hongbo
    NEUROCOMPUTING, 2021, 451 : 12 - 24
  • [5] Deep Attention-Guided Hashing
    Yang, Zhan
    Raymond, Osolo Ian
    Sun, Wuqing
    Long, Jun
    IEEE ACCESS, 2019, 7 : 11209 - 11221
  • [6] Attention-guided CNN for image denoising
    Tian, Chunwei
    Xu, Yong
    Li, Zuoyong
    Zuo, Wangmeng
    Fei, Lunke
    Liu, Hong
    NEURAL NETWORKS, 2020, 124 : 117 - 129
  • [7] Towards attention-guided human-computer collaborative reasoning for spatial configuration and design
    Bertel, Sven
    FOUNDATIONS OF AUGMENTED COGNITION, PROCEEDINGS, 2007, 4565 : 337 - 345
  • [8] AMGNet: An Attention-Guided Multi-Graph Collaborative Decision Network for Safe Medication Recommendation
    Li, Shiji
    Wang, Haitao
    He, Jianfeng
    Chen, Xing
    ELECTRONICS, 2025, 14 (04):
  • [9] 3D Crowd Counting via Geometric Attention-Guided Multi-view Fusion
    Qi Zhang
    Antoni B. Chan
    International Journal of Computer Vision, 2022, 130 : 3123 - 3139
  • [10] Attention-guided Unified Network for Panoptic Segmentation
    Li, Yanwei
    Chen, Xinze
    Zhu, Zheng
    Xie, Lingxi
    Huang, Guan
    Du, Dalong
    Wang, Xingang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7019 - 7028