Attention-Guided Collaborative Counting

被引:10
|
作者
Mo, Hong [1 ]
Ren, Wenqi [2 ]
Zhang, Xiong [3 ]
Yan, Feihu [4 ]
Zhou, Zhong [1 ]
Cao, Xiaochun [2 ]
Wu, Wei [1 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[3] Neolix Autonomous Vehicle, Beijing 100016, Peoples R China
[4] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Collaboration; Transformers; Task analysis; Head; Computational modeling; Computer vision; Crowd counting; attention-guided collaborative counting model; bi-directional transformer;
D O I
10.1109/TIP.2022.3207584
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing crowd counting designs usually exploit multi-branch structures to address the scale diversity problem. However, branches in these structures work in a competitive rather than collaborative way. In this paper, we focus on promoting collaboration between branches. Specifically, we propose an attention-guided collaborative counting module (AGCCM) comprising an attention-guided module (AGM) and a collaborative counting module (CCM). The CCM promotes collaboration among branches by recombining each branch's output into an independent count and joint counts with other branches. The AGM capturing the global attention map through a transformer structure with a pair of foreground-background related loss functions can distinguish the advantages of different branches. The loss functions do not require additional labels and crowd division. In addition, we design two kinds of bidirectional transformers (Bi-Transformers) to decouple the global attention to row attention and column attention. The proposed Bi-Transformers are able to reduce the computational complexity and handle images in any resolution without cropping the image into small patches. Extensive experiments on several public datasets demonstrate that the proposed algorithm performs favorably against the state-of-the-art crowd counting methods.
引用
收藏
页码:6306 / 6319
页数:14
相关论文
共 50 条
  • [31] Unsupervised Attention-guided Image-to-Image Translation
    Mejjati, Youssef A.
    Richardt, Christian
    Tompkin, James
    Cosker, Darren
    Kim, Kwang In
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [32] Attention-guided Progressive Partition Network for Human Parsing
    Huang, Xi
    He, Chengkun
    Shao, Jie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [33] Joint Scence Network and Attention-Guided for Image Captioning
    Zhou, Dongming
    Yang, Jing
    Zhang, Canlong
    Tang, Yanping
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1535 - 1540
  • [34] Attention-guided Feature Fusion for Small Object Detection
    Yang, Jiaxiong
    Liu, Xianhui
    Liu, Zhuang
    IST 2023 - IEEE International Conference on Imaging Systems and Techniques, Proceedings, 2023,
  • [35] Attention-Guided Fusion and Classification for Hyperspectral and LiDAR Data
    Huang, Jing
    Zhang, Yinghao
    Yang, Fang
    Chai, Li
    Tansey, Kevin
    REMOTE SENSING, 2024, 16 (01)
  • [36] SDETR: ATTENTION-GUIDED SALIENT OBJECT DETECTION WITH TRANSFORMER
    Liu, Guanze
    Xu, Bo
    Huang, Han
    Lu, Cheng
    Guo, Yandong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1611 - 1615
  • [37] Attention-Guided Region Proposal Network for Pedestrian Detection
    Sun, Rui
    Wang, Huihui
    Zhang, Jun
    Zhang, Xudong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (10) : 2072 - 2076
  • [38] Attention-Guided Supervised Contrastive Learning for Deepfake Detection
    Waseem, Saima
    Abu Bakar, Syed Abdul Rahman Bin Syed
    Ahmed, Bilal Ashfaq
    2024 IEEE 8TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS, ICSIPA, 2024,
  • [39] Attention-Guided Answer Distillation for Machine Reading Comprehension
    Hu, Minghao
    Peng, Yuxing
    Wei, Furu
    Huang, Zhen
    Li, Dongsheng
    Yang, Nan
    Zhou, Ming
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2077 - 2086
  • [40] Attention-guided chained context aggregation for semantic segmentation*
    Tang, Quan
    Liu, Fagui
    Zhang, Tong
    Jiang, Jun
    Zhang, Yu
    IMAGE AND VISION COMPUTING, 2021, 115 (115)