Person re-identification transformer with patch attention and pruning

Authors
Ndayishimiye, Fabrice [1]
Yoon, Gang-Joon [2]
Lee, Joonjae [1]
Yoon, Sang Min [3]
Affiliations
[1] Faculty of Computer Engineering, Keimyung University, 1095 Dalgubeol-daero, Dalseo-gu, Daegu 42601, Republic of Korea
[2] National Institute for Mathematical Sciences, 70, Yuseong-daero 1689 beon-gil, Yuseong-gu, Daejeon 34047, Republic of Korea
[3] HCI Lab., College of Computer Science, Kookmin University, 77 Jeongneung-ro, Seoul 02707, Republic of Korea
Funding
National Research Foundation Singapore
Keywords
Convolutional neural networks
DOI
10.1016/j.jvcir.2024.104348
Abstract
Person re-identification (Re-ID), which is widely used in surveillance and tracking systems, aims to find individuals as they move between camera views while maintaining their identity across those views. Recent work has introduced convolutional neural networks (CNNs) and vision transformers (ViTs) as promising solutions. While CNN-based methods excel at local feature extraction, ViTs have emerged as effective alternatives, capturing long-range dependencies through multi-head self-attention without relying on convolution and downsampling. However, person Re-ID still faces challenges such as changes in illumination, viewpoint, and pose, low resolution, and partial occlusion. To address the limitations of widely used person Re-ID datasets and improve generalization, we present a novel person Re-ID method that enhances the interaction between global and local information using self-attention modules within a ViT network. The method uses dynamic pruning to extract and prioritize essential image patches. The resulting patch selection and pruning yield a robust feature extractor even under partial occlusion, background clutter, and illumination variation. Empirical validation shows that the method outperforms previous approaches and adapts across domains. © 2024 Elsevier Inc.
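The abstract does not state how patches are scored or pruned, so the following is only a minimal sketch of one common form of attention-based patch pruning, not the authors' method. It assumes that each patch is scored by the softmax attention it receives from a single class-token query, and that a fixed fraction of the highest-scoring patches is kept. The function names `cls_attention_scores` and `prune_patches`, the `keep_ratio` parameter, and the toy vectors are all hypothetical.

```python
import math


def cls_attention_scores(cls_query, patch_keys):
    """Softmax over scaled dot products of one class-token query with each patch key."""
    scale = math.sqrt(len(cls_query))
    logits = [sum(q * k for q, k in zip(cls_query, key)) / scale
              for key in patch_keys]
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def prune_patches(patch_tokens, scores, keep_ratio=0.5):
    """Keep the highest-scoring patches (assumed criterion), in their original order."""
    n_keep = max(1, math.ceil(len(patch_tokens) * keep_ratio))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:n_keep])  # restore spatial order of surviving patches
    return kept, [patch_tokens[i] for i in kept]


# Toy data: four patches with 2-dimensional keys (illustrative values only).
cls_query = [1.0, 0.0]
patch_keys = [[0.1, 0.0], [2.0, 0.0], [-1.0, 0.0], [1.0, 0.0]]
patch_tokens = ["p0", "p1", "p2", "p3"]

scores = cls_attention_scores(cls_query, patch_keys)
kept_idx, kept_tokens = prune_patches(patch_tokens, scores, keep_ratio=0.5)
print(kept_idx, kept_tokens)
```

In a real ViT the scores would typically be averaged over attention heads and the pruning applied between transformer blocks, so later layers process fewer tokens; both choices are assumptions here rather than details taken from the paper.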
Related papers
50 in total
  • [1] Attention Map Guided Transformer Pruning for Occluded Person Re-Identification on Edge Device
    Mao, Junzhu
    Yao, Yazhou
    Sun, Zeren
    Huang, Xingguo
    Shen, Fumin
    Shen, Heng-Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1592 - 1599
  • [2] A Patch Information Supplement Transformer for Person Re-Identification
    Zhu, Li
    Jiang, Chenglong
    Wu, Minghu
    ELECTRONICS, 2023, 12 (09)
  • [3] Joint learning dynamic pruning and attention for person re-identification
    Ru Cheng
    Lukun Wang
    Mingrun Wei
    Chunpeng Tian
    Multimedia Tools and Applications, 2022, 81 : 39409 - 39429
  • [4] Joint learning dynamic pruning and attention for person re-identification
    Cheng, Ru
    Wang, Lukun
    Wei, Mingrun
    Tian, Chunpeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (27) : 39409 - 39429
  • [5] Patch Features Reconstruction Transformer for Occluded Person Re-Identification
    Zhao, Yunbin
    Zhu, Songhao
    Liang, Zhiwei
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022 : 6273 - 6278
  • [6] A person re-identification method for fusing convolutional attention and Transformer architecture
    Wang J.
    Li P.
    Zhao R.
    Zhang Y.
    Ma Z.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2024, 50 (02) : 466 - 476
  • [7] Attention driven person re-identification
    Yang, Fan
    Yan, Ke
    Lu, Shijian
    Jia, Huizhu
    Xie, Xiaodong
    Gao, Wen
    PATTERN RECOGNITION, 2019, 86 : 143 - 155
  • [8] Dual-branch adaptive attention transformer for occluded person re-identification
    Lu, Yunhua
    Jiang, Mingzi
    Liu, Zhi
    Mu, Xinyu
    IMAGE AND VISION COMPUTING, 2023, 131
  • [9] Dynamic Attention Vision-Language Transformer Network for Person Re-identification
    Guifang Zhang
    Shijun Tan
    Zhe Ji
    Yuming Fang
    International Journal of Computer Vision, 2025, 133 (4) : 1927 - 1939
  • [10] Completed Part Transformer for Person Re-Identification
    Zhang, Zhong
    He, Di
    Liu, Shuang
    Xiao, Baihua
    Durrani, Tariq S.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2303 - 2313