Person re-identification transformer with patch attention and pruning

被引:0
|
作者
Ndayishimiye, Fabrice [1 ]
Yoon, Gang-Joon [2 ]
Lee, Joonjae [1 ]
Yoon, Sang Min [3 ]
机构
[1] Faculty of Computer Engineering, Keimyung University, 1095 Dalgubeol-daero, Dalseo-gu, Daegu,42601, Korea, Republic of
[2] National Institute for Mathematical Sciences, 70, Yuseong-daero 1689 beon-gil, Yuseong-gu, Daejeon,34047, Korea, Republic of
[3] HCI Lab., College of Computer Science, Kookmin University, 77 Jeongneung-ro, Seoul,02707, Korea, Republic of
基金
新加坡国家研究基金会;
关键词
Convolutional neural networks;
D O I
10.1016/j.jvcir.2024.104348
中图分类号
学科分类号
摘要
Person re-identification (Re-ID), which is widely used in surveillance and tracking systems, aims to search individuals as they move between different camera views by maintaining identity across various camera views. In the realm of person re-identification (Re-ID), recent advancements have introduced convolutional neural networks (CNNs) and vision transformers (ViTs) as promising solutions. While CNN-based methods excel in local feature extraction, ViTs have emerged as effective alternatives to CNN-based person Re-ID, offering the ability to capture long-range dependencies through multi-head self-attention without relying on convolution and downsampling. However, it still faces challenges such as changes in illumination, viewpoint, pose, low resolutions, and partial occlusions. To address the limitations of widely used person Re-ID datasets and improve the generalization, we present a novel person Re-ID method that enhances global and local information interactions using self-attention modules within a ViT network. It leverages dynamic pruning to extract and prioritize essential image patches effectively. The designed patch selection and pruning for person Re-ID model resulted in a robust feature extractor even in scenarios with partial occlusion, background clutter, and illumination variations. Empirical validation demonstrates its superior performance compared to previous approaches and its adaptability across various domains. © 2024 Elsevier Inc.
引用
收藏
相关论文
共 50 条
  • [31] Filter pruning based on evolutionary algorithms for person re-identification
    Zhao, Jiaqi
    Chen, Ying
    Zhong, Yufeng
    Zhou, Yong
    Yao, Rui
    Zhang, Lixu
    Xia, Shixiong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 32569 - 32586
  • [32] Diff attention: A novel attention scheme for person re-identification
    Lin, Xin
    Zhu, Li
    Yang, Shuyu
    Wang, Yaxiong
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 228
  • [33] Learning transformer-based attention region with multiple scales for occluded person re-identification
    Liu, Zhi
    Mu, Xingyu
    Lu, Yunhua
    Zhang, Tingting
    Tian, Yingli
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 229
  • [34] Cross-Attention Fusion Learning of Transformer-CNN Features for Person Re-Identification
    Xiang, Jun
    Zhang, Jincheng
    Jiang, Xiaoping
    Hou, Jianhua
    Computer Engineering and Applications, 2024, 60 (16) : 94 - 104
  • [35] A Multi-Scale Graph Attention-Based Transformer for Occluded Person Re-Identification
    Ma, Ming
    Wang, Jianming
    Zhao, Bohan
    Applied Sciences (Switzerland), 14 (18):
  • [36] Self and Channel Attention Network for Person Re-Identification
    Munir, Asad
    Martinel, Niki
    Micheloni, Christian
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4025 - 4031
  • [37] Dual Branch Attention Network for Person Re-Identification
    Fan, Denghua
    Wang, Liejun
    Cheng, Shuli
    Li, Yongming
    SENSORS, 2021, 21 (17)
  • [38] Deep Pyramidal Pooling With Attention for Person Re-Identification
    Martinel, Niki
    Foresti, Gian Luca
    Micheloni, Christian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 7306 - 7316
  • [39] Multi-layer attention for person re-identification
    Zhang, Yuele
    Guo, Jie
    Huang, Zheng
    Qiu, Weidong
    Fan, Hexiaohui
    2018 INTERNATIONAL JOINT CONFERENCE ON METALLURGICAL AND MATERIALS ENGINEERING (JCMME 2018), 2019, 277
  • [40] DUAL REVERSE ATTENTION NETWORKS FOR PERSON RE-IDENTIFICATION
    Liu, Shuangwei
    Qi, Lin
    Zhang, Yunzhou
    Shi, Weidong
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1232 - 1236