Cross-modal pedestrian re-identification technique based on multi-scale feature attention and strategy balancing

被引:0
|
作者
Lai, Yiqiang [1 ]
机构
[1] Guangdong Univ Foreign Stusdies, South China Business Coll, Guangzhou 510545, Guangdong, Peoples R China
来源
ENGINEERING RESEARCH EXPRESS | 2025年 / 7卷 / 01期
关键词
multi-scale feature extraction; attention mechanism; cross-modal learning; pedestrian re-identification; strategy balancing mechanism; DEEP;
D O I
10.1088/2631-8695/adb93c
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper proposes a cross-modal pedestrian re-recognition technique based on the balance of attention and strategy of multi-scale features. The technique improves recognition accuracy by integrating information from different scales, dynamically adjusting attention, and balancing contributions from different modalities. The model architecture includes a multi-scale feature extraction module, an attention mechanism, a strategy balancing mechanism, and a classifier. Experimental results show that the proposed model exhibits superior performance on several public datasets such as Market-1501, DukeMTMC-reID, and CUHK03, especially on the Market-1501 dataset, where MAP and Rank-1 reach 0.83 and 0.89, respectively, which outperforms the existing baseline model and other methods. In addition, by integrating RGB and Thermal modal information, the model's recognition ability is further improved, showing the effectiveness of cross-modal information integration.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] A Person Re-Identification Method with Multi-Scale and Multi-Feature Fusion
    Liu, Li
    Li, Xi
    Lei, Xuemei
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (12): : 1868 - 1876
  • [42] Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
    Yang WANG
    Jinjia PENG
    Huibing WANG
    Meng WANG
    Science China(Information Sciences), 2022, 65 (06) : 33 - 47
  • [43] An efficient multi-scale channel attention network for person re-identification
    Qian Luo
    Jie Shao
    Wanli Dang
    Long Geng
    Huaiyu Zheng
    Chang Liu
    The Visual Computer, 2024, 40 : 3515 - 3527
  • [44] MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification
    Keyu Pan
    Yishi Zhao
    Tao Wang
    Shihong Yao
    Signal, Image and Video Processing, 2023, 17 : 3091 - 3098
  • [45] A pedestrian re-identification method based on multi-feature fusion
    Liu, Yan
    Li, Qingwu
    Chou, Chunchun
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 73 - 76
  • [46] Leader-Based Multi-Scale Attention Deep Architecture for Person Re-Identification
    Qian, Xuelin
    Fu, Yanwei
    Xiang, Tao
    Jiang, Yu-Gang
    Xue, Xiangyang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 371 - 385
  • [47] LOCAL TO GLOBAL WITH MULTI-SCALE ATTENTION NETWORK FOR PERSON RE-IDENTIFICATION
    Sun, Lingchuan
    Liu, Jianlei
    Zhu, Yingxin
    Jiang, Zhuqing
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2254 - 2258
  • [48] An efficient multi-scale channel attention network for person re-identification
    Luo, Qian
    Shao, Jie
    Dang, Wanli
    Geng, Long
    Zheng, Huaiyu
    Liu, Chang
    VISUAL COMPUTER, 2024, 40 (05): : 3515 - 3527
  • [49] MSNet: a lightweight multi-scale deep learning network for pedestrian re-identification
    Pan, Keyu
    Zhao, Yishi
    Wang, Tao
    Yao, Shihong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 3091 - 3098
  • [50] An Efficient Multi-Scale Focusing Attention Network for Person Re-Identification
    Huang, Wei
    Li, Yongying
    Zhang, Kunlin
    Hou, Xiaoyu
    Xu, Jihui
    Su, Ruidan
    Xu, Huaiyu
    APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 16