Cross-modal pedestrian re-identification technique based on multi-scale feature attention and strategy balancing

被引:0
|
作者
Lai, Yiqiang [1 ]
机构
[1] Guangdong Univ Foreign Stusdies, South China Business Coll, Guangzhou 510545, Guangdong, Peoples R China
来源
ENGINEERING RESEARCH EXPRESS | 2025年 / 7卷 / 01期
关键词
multi-scale feature extraction; attention mechanism; cross-modal learning; pedestrian re-identification; strategy balancing mechanism; DEEP;
D O I
10.1088/2631-8695/adb93c
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper proposes a cross-modal pedestrian re-recognition technique based on the balance of attention and strategy of multi-scale features. The technique improves recognition accuracy by integrating information from different scales, dynamically adjusting attention, and balancing contributions from different modalities. The model architecture includes a multi-scale feature extraction module, an attention mechanism, a strategy balancing mechanism, and a classifier. Experimental results show that the proposed model exhibits superior performance on several public datasets such as Market-1501, DukeMTMC-reID, and CUHK03, especially on the Market-1501 dataset, where MAP and Rank-1 reach 0.83 and 0.89, respectively, which outperforms the existing baseline model and other methods. In addition, by integrating RGB and Thermal modal information, the model's recognition ability is further improved, showing the effectiveness of cross-modal information integration.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Chest radiology report generation based on cross-modal multi-scale feature fusion
    Pan, Yu
    Liu, Li -Jun
    Yang, Xiao-Bing
    Peng, Wei
    Huang, Qing-Song
    JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2024, 17 (01)
  • [32] Depth map reconstruction method based on multi-scale cross-modal feature fusion
    Yang J.
    Xie T.
    Yue H.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2023, 51 (03): : 52 - 59
  • [33] Feature Enhancement and Multi-scale Cross-Modal Attention for RGB-D Salient Object Detection
    Wan, Xin
    Yang, Gang
    Zhou, Boyi
    Liu, Chang
    Wang, Hangxu
    Wang, Yutao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 409 - 420
  • [34] Reciprocal bi-directional generative adversarial network for cross-modal pedestrian re-identification
    Wei Z.
    Yang X.
    Wang N.
    Yang D.
    Gao X.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (02): : 205 - 212
  • [35] A Cross-modal Person Re-identification Method Based on Hybrid Channel Augmentation with Structured Dual Attention
    Zhuang J.
    Zhuang Y.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (02): : 518 - 526
  • [36] AXM-Net: Implicit Cross-Modal Feature Alignment for Person Re-identification
    Farooq, Ammarah
    Awais, Muhammad
    Kittler, Josef
    Khalid, Syed Safwan
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4477 - 4485
  • [37] Multi-Branch Person Re-Identification Basedon Multi-Scale Attention
    Cong, Li
    Min, Jiang
    Jun, Kong
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (20)
  • [38] Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
    Yang Wang
    Jinjia Peng
    Huibing Wang
    Meng Wang
    Science China Information Sciences, 2022, 65
  • [39] Progressive learning with multi-scale attention network for cross-domain vehicle re-identification
    Wang, Yang
    Peng, Jinjia
    Wang, Huibing
    Wang, Meng
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (06)
  • [40] Multi-scale and multi-branch feature representation for person re-identification
    Jiao, Shanshan
    Pan, Zhisong
    Hu, Guyu
    Shen, Qing
    Du, Lin
    Chen, Yutian
    Wang, Jiabao
    NEUROCOMPUTING, 2020, 414 : 120 - 130