Graph based Spatial-temporal Fusion for Multi-modal Person Re-identification

被引:0
|
作者
Zhang, Yaobin [1 ]
Lv, Jianming [1 ]
Liu, Chen [2 ]
Cai, Hongmin [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
关键词
Unsupervised Person re-ID; Spatio-temporal; Graph; Re-ranking; ADAPTATION;
D O I
10.1145/3581783.3613757
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a challenging task, unsupervised person re-identification (Re-ID) aims to optimize the pedestrian matching model based on the unlabeled image frames from surveillance videos. Recently, the fusion with the spatio-temporal clues of pedestrians have been proven effective to improve the performance of classification. However, most of these methods adopt some hard combination approaches by multiplying the visual scores with the spatio-temporal scores, which are sensitive to the noise caused by imprecise estimation of the spatio-temporal patterns in unlabeled datasets and limit the advantage of the fusion model. In this paper, we propose a Graph based Spatio-Temporal Fusion model for high-performance multi-modal person Re-ID, namely G-Fusion, to mitigate the impact of noise. In particular, we construct a graph of pedestrian images by selecting neighboring nodes based on the visual information and the transition time between cameras. Then we use a randomly initialized two-layer GraphSAGE model to obtain the multi-modal affinity matrix between images, and deploy the distillation learning to optimize the visual model by learning the affinity between the nodes. Finally, a graph-based multi-modal re-ranking method is deployed to make the decision in the testing phase for precise person Re-ID. Comprehensive experiments are conducted on two large-scale Re-ID datasets, and the results show that our method achieves a significant improvement of the performance while combined with SOTA unsupervised person Re-ID methods. Specifically, the mAP scores can reach 92.2%, and 80.4% on the Market-1501, and MSMT17 datasets respectively.
引用
收藏
页码:3736 / 3744
页数:9
相关论文
共 50 条
  • [31] Joint graph regularized dictionary learning and sparse ranking for multi-modal multi-shot person re-identification
    Zheng, Aihua
    Li, Hongchao
    Jiang, Bo
    Zheng, Wei-Shi
    Luo, Bin
    PATTERN RECOGNITION, 2020, 104 (104)
  • [32] Heterogeneous Test-Time Training for Multi-Modal Person Re-identification
    Wang, Zi
    Huang, Huaibo
    Zheng, Aihua
    He, Ran
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5850 - 5858
  • [33] LRMM: Low rank multi-scale multi-modal fusion for person re-identification based on RGB-NI-TI
    Wu, Di
    Liu, Zhihui
    Chen, Zihan
    Gan, Shenglong
    Tan, Kaiwen
    Wan, Qin
    Wang, Yaonan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [34] Spatial-Temporal Omni-Scale Feature Learning for Person Re-Identification
    Ploco, Aida
    Rodriguez, Andrea Macarulla
    Geradts, Zeno
    2020 8TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF 2020), 2020,
  • [35] Spatial-Temporal Federated Learning for Lifelong Person Re-Identification on Distributed Edges
    Zhang, Lei
    Gao, Guanyu
    Zhang, Huaizheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1884 - 1896
  • [36] Combined visual and spatial-temporal information for appearance change person re-identification
    Bilakeri, Shavantrevva
    Kotegar, Karunakar A.
    COGENT ENGINEERING, 2023, 10 (01):
  • [37] Spatial-temporal representatives selection and weighted patch descriptor for person re-identification
    Zheng, Aihua
    Wang, Foqin
    Hussain, Amir
    Tang, Jin
    Jiang, Bo
    NEUROCOMPUTING, 2018, 290 : 121 - 129
  • [38] Learning Instance-level Spatial-Temporal Patterns for Person Re-identification
    Ren, Min
    He, Lingxiao
    Liao, Xingyu
    Liu, Wu
    Wang, Yunlong
    Tan, Tieniu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14910 - 14919
  • [39] Joint Attentive Spatial-Temporal Feature Aggregation for Video-Based Person Re-Identification
    Chen, Lin
    Yang, Hua
    Gao, Zhiyong
    IEEE ACCESS, 2019, 7 : 41230 - 41240
  • [40] Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification
    Chen, Guangyi
    Lu, Jiwen
    Yang, Ming
    Zhou, Jie
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4192 - 4205