Graph based Spatial-temporal Fusion for Multi-modal Person Re-identification

被引:0
|
作者
Zhang, Yaobin [1 ]
Lv, Jianming [1 ]
Liu, Chen [2 ]
Cai, Hongmin [1 ]
机构
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
关键词
Unsupervised Person re-ID; Spatio-temporal; Graph; Re-ranking; ADAPTATION;
D O I
10.1145/3581783.3613757
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a challenging task, unsupervised person re-identification (Re-ID) aims to optimize the pedestrian matching model based on the unlabeled image frames from surveillance videos. Recently, the fusion with the spatio-temporal clues of pedestrians have been proven effective to improve the performance of classification. However, most of these methods adopt some hard combination approaches by multiplying the visual scores with the spatio-temporal scores, which are sensitive to the noise caused by imprecise estimation of the spatio-temporal patterns in unlabeled datasets and limit the advantage of the fusion model. In this paper, we propose a Graph based Spatio-Temporal Fusion model for high-performance multi-modal person Re-ID, namely G-Fusion, to mitigate the impact of noise. In particular, we construct a graph of pedestrian images by selecting neighboring nodes based on the visual information and the transition time between cameras. Then we use a randomly initialized two-layer GraphSAGE model to obtain the multi-modal affinity matrix between images, and deploy the distillation learning to optimize the visual model by learning the affinity between the nodes. Finally, a graph-based multi-modal re-ranking method is deployed to make the decision in the testing phase for precise person Re-ID. Comprehensive experiments are conducted on two large-scale Re-ID datasets, and the results show that our method achieves a significant improvement of the performance while combined with SOTA unsupervised person Re-ID methods. Specifically, the mAP scores can reach 92.2%, and 80.4% on the Market-1501, and MSMT17 datasets respectively.
引用
收藏
页码:3736 / 3744
页数:9
相关论文
共 50 条
  • [41] Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification
    Xu, Shuangjie
    Cheng, Yu
    Gu, Kang
    Yang, Yang
    Chang, Shiyu
    Zhou, Pan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4743 - 4752
  • [42] A multi-modal spatial-temporal model for accurate motion forecasting with visual fusion
    Wang, Xiaoding
    Liu, Jianmin
    Lin, Hui
    Garg, Sahil
    Alrashoud, Mubarak
    INFORMATION FUSION, 2024, 102
  • [43] Multi-modal Gait Recognition via Effective Spatial-Temporal Feature Fusion
    Cui, Yufeng
    Kang, Yimei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17949 - 17957
  • [44] Multi-modal uniform deep learning for RGB-D person re-identification
    Ren, Liangliang
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    PATTERN RECOGNITION, 2017, 72 : 446 - 457
  • [45] Occluded Video-Based Person Re-Identification Based on Spatial- Temporal Trajectory Fusion
    Yun Xiao
    Song Kaili
    Zhang Xiaoguang
    Yuan Xinchao
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (10)
  • [46] STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification
    Fu, Yang
    Wang, Xiaoyang
    Wei, Yunchao
    Huang, Thomas
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8287 - 8294
  • [47] Separable Spatial-Temporal Residual Graph for Cloth-Changing Group Re-Identification
    Zhang, Quan
    Lai, Jianhuang
    Xie, Xiaohua
    Jin, Xiaofeng
    Huang, Sien
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5791 - 5805
  • [48] Multi-Level Fusion Temporal-Spatial Co-Attention for Video-Based Person Re-Identification
    Pei, Shengyu
    Fan, Xiaoping
    ENTROPY, 2021, 23 (12)
  • [49] A Weighted Center Graph Fusion Method for Person Re-Identification
    Geng, Shuze
    Yu, Ming
    Guo, Yingchun
    Yu, Yang
    IEEE ACCESS, 2019, 7 : 23329 - 23342
  • [50] Person Re-identification by Multi-hypergraph Fusion
    An, Le
    Chen, Xiaojing
    Yang, Songfan
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (11) : 2763 - 2774