Graph based Spatial-temporal Fusion for Multi-modal Person Re-identification

Cited by: 0
Authors
Zhang, Yaobin [1]
Lv, Jianming [1]
Liu, Chen [2]
Cai, Hongmin [1]
Affiliations
[1] South China Univ Technol, Guangzhou, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
Keywords
Unsupervised Person re-ID; Spatio-temporal; Graph; Re-ranking; ADAPTATION;
DOI
10.1145/3581783.3613757
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As a challenging task, unsupervised person re-identification (Re-ID) aims to optimize a pedestrian matching model using unlabeled image frames from surveillance videos. Recently, fusion with the spatio-temporal clues of pedestrians has been proven effective in improving classification performance. However, most of these methods adopt hard combination approaches that multiply the visual scores by the spatio-temporal scores. Such approaches are sensitive to the noise caused by imprecise estimation of the spatio-temporal patterns in unlabeled datasets, which limits the advantage of the fusion model. In this paper, we propose a Graph based Spatio-Temporal Fusion model for high-performance multi-modal person Re-ID, namely G-Fusion, to mitigate the impact of noise. In particular, we construct a graph of pedestrian images by selecting neighboring nodes based on the visual information and the transition time between cameras. Then we use a randomly initialized two-layer GraphSAGE model to obtain the multi-modal affinity matrix between images, and deploy distillation learning to optimize the visual model by learning the affinity between the nodes. Finally, a graph-based multi-modal re-ranking method is deployed to make the decision in the testing phase for precise person Re-ID. Comprehensive experiments are conducted on two large-scale Re-ID datasets, and the results show that our method achieves a significant performance improvement when combined with SOTA unsupervised person Re-ID methods. Specifically, the mAP scores reach 92.2% and 80.4% on the Market-1501 and MSMT17 datasets, respectively.
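The noise sensitivity of hard combination described in the abstract can be illustrated with a toy example. The sketch below is not the G-Fusion method and is not taken from the paper: the function names `hard_fusion` and `top_k_neighbors`, the 3x3 score matrices, and the top-k selection rule are all assumptions made for illustration. It only shows how a single badly estimated spatio-temporal score can override a strong visual match when the two scores are multiplied.

```python
def hard_fusion(visual, st):
    """Element-wise product of visual and spatio-temporal score matrices.

    This is the 'hard combination' baseline the abstract criticizes,
    written here as an illustrative assumption, not the paper's code.
    """
    return [[v * s for v, s in zip(v_row, s_row)]
            for v_row, s_row in zip(visual, st)]


def top_k_neighbors(scores, k):
    """For each image i, return the k other images with the highest score.

    Hypothetical neighbor-selection rule used only for this toy example.
    """
    neighbors = []
    for i, row in enumerate(scores):
        candidates = [j for j in range(len(row)) if j != i]  # exclude self
        candidates.sort(key=lambda j: row[j], reverse=True)
        neighbors.append(candidates[:k])
    return neighbors


# Toy visual similarities: images 0 and 1 look alike (score 0.9).
visual = [
    [1.0, 0.9, 0.2],
    [0.9, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]

# Toy spatio-temporal scores: the (0, 1) transition is wrongly
# estimated as impossible (0.0), simulating estimation noise.
st = [
    [1.0, 0.0, 0.8],
    [0.0, 1.0, 0.5],
    [0.8, 0.5, 1.0],
]

visual_only = top_k_neighbors(visual, k=1)
fused = top_k_neighbors(hard_fusion(visual, st), k=1)

print("visual-only neighbors:", visual_only)
print("hard-fusion neighbors:", fused)
```

In this toy setup the visual scores alone pair image 0 with image 1, while the product of the two scores discards that pair entirely because of one noisy spatio-temporal estimate. This is the kind of failure that a learned, graph-based affinity is intended to soften.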
Pages: 3736-3744
Number of pages: 9