Multi-granular inter-frame relation exploration and global residual embedding for video-based person re-identification

Authors
[1] Zhu, Zhiqin
[2] Chen, Sixin
[3] Qi, Guanqiu
[4] Li, Huafeng
[5] Gao, Xinbo
Keywords
Adversarial machine learning
DOI
10.1016/j.image.2024.117240
Abstract
In recent years, research on video-based person re-identification (re-ID) has examined in depth how to exploit spatiotemporal clues, which have attracted attention for their potential to provide comprehensive view representations of pedestrians. However, although the discriminability and correlation of spatiotemporal features are often studied, the complex relationships between these features have been relatively neglected. In particular, when dealing with multi-granularity features, depicting the different spatial representations of the same person under different perspectives is a challenge. To address it, this paper proposes a multi-granularity inter-frame relationship exploration and global residual embedding network. The method extracts more comprehensive and discriminative feature representations by exploring the interactions and global differences between multi-granularity features. Specifically, it models the dynamic relationships among features of different granularities in long video sequences and uses a structured perceptual adjacency matrix to synthesize spatiotemporal information, so that cross-granularity information is integrated into individual features. In addition, a residual learning mechanism guides the diversified development of global features and reduces the negative impact of factors such as occlusion. Experimental results on three mainstream benchmark datasets verify the effectiveness of the method, which significantly surpasses state-of-the-art solutions. These results indicate that the method identifies and exploits the complex relationships between multi-granularity spatiotemporal features in video-based person re-ID. © 2024 Elsevier B.V.
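The abstract gives no implementation details, so the following is only a minimal illustrative sketch of the general idea, not the authors' network. It assumes that each frame is a small grid of row features, that "multi-granularity" means pooling those rows into 1, 2 and 4 horizontal stripes, that the adjacency matrix is a row-softmax over cosine similarities between all part features across frames, and that the "global residual" is each node's deviation from the sequence-level mean. All function names and the toy input are hypothetical.

```python
import math

def split_parts(frame, granularities=(1, 2, 4)):
    """Average-pool a frame's row features into horizontal stripes (assumed scheme)."""
    parts = []
    for g in granularities:
        step = len(frame) // g
        for k in range(g):
            rows = frame[k * step:(k + 1) * step]
            parts.append([sum(col) / len(rows) for col in zip(*rows)])
    return parts

def cosine(u, v):
    """Cosine similarity between two feature vectors."""
    nu = math.sqrt(sum(x * x for x in u))
    nv = math.sqrt(sum(x * x for x in v))
    return sum(a * b for a, b in zip(u, v)) / (nu * nv + 1e-12)

def row_softmax(matrix):
    """Normalize each row so its entries are positive and sum to 1."""
    out = []
    for row in matrix:
        m = max(row)
        exps = [math.exp(x - m) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def propagate(nodes):
    """One message-passing step: node + adjacency-weighted mix of all nodes."""
    adj = row_softmax([[cosine(u, v) for v in nodes] for u in nodes])
    dim = len(nodes[0])
    updated = []
    for i, node in enumerate(nodes):
        mix = [sum(adj[i][j] * nodes[j][d] for j in range(len(nodes)))
               for d in range(dim)]
        updated.append([n + m for n, m in zip(node, mix)])
    return adj, updated

def global_residual(nodes):
    """Sequence-level mean feature and each node's residual from it (assumed)."""
    mean = [sum(col) / len(nodes) for col in zip(*nodes)]
    residuals = [[x - m for x, m in zip(node, mean)] for node in nodes]
    return mean, residuals

# Toy input: 3 frames, each with 4 row features of dimension 3.
video = [[[math.sin(t + 0.5 * r + d) for d in range(3)]
          for r in range(4)] for t in range(3)]

# Every part of every frame becomes one graph node: 3 frames x (1 + 2 + 4) parts.
nodes = [part for frame in video for part in split_parts(frame)]
adj, updated = propagate(nodes)
mean, residuals = global_residual(updated)
```

Because the adjacency matrix spans parts of every granularity in every frame, a single propagation step already mixes cross-granularity and cross-frame information into each part feature. A trained model would learn the projections and the adjacency rather than use raw cosine similarity.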
Related papers
50 in total
  • [1] Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification
    Yan, Yichao
    Qin, Jie
    Chen, Jiaxin
    Liu, Li
    Zhu, Fan
    Tai, Ying
    Shao, Ling
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2896 - 2905
  • [2] Video-based person re-identification by intra-frame and inter-frame graph neural network
    Liu, Guiqing
    Wu, Jinzhao
    [J]. IMAGE AND VISION COMPUTING, 2021, 106
  • [3] MSTN: A Multi-granular Spatial-Temporal Network for video-based person re-identification
    Zhao, Wei
    Zhang, Bo
    Yang, Cong
    Chen, Xianfu
    Chen, Hui
    [J]. INTERNET OF THINGS, 2022, 20
  • [4] Relation network based on multi-granular hypergraphs for person re-identification
    Guo, Chenchen
    Zhao, Xiaoming
    Zou, Qiang
    [J]. APPLIED INTELLIGENCE, 2022, 52 (10) : 11394 - 11406
  • [5] Relation network based on multi-granular hypergraphs for person re-identification
    Chenchen Guo
    Xiaoming Zhao
    Qiang Zou
    [J]. Applied Intelligence, 2022, 52 : 11394 - 11406
  • [6] Video person re-identification using key frame screening with index and feature reorganization based on inter-frame relation
    Zeng Lu
    Ganghan Zhang
    Guoheng Huang
    Zhiwen Yu
    Chi-Man Pun
    Weiwen Zhang
    Junan Chen
    Wing-Kuen Ling
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 2745 - 2761
  • [7] Video person re-identification using key frame screening with index and feature reorganization based on inter-frame relation
    Lu, Zeng
    Zhang, Ganghan
    Huang, Guoheng
    Yu, Zhiwen
    Pun, Chi-Man
    Zhang, Weiwen
    Chen, Junan
    Ling, Wing-Kuen
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (09) : 2745 - 2761
  • [8] Video-based person re-identification with scene and person attributes
    Gong, Xun
    Luo, Bin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 8117 - 8128
  • [9] Video-based person re-identification with scene and person attributes
    Xun Gong
    Bin Luo
    [J]. Multimedia Tools and Applications, 2024, 83 : 8117 - 8128
  • [10] Video-Based Convolutional Attention for Person Re-Identification
    Zamprogno, Marco
    Passon, Marco
    Martinel, Niki
    Serra, Giuseppe
    Lancioni, Giuseppe
    Micheloni, Christian
    Tasso, Carlo
    Foresti, Gian Luca
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 : 3 - 14