Multi-granular inter-frame relation exploration and global residual embedding for video-based person re-identification

被引:0
|
作者
机构
[1] [1,Zhu, Zhiqin
[2] Chen, Sixin
[3] Qi, Guanqiu
[4] Li, Huafeng
[5] 1,Gao, Xinbo
关键词
Adversarial machine learning;
D O I
10.1016/j.image.2024.117240
中图分类号
学科分类号
摘要
In recent years, the field of video-based person re-identification (re-ID) has conducted in-depth research on how to effectively utilize spatiotemporal clues, which has attracted attention for its potential in providing comprehensive view representations of pedestrians. However, although the discriminability and correlation of spatiotemporal features are often studied, the exploration of the complex relationships between these features has been relatively neglected. Especially when dealing with multi-granularity features, how to depict the different spatial representations of the same person under different perspectives becomes a challenge. To address this challenge, this paper proposes a multi-granularity inter-frame relationship exploration and global residual embedding network specifically designed to solve the above problems. This method successfully extracts more comprehensive and discriminative feature representations by deeply exploring the interactions and global differences between multi-granularity features. Specifically, by simulating the dynamic relationship of different granularity features in long video sequences and using a structured perceptual adjacency matrix to synthesize spatiotemporal information, cross-granularity information is effectively integrated into individual features. In addition, by introducing a residual learning mechanism, this method can also guide the diversified development of global features and reduce the negative impacts caused by factors such as occlusion. Experimental results verify the effectiveness of this method on three mainstream benchmark datasets, significantly surpassing state-of-the-art solutions. This shows that this paper successfully solves the challenging problem of how to accurately identify and utilize the complex relationships between multi-granularity spatiotemporal features in video-based person re-ID. © 2024 Elsevier B.V.
引用
收藏
相关论文
共 50 条
  • [21] Incorporating texture and silhouette for video-based person re-identification
    Bai, Shutao
    Chang, Hong
    Ma, Bingpeng
    PATTERN RECOGNITION, 2024, 156
  • [22] Effective Similarity Measurement for Video-based Person Re-identification
    Liu, Yiheng
    Xie, Chao
    Zhou, Wengang
    Li, Houqiang
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [23] Recurrent Convolutional Network for Video-based Person Re-Identification
    McLaughlin, Niall
    del Rincon, Jesus Martinez
    Miller, Paul
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1325 - 1334
  • [24] Top-push Video-based Person Re-identification
    You, Jinjie
    Wu, Ancong
    Li, Xiang
    Zheng, Wei-Shi
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1345 - 1353
  • [25] Dense Interaction Learning for Video-based Person Re-identification
    He, Tianyu
    Jin, Xin
    Shen, Xu
    Huang, Jianqiang
    Chen, Zhibo
    Hua, Xian-Sheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1470 - 1481
  • [26] Motion Feature Aggregation for Video-Based Person Re-Identification
    Gu, Xinqian
    Chang, Hong
    Ma, Bingpeng
    Shan, Shiguang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3908 - 3919
  • [27] Triplet Attention Network for Video-Based Person Re-Identification
    Sun, Rui
    Liang, Qili
    Yang, Zi
    Zhao, Zhenghui
    Zhang, Xudong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (10) : 1775 - 1779
  • [28] Video-based Person Re-identification without Bells and Whistles
    Liu, Chih-Ting
    Chen, Jun-Cheng
    Chen, Chu-Song
    Chien, Shao-Yi
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1491 - 1500
  • [29] Keypoint Message Passing for Video-Based Person Re-Identification
    Chen, Di
    Doring, Andreas
    Zhang, Shanshan
    Yang, Jian
    Gall, Juergen
    Schiele, Bernt
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 239 - 247
  • [30] Local and global aligned spatiotemporal attention network for video-based person re-identification
    Li Cheng
    Xiao-Yuan Jing
    Xiaoke Zhu
    Chang-Hui Hu
    Guangwei Gao
    Songsong Wu
    Multimedia Tools and Applications, 2020, 79 : 34489 - 34512