Video-based person re-identification by intra-frame and inter-frame graph neural network

被引:7
|
作者
Liu, Guiqing [1 ,2 ,4 ]
Wu, Jinzhao [1 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Chengdu Inst Comp Applicat, Chengdu 610041, Sichuan, Peoples R China
[2] Guangxi Univ Nationalities, Coll ASEAN Studies, Nanning 530006, Peoples R China
[3] Guangxi Univ, Coll Math & Informat Sci, Nanning 530004, Peoples R China
[4] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Person re-identification; Graph neural network; Intra and inter frame; Body part; Video matching;
D O I
10.1016/j.imavis.2020.104068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past few years, video-based person re-identification (Re-ID) have attracted growing research attention. The crucial problem for this task is how to learn robust video feature representation, which can weaken the influence of factors such as occlusion, illumination, and background etc. A great deal of previous works utilize spatio-temporal information to represent pedestrian video, but the correlations between parts of human body are ignored. In order to take advantage of the relationship among different parts, we propose a novel Intraframe and Inter-frame Graph Neural Network (I2GNN) to solve the video-based person Re-ID task. Specifically, (1) the features from each part are treated as graph nodes from each frame; (2) the intra-frame edges are established by the correlation between different parts; (3) the inter-frame edges are constructed between the same parts across adjacent frames. I2GNN learns video representations by employing the adjacent matrix of the graph and input features to conduct graph convolution, and then adopts projection metric learning on Grassman manifold to measure the similarities between learned pedestrian features. Moreover, this paper proposes a novel occlusion-invariant term to make the part features close to their center, which can relive several uncontrolled complicated factors, such as occlusion and pose invariance. Besides, we have carried out extensive experiments on four widely used datasets: MARS, DukeMTMC-VideoReID, PRID2011, and iLIDS-VID. The experimental results demonstrate that our proposed I2GNN model is more competitive than other state-of-the-art methods. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [2] Video person re-identification using key frame screening with index and feature reorganization based on inter-frame relation
    Zeng Lu
    Ganghan Zhang
    Guoheng Huang
    Zhiwen Yu
    Chi-Man Pun
    Weiwen Zhang
    Junan Chen
    Wing-Kuen Ling
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 2745 - 2761
  • [3] Video person re-identification using key frame screening with index and feature reorganization based on inter-frame relation
    Lu, Zeng
    Zhang, Ganghan
    Huang, Guoheng
    Yu, Zhiwen
    Pun, Chi-Man
    Zhang, Weiwen
    Chen, Junan
    Ling, Wing-Kuen
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (09) : 2745 - 2761
  • [4] Blind MV-based video steganalysis based on joint inter-frame and intra-frame statistics
    Ghamsarian, Negin
    Schoeffmann, Klaus
    Khademi, Morteza
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 9137 - 9159
  • [5] On the importance of intra-frame and inter-frame covariances in frame transformation theory
    C. Kotsakis
    A. Vatalis
    F. Sansò
    [J]. Journal of Geodesy, 2014, 88 : 1187 - 1201
  • [6] Blind MV-based video steganalysis based on joint inter-frame and intra-frame statistics
    Negin Ghamsarian
    Klaus Schoeffmann
    Morteza Khademi
    [J]. Multimedia Tools and Applications, 2021, 80 : 9137 - 9159
  • [7] On the importance of intra-frame and inter-frame covariances in frame transformation theory
    Kotsakis, C.
    Vatalis, A.
    Sanso, F.
    [J]. JOURNAL OF GEODESY, 2014, 88 (12) : 1187 - 1201
  • [8] A sparse graph wavelet convolution neural network for video-based person re-identification
    Yao, Yingmao
    Jiang, Xiaoyan
    Fujita, Hamido
    Fang, Zhijun
    [J]. PATTERN RECOGNITION, 2022, 129
  • [9] Intra-Frame Deblurring by Leveraging Inter-Frame Camera Motion
    Zhana, Haichao
    Yana, Jianchao
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4036 - 4044
  • [10] A region-based intra-frame rate control scheme by jointing inter-frame dependency and inter-frame correlation
    Hai-Miao Hu
    Mingliang Zhou
    Yang Liu
    Naiyu Yin
    [J]. Multimedia Tools and Applications, 2017, 76 : 12917 - 12940