Video-Based Person Re-Identification by an End-To-End Learning Architecture with Hybrid Deep Appearance-Temporal Feature

被引:5
|
作者
Sun, Rui [1 ]
Huang, Qiheng [1 ]
Xia, Miaomiao [1 ]
Zhang, Jun [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Feicui Rd 420, Hefei 230000, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
person re-identification; end-to-end architecture; appearance-temporal features; Siamese network; pivotal frames;
D O I
10.3390/s18113669
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Video-based person re-identification is an important task with the challenges of lighting variation, low-resolution images, background clutter, occlusion, and human appearance similarity in the multi-camera visual sensor networks. In this paper, we propose a video-based person re-identification method called the end-to-end learning architecture with hybrid deep appearance-temporal feature. It can learn the appearance features of pivotal frames, the temporal features, and the independent distance metric of different features. This architecture consists of two-stream deep feature structure and two Siamese networks. For the first-stream structure, we propose the Two-branch Appearance Feature (TAF) sub-structure to obtain the appearance information of persons, and used one of the two Siamese networks to learn the similarity of appearance features of a pairwise person. To utilize the temporal information, we designed the second-stream structure that consisting of the Optical flow Temporal Feature (OTF) sub-structure and another Siamese network, to learn the person's temporal features and the distances of pairwise features. In addition, we select the pivotal frames of video as inputs to the Inception-V3 network on the Two-branch Appearance Feature sub-structure, and employ the salience-learning fusion layer to fuse the learned global and local appearance features. Extensive experimental results on the PRID2011, iLIDS-VID, and Motion Analysis and Re-identification Set (MARS) datasets showed that the respective proposed architectures reached 79%, 59% and 72% at Rank-1 and had advantages over state-of-the-art algorithms. Meanwhile, it also improved the feature representation ability of persons.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Video-based person re-identification with scene and person attributes
    Xun Gong
    Bin Luo
    Multimedia Tools and Applications, 2024, 83 : 8117 - 8128
  • [42] A review on video person re-identification based on deep learning
    Ma, Haifei
    Zhang, Canlong
    Zhang, Yifeng
    Li, Zhixin
    Wang, Zhiwen
    Wei, Chunrong
    NEUROCOMPUTING, 2024, 609
  • [43] STFE: A Comprehensive Video-Based Person Re-Identification Network Based on Spatio-Temporal Feature Enhancement
    Yang, Xi
    Wang, Xian
    Liu, Liangchen
    Wang, Nannan
    Gao, Xinbo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7237 - 7249
  • [44] Exploring Frontier Technologies in Video-Based Person Re-Identification: A Survey on Deep Learning Approach
    Wang, Jiahe
    Gao, Xizhan
    Zhu, Fa
    Chen, Xingchi
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (01): : 25 - 51
  • [45] A SPATIO-TEMPORAL APPEARANCE REPRESENTATION FOR VIDEO-BASED PEDESTRIAN RE-IDENTIFICATION
    Liu, Kan
    Ma, Bingpeng
    Zhang, Wei
    Huang, Rui
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3810 - 3818
  • [46] Video-based person re-identification based on regularised hull distance learning
    Xu, Xiaoyue
    Chen, Ying
    IET COMPUTER VISION, 2019, 13 (04) : 385 - 394
  • [47] Temporal Attention Quality Aware Network for Video-based Person Re-Identification
    Xu, Boqin
    Liu, Changhong
    Xue, Shengjun
    Jiang, Aiwen
    Wang, Shimin
    Ye, Jihua
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [48] Video-Based Person Re-identification with Improved Temporal Attention And Spatial Memory
    Liu, Peishun
    Chen, He
    Tang, Ruichun
    Wang, Xuefang
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 448 - 454
  • [49] Saliency and Granularity: Discovering Temporal Coherence for Video-Based Person Re-Identification
    Chen, Cuiqun
    Ye, Mang
    Qi, Meibin
    Wu, Jingjing
    Liu, Yimin
    Jiang, Jianguo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6100 - 6112
  • [50] Temporal-Contextual Attention Network for Video-Based Person Re-identification
    Chen, Di
    Zha, Zheng-Jun
    Liu, Jiawei
    Xie, Hongtao
    Zhang, Yongdong
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 146 - 157