Video-Based Person Re-Identification by an End-To-End Learning Architecture with Hybrid Deep Appearance-Temporal Feature

被引:5
|
作者
Sun, Rui [1 ]
Huang, Qiheng [1 ]
Xia, Miaomiao [1 ]
Zhang, Jun [1 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Feicui Rd 420, Hefei 230000, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
person re-identification; end-to-end architecture; appearance-temporal features; Siamese network; pivotal frames;
D O I
10.3390/s18113669
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Video-based person re-identification is an important task with the challenges of lighting variation, low-resolution images, background clutter, occlusion, and human appearance similarity in the multi-camera visual sensor networks. In this paper, we propose a video-based person re-identification method called the end-to-end learning architecture with hybrid deep appearance-temporal feature. It can learn the appearance features of pivotal frames, the temporal features, and the independent distance metric of different features. This architecture consists of two-stream deep feature structure and two Siamese networks. For the first-stream structure, we propose the Two-branch Appearance Feature (TAF) sub-structure to obtain the appearance information of persons, and used one of the two Siamese networks to learn the similarity of appearance features of a pairwise person. To utilize the temporal information, we designed the second-stream structure that consisting of the Optical flow Temporal Feature (OTF) sub-structure and another Siamese network, to learn the person's temporal features and the distances of pairwise features. In addition, we select the pivotal frames of video as inputs to the Inception-V3 network on the Two-branch Appearance Feature sub-structure, and employ the salience-learning fusion layer to fuse the learned global and local appearance features. Extensive experimental results on the PRID2011, iLIDS-VID, and Motion Analysis and Re-identification Set (MARS) datasets showed that the respective proposed architectures reached 79%, 59% and 72% at Rank-1 and had advantages over state-of-the-art algorithms. Meanwhile, it also improved the feature representation ability of persons.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Deep multi-instance learning for end-to-end person re-identification
    Yuan, Caihong
    Xu, Chunyan
    Wang, Tianjiang
    Liu, Fang
    Zhao, Zhiqiang
    Feng, Ping
    Guo, Jingjuan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (10) : 12437 - 12467
  • [2] Deep multi-instance learning for end-to-end person re-identification
    Caihong Yuan
    Chunyan Xu
    Tianjiang Wang
    Fang Liu
    Zhiqiang Zhao
    Ping Feng
    Jingjuan Guo
    Multimedia Tools and Applications, 2018, 77 : 12437 - 12467
  • [3] Learning Compact Appearance Representation for Video-Based Person Re-Identification
    Zhang, Wei
    Hu, Shengnan
    Liu, Kan
    Zha, Zhengjun
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2442 - 2452
  • [4] Video-based Person Re-identification by Deep Feature Guided Pooling
    Li, Youjiao
    Zhuo, Li
    Li, Jiafeng
    Zhang, Jing
    Liang, Xi
    Tian, Qi
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1454 - 1461
  • [5] Sequences consistency feature learning for video-based person re-identification
    Zhao, Kai
    Cheng, Deqiang
    Kou, Qiqi
    Li, Jiahan
    Liu, Ruihang
    ELECTRONICS LETTERS, 2022, 58 (04) : 142 - 144
  • [6] Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification
    Zhang, Wei
    He, Xuanyu
    Lu, Weizhi
    Qiao, Hong
    Li, Yibin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (12) : 3847 - 3852
  • [7] Deep Regression Neural Network for End-to-End Person Re-Identification
    Guo, Yingchun
    Zhao, Kunpeng
    Hao, Xiaoke
    Yu, Ming
    IEEE ACCESS, 2019, 7 : 92825 - 92837
  • [8] Temporal Extension Topology Learning for Video-Based Person Re-identification
    Ning, Jiaqi
    Li, Fei
    Liu, Rujie
    Takeuchi, Shun
    Suzuki, Genta
    COMPUTER VISION - ACCV 2022 WORKSHOPS, 2023, 13848 : 213 - 225
  • [9] Learning Bidirectional Temporal Cues for Video-Based Person Re-Identification
    Zhang, Wei
    Yu, Xiaodong
    He, Xuanyu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2768 - 2776
  • [10] End-to-End Video Surveillance Framework for Anomaly Detection and Person Re-identification
    Nandan, Rohan
    Lingeri, Rohan
    Mehta, Rohan
    Kanwal, Preet
    Atluri, Rishita
    DEEP LEARNING THEORY AND APPLICATIONS, PT I, DELTA 2024, 2024, 2171 : 328 - 339