Scale-fusion framework for improving video-based person re-identification performance

Authors
Li Cheng
Xiao-Yuan Jing
Xiaoke Zhu
Fei Ma
Chang-Hui Hu
Ziyun Cai
Fumin Qi
Affiliations
[1] Wuhan University, School of Computer Science
[2] Guangdong University of Petrochemical Technology, School of Computer
[3] Nanjing University of Posts and Telecommunications, College of Automation
[4] Henan University, School of Computer and Information Engineering
[5] Pingdingshan University, School of Computer Science
[6] National Supercomputing Center in Shenzhen
Keywords
3D convolution; Short-term fast-varying motion information; Recurrent; Scale-fusion; Species invasion
DOI
Not available
Abstract
Video-based person re-identification (re-id), which aims to match people across videos captured by non-overlapping camera views, has attracted considerable research interest in recent years. In this paper, we first propose a novel hybrid 2D and 3D convolution-based recurrent neural network (HCRN) for the video-based person re-id task. Specifically, the 3D convolutional module explores local short-term fast-varying motion information, while the recurrent layer leverages global long-term spatial–temporal information. Based on HCRN, we design a scale-fusion framework that makes full use of features at different scales to further improve video-based person re-id performance. More concretely, the scale-fusion framework preserves a complete subnetwork similar to HCRN for each scale to extract features, and exchanges information between all subnetworks at several stages of the framework. In addition, we propose a training method called species invasion, which further improves the performance of HCRN and the scale-fusion framework by utilizing a large amount of unlabeled data. Experimental results on the publicly available PRID 2011, iLIDS-VID and MARS multi-shot pedestrian re-id datasets demonstrate the effectiveness of the proposed HCRN, the scale-fusion framework and the species invasion training method.
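The division of labour described in the abstract (a convolution over a few adjacent frames for short-term motion, a recurrence over the whole sequence for long-term context, and one subnetwork per scale) can be illustrated with a toy sketch in plain Python. Everything here is an assumption made for illustration and not the authors' implementation: the function names, the fixed difference kernel, the fixed-weight tanh recurrence, and the feature values are all hypothetical, and the fusion step is a simple concatenation at the end, whereas the paper describes information exchange between subnetworks at several stages.

```python
import math

def temporal_conv(frames, kernel):
    """Short-term branch (toy): slide a temporal kernel over adjacent frames.

    Stands in for the temporal axis of a 3D convolution; the spatial axes
    are omitted, so each frame is just a flat feature vector.
    """
    k = len(kernel)
    dim = len(frames[0])
    out = []
    for t in range(len(frames) - k + 1):
        window = frames[t:t + k]
        out.append([sum(w * f[d] for w, f in zip(kernel, window))
                    for d in range(dim)])
    return out

def recurrent_pool(steps, decay=0.5):
    """Long-term branch (toy): h_t = tanh(decay * h_{t-1} + x_t),
    followed by average pooling of the hidden states over time."""
    h = [0.0] * len(steps[0])
    states = []
    for x in steps:
        h = [math.tanh(decay * hi + xi) for hi, xi in zip(h, x)]
        states.append(h)
    return [sum(s[d] for s in states) / len(states) for d in range(len(h))]

def downsample(frames, factor):
    """Coarser scale (toy): average groups of `factor` adjacent feature dims."""
    return [[sum(f[i:i + factor]) / factor for i in range(0, len(f), factor)]
            for f in frames]

def fuse_scales(descriptors):
    """Simplified late fusion: concatenate the per-scale descriptors."""
    fused = []
    for desc in descriptors:
        fused.extend(desc)
    return fused

# Made-up input: 6 frames, each with a 4-dimensional feature vector.
sequence = [[0.1 * (t + d) for d in range(4)] for t in range(6)]
difference_kernel = [-1.0, 0.0, 1.0]  # hypothetical kernel, responds to change

descriptors = []
for factor in (1, 2):  # one toy "subnetwork" per scale
    scaled = downsample(sequence, factor)
    short_term = temporal_conv(scaled, difference_kernel)
    descriptors.append(recurrent_pool(short_term))

video_descriptor = fuse_scales(descriptors)
print(len(video_descriptor))
```

With the two scales above, the fused descriptor has 4 + 2 = 6 entries. In a learned model the kernel and recurrence weights would be trained, and the scales would be spatial resolutions of the frames and not groups of feature dimensions.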
Pages: 12841-12858
Number of pages: 17