Scale-fusion framework for improving video-based person re-identification performance

被引：0

作者：

Li Cheng

Xiao-Yuan Jing

Xiaoke Zhu

Fei Ma

Chang-Hui Hu

Ziyun Cai

Fumin Qi

机构：

[1] Wuhan University,School of Computer Science

[2] Guangdong University of Petrochemical Technology,School of Computer

[3] Nanjing University of Posts and Telecommunications,College of Automation

[4] Henan University,School of Computer and Information Engineering

[5] Pingdingshan University,School of Computer Science

[6] National Supercomputing Center in Shenzhen,undefined

来源：

Neural Computing and Applications | 2020年 / 32卷

关键词：

3D convolution; Short-term fast-varying motion information; Recurrent; Scale-fusion; Species invasion;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Video-based person re-identification (re-id), which aims to match people through videos captured by non-overlapping camera views, has attracted lots of research interest recently. In this paper, we first propose a novel hybrid 2D and 3D convolution-based recurrent neural network (HCRN) for video-based person re-id task. Specifically, the 3D convolutional module can explore the local short-term fast-varying motion information, while the recurrent layer can leverage the global long-term spatial–temporal information. Based on HCRN, we design a scale-fusion framework to make full use of features of different scales to further improve the performance of video-based person re-id. More concretely, the scale-fusion framework preserves a complete subnetwork similar to HCRN for each scale to extract features and exchanges information between all subnetworks at several stages of the framework. Besides, we propose a training method called species invasion to further improve the performance of HCRN and scale-fusion framework by utilizing a large amount of unlabeled data. Experimental results on the publicly available PRID 2011, iLIDS-VID and MARS multi-shot pedestrian re-id datasets demonstrate the effectiveness of the proposed HCRN, scale-fusion framework and species invasion training method.

引用

页码：12841 / 12858

页数：17

共 50 条

[31] Temporal Extension Topology Learning for Video-Based Person Re-identification
Ning, Jiaqi
Li, Fei
Liu, Rujie
Takeuchi, Shun
Suzuki, Genta
COMPUTER VISION - ACCV 2022 WORKSHOPS, 2023, 13848 : 213 - 225
[32] TEMPORALLY ALIGNED POOLING REPRESENTATION FOR VIDEO-BASED PERSON RE-IDENTIFICATION
Gao, Changxin
Wang, Jin
Liu, Leyuan
Yu, Jin-Gang
Sang, Nong
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4284 - 4288
[33] A Duplex Spatiotemporal Filtering Network for Video-based Person Re-identification
Zheng, Chong
Wei, Ping
Zheng, Nanning
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7551 - 7557
[34] CONVOLUTIONAL TEMPORAL ATTENTION MODEL FOR VIDEO-BASED PERSON RE-IDENTIFICATION
Rahman, Tanzila
Rochan, Mrigank
Wang, Yang
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1102 - 1107
[35] Multiscale Aligned SpatialTemporal Interaction for Video-Based Person Re-Identification
Ran, Zhidan
Wei, Xuan
Liu, Wei
Lu, Xiaobo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8536 - 8546
[36] Diverse part attentive network for video-based person re-identification *
Shu, Xiujun
Li, Ge
Wei, Longhui
Zhong, Jia-Xing
Zang, Xianghao
Zhang, Shiliang
Wang, Yaowei
Liang, Yongsheng
Tian, Qi
PATTERN RECOGNITION LETTERS, 2021, 149 : 17 - 23
[37] Diversity Regularized Spatiotemporal Attention for Video-based Person Re-identification
Li, Shuang
Bak, Slawomir
Carr, Peter
Wang, Xiaogang
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 369 - 378
[38] Learning Compact Appearance Representation for Video-Based Person Re-Identification
Zhang, Wei
Hu, Shengnan
Liu, Kan
Zha, Zhengjun
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2442 - 2452
[39] Learning Bidirectional Temporal Cues for Video-Based Person Re-Identification
Zhang, Wei
Yu, Xiaodong
He, Xuanyu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2768 - 2776
[40] Video-Based Person Re-Identification Using Unsupervised Tracklet Matching
Riachy, Chirine
Khelifi, Fouad
Bouridane, Ahmed
IEEE ACCESS, 2019, 7 : 20596 - 20606

← 1 2 3 4 5 →