Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification

被引：217

作者：

Xu, Shuangjie ^{[1
]}

Cheng, Yu ^{[2
]}

Gu, Kang ^{[1
]}

Yang, Yang ^{[3
]}

Chang, Shiyu ^{[4
]}

Zhou, Pan ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Wuhan, Hubei, Peoples R China

[2] IBM Res, AI Fdn, Armonk, NY USA

[3] Northwestern Univ, Evanston, IL 60208 USA

[4] IBM TJ Watson Res Ctr, Ossining, NY 10562 USA

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年

关键词：

D O I：

10.1109/ICCV.2017.507

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Person Re-Identification (person re-id) is a crucial task as its applications in visual surveillance and human-computer interaction. In this work, we present a novel joint Spatial and Temporal Attention Pooling Network (ASTPN) for video-based person re-identification, which enables the feature extractor to be aware of the current input video sequences, in a way that interdependency from the matching items can directly influence the computation of each other's representation. Specifically, the spatial pooling layer is able to select regions from each frame, while the attention temporal pooling performed can select informative frames over the sequence, both pooling guided by the information from distance matching. Experiments are conduced on the iLIDS-VID, PRID-2011 and MARS datasets and the results demonstrate that this approach outperforms existing state-of-art methods. We also analyze how the joint pooling in both dimensions can boost the person re-id performance more effectively than using either of them separately(1).

引用

页码：4743 / 4752

页数：10

共 50 条

[1] Joint Attentive Spatial-Temporal Feature Aggregation for Video-Based Person Re-Identification
Chen, Lin
Yang, Hua
Gao, Zhiyong
[J]. IEEE ACCESS, 2019, 7 : 41230 - 41240
[2] Jointly Temporal Pooling Networks and Multi-loss Fusion for Video-based Person Re-Identification
Xu, Huanhuan
Sun, Xuemei
[J]. PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 432 - 435
[3] Spatial-temporal aware network for video-based person re-identification
Jun Wang
Qi Zhao
Di Jia
Ziqing Huang
Miaohui Zhang
Xing Ren
[J]. Multimedia Tools and Applications, 2024, 83 : 36355 - 36373
[4] Pyramid Spatial-Temporal Aggregation for Video-based Person Re-Identification
Wang, Yingquan
Zhang, Pingping
Gao, Shang
Geng, Xia
Lu, Hu
Wang, Dong
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12006 - 12015
[5] Spatial-temporal aware network for video-based person re-identification
Wang, Jun
Zhao, Qi
Jia, Di
Huang, Ziqing
Zhang, Miaohui
Ren, Xing
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 36355 - 36373
[6] Video-based Person Re-identification with Spatial and Temporal Memory Networks
Eom, Chanho
Lee, Geon
Lee, Junghyup
Ham, Bumsub
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12016 - 12025
[7] Deep Spatial-Temporal Fusion Network for Video-Based Person Re-Identification
Chen, Lin
Yang, Hua
Zhu, Ji
Zhou, Qin
Wu, Shuang
Gao, Zhiyong
[J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1478 - 1485
[8] Video-based person re-identification with parallel spatial-temporal attention module
Kong, Jun
Teng, Zhende
Jiang, Min
Huo, Hongtao
[J]. JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (01)
[9] Spatial-Temporal Graph Convolutional Network for Video-based Person Re-identification
Yang, Jinrui
Zheng, Wei-Shi
Yang, Qize
Chen, Ying-Cong
Tian, Qi
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 3286 - 3296
[10] Spatial-Temporal Attention-Aware Learning for Video-Based Person Re-Identification
Chen, Guangyi
Lu, Jiwen
Yang, Ming
Zhou, Jie
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4192 - 4205

← 1 2 3 4 5 →