Multi-Scale 3D Convolution Network for Video Based Person Re-Identification

被引:0
|
作者
Li, Jianing [1 ]
Zhang, Shiliang [1 ]
Huang, Tiejun [1 ]
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
基金
北京市自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a two-stream convolution network to extract spatial and temporal cues for video based person Re-Identification (ReID). A temporal stream in this network is constructed by inserting several Multi-scale 3D (M3D) convolution layers into a 2D CNN network. The resulting M3D convolution network introduces a fraction of parameters into the 2D CNN, but gains the ability of multi-scale temporal feature learning. With this compact architecture, M3D convolution network is also more efficient and easier to optimize than existing 3D convolution networks. The temporal stream further involves Residual Attention Layers (RAL) to refine the temporal features. By jointly learning spatial-temporal attention masks in a residual manner, RAL identifies the discriminative spatial regions and temporal cues. The other stream in our network is implemented with a 2D CNN for spatial feature extraction. The spatial and temporal features from two streams are finally fused for the video based person ReID. Evaluations on three widely used benchmarks datasets, i.e., MARS, PRID2011, and iLIDS-VID demonstrate the substantial advantages of our method over existing 3D convolution networks and state-of-art methods.
引用
收藏
页码:8618 / 8625
页数:8
相关论文
共 50 条
  • [21] Multi-Scale Transformer-Based Matching Network for Generalizable Person Re-Identification
    Jiang, Jinhua
    Zhang, Wenfeng
    Ran, Ruisheng
    Hu, Wei
    Dai, Jiangyan
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1277 - 1281
  • [22] SSN3D: Self-Separated Network to Align Parts for 3D Convolution in Video Person Re-Identification
    Jiang, Xiaoke
    Qiao, Yu
    Yan, Junjie
    Li, Qichen
    Zheng, Wanrong
    Chen, Dapeng
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1691 - 1699
  • [23] Multi-scale feature combination for person re-identification
    Huang, Bailiang
    Piao, Yan
    Zhang, Hao
    Tang, Yanfeng
    [J]. IET IMAGE PROCESSING, 2022, 16 (07) : 2001 - 2011
  • [24] Multi-scale feature representation for person re-identification
    Lu, Jian
    Wang, Hang-Ying
    Chen, Xu
    Zhang, Kai-Bing
    Liu, Wei
    [J]. Kongzhi yu Juece/Control and Decision, 2021, 36 (12): : 3015 - 3022
  • [25] Multi-scale joint learning for person re-identification
    Xie, Pengyu
    Xu, Xin
    [J]. Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2021, 47 (03): : 613 - 622
  • [26] Person Re-identification Based on CNN with Multi-scale Contour Embedding
    Chen, Hao
    Zhao, Yan
    Zhang, Lihua
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 560 - 571
  • [27] Multi-level and multi-scale horizontal pooling network for person re-identification
    Yunzhou Zhang
    Shuangwei Liu
    Lin Qi
    Sonya Coleman
    Dermot Kerr
    Weidong Shi
    [J]. Multimedia Tools and Applications, 2020, 79 : 28603 - 28619
  • [28] Multi-level and multi-scale horizontal pooling network for person re-identification
    Zhang, Yunzhou
    Liu, Shuangwei
    Qi, Lin
    Coleman, Sonya
    Kerr, Dermot
    Shi, Weidong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28603 - 28619
  • [29] A Multi-scale Triplet Deep Convolutional Neural Network for Person Re-identification
    Xiong, Mingfu
    Chen, Jun
    Wang, Zhongyuan
    Liang, Chao
    Lei, Bohan
    Hu, Ruimin
    [J]. IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10799 : 30 - 41
  • [30] Multi-Scale Semantic and Detail Extraction Network for Lightweight Person Re-Identification
    Zhang, Yunzuo
    Kang, Weili
    Liu, Yameng
    Zhu, Pengfei
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236