Attention-based deep supervised hashing for near duplicate video retrieval

被引:1
|
作者
Shi, Naifei [1 ]
Fu, Chong [1 ,2 ,3 ]
Tie, Ming [4 ]
Zhang, Wenchao [1 ]
Wang, Xingwei [1 ]
Sham, Chiu-Wing [5 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Minist Educ, Engn Res Ctr Secur Technol Complex Network Syst, Shenyang 110819, Peoples R China
[3] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110819, Peoples R China
[4] Sci & Technol Space Phys Lab, Beijing 100076, Peoples R China
[5] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 36卷 / 10期
基金
中国国家自然科学基金;
关键词
Near duplicate video retrieval; Deep video hashing; Spatio-temporal attention; Pairwise supervised learning;
D O I
10.1007/s00521-023-09342-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the explosive growth of video data on the Internet, near duplicate video retrieval (NDVR) has become an important and challenging issue in the field of information retrieval. Hashing is typically employed to tackle this issue owing to its low memory and fast retrieval speed. Most of the existing video hashing methods directly adopt image hashing methods or perform the frame-pooling strategy, failing to fully explore the spatio-temporal information of videos. In this paper, we propose an attention-based deep supervised video hashing (ADVH) network for NDVR. To capture richer perceptions and acquire more comprehensive video representations, we use a residual network as the backbone and incorporate an attention module to extract spatio-temporal features of videos and motion information between adjacent frames. Moreover, we design a novel pairwise constraint utilizing supervised information to learn compact and discriminative video hash codes. The experimental results on three benchmark video datasets demonstrate that our proposed model outperforms other state-of-the-art hashing methods in retrieval precision.
引用
收藏
页码:5217 / 5230
页数:14
相关论文
共 50 条
  • [41] A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
    Chen, Hanqing
    Hu, Chunyan
    Lee, Feifei
    Lin, Chaowei
    Yao, Wei
    Chen, Lu
    Chen, Qiu
    [J]. SENSORS, 2021, 21 (09)
  • [42] Gait Retrieval: A Deep Hashing Method for People Retrieval in Video
    Rauf, Muhammad
    Huang, Yongzhen
    Wang, Liang
    [J]. PATTERN RECOGNITION (CCPR 2016), PT I, 2016, 662 : 383 - 391
  • [43] Self-Supervised Temporal Sensitive Hashing for Video Retrieval
    Li, Qihua
    Tian, Xing
    Ng, Wing W. Y.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9021 - 9035
  • [44] ASVTDECTOR: A Practical Near Duplicate Video Retrieval System
    Zhou, Xiangmin
    Chen, Lei
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 1348 - 1351
  • [45] Deep Supervised Hashing by Fusing Multiscale Deep Features for Image Retrieval
    Redaoui, Adil
    Belalia, Amina
    Belloulata, Kamel
    [J]. INFORMATION, 2024, 15 (03)
  • [46] Online supervised attention-based recurrent depth estimation from monocular video
    Maslov, Dmitrii
    Makarov, Ilya
    [J]. PEERJ COMPUTER SCIENCE, 2020,
  • [47] Attention-based video streaming
    Dikici, Cagatay
    Bozma, H. Isil
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (10) : 745 - 760
  • [48] Online supervised attention-based recurrent depth estimation from monocular video
    Maslov, Dmitrii
    Makarov, Ilya
    [J]. Maslov, Dmitrii (dvmaslov@edu.hse.ru), 1600, PeerJ Inc. (06): : 1 - 22
  • [49] Deep spatial attention hashing network for image retrieval
    Ge, Lin-Wei
    Zhang, Jun
    Xia, Yi
    Chen, Peng
    Wang, Bing
    Zheng, Chun-Hou
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 63
  • [50] Deep attention sampling hashing for efficient image retrieval
    Feng, Hao
    Wang, Nian
    Zhao, Fa
    Huo, Wei
    [J]. NEUROCOMPUTING, 2023, 559