Attention-Based Video Hashing for Large-Scale Video Retrieval

被引:11
|
作者
Wang, Yingxin [1 ]
Nie, Xiushan [2 ]
Shi, Yang [3 ]
Zhou, Xin [1 ]
Yin, Yilong [3 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan 250101, Peoples R China
[2] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan 250101, Peoples R China
[3] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural networks; Data models; Deep learning; Quantization (signal); Binary codes; Feature extraction; hashing; video hashing; video retrieval; IMAGE RETRIEVAL; BINARY; QUANTIZATION;
D O I
10.1109/TCDS.2019.2963339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large-scale video retrieval is a challenging problem because of the exponential growth of video collections on the Internet. To address this challenge, we propose an attention-based video hashing (AVH) method for large-scale video retrieval. Unlike most of the existing video hashing methods, which consider different frames within a video separately for hash learning, we use a convolutional neural network and long short-term memory (LSTM) network as the backbone to learn compact and discriminative hash codes by exploiting the structural information among different frames. To better capture informative clues in the video, an attention mechanism is added into the backbone, which can assign different weights to different LSTM time steps. Experiments were conducted to evaluate the proposed AVH method in comparison with existing methods. The experimental results on two widely used data sets show that our method outperforms existing state-of-the-art methods.
引用
收藏
页码:491 / 502
页数:12
相关论文
共 50 条
  • [1] DLSTM Approach to Video Modeling with Hashing for Large-Scale Video Retrieval
    Zhuang, Naifan
    Ye, Jun
    Hua, Kien A.
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3222 - 3227
  • [2] Attention-based deep supervised hashing for near duplicate video retrieval
    Shi, Naifei
    Fu, Chong
    Tie, Ming
    Zhang, Wenchao
    Wang, Xingwei
    Sham, Chiu-Wing
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 36 (10): : 5217 - 5230
  • [3] Attention-based deep supervised hashing for near duplicate video retrieval
    Naifei Shi
    Chong Fu
    Ming Tie
    Wenchao Zhang
    Xingwei Wang
    Chiu-Wing Sham
    [J]. Neural Computing and Applications, 2024, 36 : 5217 - 5230
  • [4] Unsupervised Deep Video Hashing via Balanced Code for Large-Scale Video Retrieval
    Wu, Gengshen
    Han, Jungong
    Guo, Yuchen
    Liu, Li
    Ding, Guiguang
    Ni, Qiang
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1993 - 2007
  • [5] Classification-enhancement deep hashing for large-scale video retrieval
    Nie, Xiushan
    Zhou, Xin
    Shi, Yang
    Sun, Jiande
    Yin, Yilong
    [J]. APPLIED SOFT COMPUTING, 2021, 109
  • [6] Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval
    Hao, Yanbin
    Mu, Tingting
    Hong, Richang
    Wang, Meng
    An, Ning
    Goulermas, John Y.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (01) : 1 - 14
  • [7] A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
    Chen, Hanqing
    Hu, Chunyan
    Lee, Feifei
    Lin, Chaowei
    Yao, Wei
    Chen, Lu
    Chen, Qiu
    [J]. SENSORS, 2021, 21 (09)
  • [8] Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval
    Song, Jingkuan
    Yang, Yi
    Huang, Zi
    Shen, Heng Tao
    Luo, Jiebo
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) : 1997 - 2008
  • [9] Large-Scale Video Hashing via Structure Learning
    Ye, Guangnan
    Liu, Dong
    Wang, Jun
    Chang, Shih-Fu
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2272 - 2279
  • [10] Face Retrieval on Large-Scale Video Data
    Herrmann, Christian
    Beyerer, Juergen
    [J]. 2015 12TH CONFERENCE ON COMPUTER AND ROBOT VISION CRV 2015, 2015, : 192 - 199