DLSTM Approach to Video Modeling with Hashing for Large-Scale Video Retrieval

被引:0
|
作者
Zhuang, Naifan [1 ]
Ye, Jun [1 ]
Hua, Kien A. [1 ]
机构
[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although Query-by-Example techniques based on Euclidean distance in a multidimensional feature space have proved to be effective for image databases, this approach cannot be effectively applied to video since the number of dimensions would be massive due to the richness and complexity of video data. The above issue has been addressed in two recent solutions, namely Deterministic Quantization (DQ) and Dynamic Temporal Quantization (DTQ). DQ divides the video into equal segments and extracts a visual feature vector for each segment. The bag-of-word feature is then encoded by hashing to facilitate approximate nearest neighbor search using Hamming distance. One weakness of this approach is the deterministic segmentation of video data. DTQ improves on this by using dynamic video segmentation to obtain varied-length video segments. As a result, feature vectors extracted from these video segments can better capture the semantic content of the video. To support very large video databases, it is desirable to minimize the number of segments in order to keep the size of the feature representation as small as possible. We achieve this by using only one video segment (i.e., no video data segmentation is even necessary) with even better retrieval performance. Our scheme models video using differential long short-term memory (DLSTM) recurrent neural networks and obtains a highly compact fixed-size feature representation with the output of hidden states of the DLSTM. Each of these features are further compressed by hashing them into binary bits via quantization. Experimental results based on two public data sets, UCF101 and MSRActionPairs, indicate that the proposed video modeling technique outperforms DTQ by a significant margin.
引用
收藏
页码:3222 / 3227
页数:6
相关论文
共 50 条
  • [41] Efficient indexing and retrieval of large-scale geo-tagged video databases
    Lu, Ying
    Shahabi, Cyrus
    Kim, Seon Ho
    [J]. GEOINFORMATICA, 2016, 20 (04) : 829 - 857
  • [42] Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval
    Jiang, Chen
    Huang, Kaiming
    He, Sifeng
    Yang, Xudong
    Zhang, Wei
    Zhang, Xiaobo
    Cheng, Yuan
    Yang, Lei
    Wang, Qing
    Xu, Furong
    Pan, Tan
    Chu, Wei
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1618 - 1626
  • [43] Temporal Aggregation of Visual Features for Large-Scale Image-to-Video Retrieval
    Garcia, Noa
    [J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 489 - 492
  • [44] Large-scale video monitoring system
    Kobayashi, Kazuaki
    [J]. NEC Technical Journal, 2010, 5 (03): : 39 - 42
  • [45] A gradual approach to knowledge distillation in deep supervised hashing for large-scale image retrieval
    Hussain, Abid
    li, Heng-Chao
    Hussain, Mehboob
    Ali, Muqadar
    Abbas, Shaheen
    Ali, Danish
    Rehman, Amir
    [J]. Computers and Electrical Engineering, 2024, 120
  • [46] Stochastic Non-linear Hashing for Near-Duplicate Video Retrieval using Deep Feature applicable to Large-scale Datasets
    Byun, Sung-Woo
    Lee, Seok-Pil
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (08): : 4300 - 4314
  • [47] Compact binary hashing for efficient large-scale image retrieval
    Irie, Go
    [J]. Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2015, 69 (02): : 124 - 130
  • [48] An Enhanced Deep Hashing Method for Large-Scale Image Retrieval
    Chen, Cong
    Tong, Weiqin
    Ding, Xuehai
    Zhi, Xiaoli
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 382 - 393
  • [49] Online Supervised Sketching Hashing for Large-Scale Image Retrieval
    Weng, Zhenyu
    Zhu, Yuesheng
    [J]. IEEE ACCESS, 2019, 7 : 88369 - 88379
  • [50] Spatial pyramid deep hashing for large-scale image retrieval
    Zhao, Wanqing
    Luo, Hangzai
    Peng, Jinye
    Fan, Jianping
    [J]. NEUROCOMPUTING, 2017, 243 : 166 - 173