DLSTM Approach to Video Modeling with Hashing for Large-Scale Video Retrieval

Authors
Zhuang, Naifan [1]
Ye, Jun [1]
Hua, Kien A. [1]
Affiliations
[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Although Query-by-Example techniques based on Euclidean distance in a multidimensional feature space have proved effective for image databases, this approach cannot be applied effectively to video, since the richness and complexity of video data would make the number of dimensions massive. This issue has been addressed in two recent solutions, namely Deterministic Quantization (DQ) and Dynamic Temporal Quantization (DTQ). DQ divides the video into equal segments and extracts a visual feature vector for each segment. The bag-of-words feature is then encoded by hashing to facilitate approximate nearest neighbor search using Hamming distance. One weakness of this approach is the deterministic segmentation of video data. DTQ improves on this by using dynamic video segmentation to obtain variable-length video segments. As a result, feature vectors extracted from these video segments can better capture the semantic content of the video. To support very large video databases, it is desirable to minimize the number of segments in order to keep the feature representation as small as possible. We achieve this by using only one video segment (i.e., no video data segmentation is necessary at all) while obtaining even better retrieval performance. Our scheme models video using differential long short-term memory (DLSTM) recurrent neural networks and obtains a highly compact fixed-size feature representation from the hidden-state outputs of the DLSTM. Each of these features is further compressed by hashing it into binary bits via quantization. Experimental results on two public data sets, UCF101 and MSRActionPairs, indicate that the proposed video modeling technique outperforms DTQ by a significant margin.
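The final retrieval step described in the abstract (quantizing a fixed-size feature vector into binary bits, then ranking by Hamming distance) can be illustrated with a minimal sketch. The abstract does not specify the quantization rule, so thresholding each dimension at zero is an assumption made here for illustration, as are the function names `binarize`, `hamming`, and `retrieve` and the toy feature values. The DLSTM that would produce the feature vectors is not modeled.

```python
def binarize(features, threshold=0.0):
    """Pack a real-valued feature vector into an integer bit code.

    Assumption: each dimension becomes 1 if it exceeds `threshold`, else 0.
    The paper's actual quantization rule is not given in the abstract.
    """
    code = 0
    for value in features:
        code = (code << 1) | (1 if value > threshold else 0)
    return code


def hamming(a, b):
    """Count the bit positions in which two codes differ."""
    return bin(a ^ b).count("1")


def retrieve(query_code, db_codes, k=3):
    """Return indices of the k database codes closest to the query."""
    ranked = sorted(range(len(db_codes)),
                    key=lambda i: hamming(query_code, db_codes[i]))
    return ranked[:k]


# Toy stand-ins for per-video feature vectors (hypothetical values).
db_features = [
    [0.9, -0.3, 0.2, -0.7],
    [0.8, -0.1, -0.4, -0.6],
    [-0.5, 0.6, -0.2, 0.3],
]
db_codes = [binarize(f) for f in db_features]
query_code = binarize([0.7, -0.2, 0.1, -0.9])
top = retrieve(query_code, db_codes, k=2)
print(top)
```

Storing each code as a single integer keeps the per-video footprint to a handful of bits, and the XOR-then-count comparison is what makes Hamming-distance search cheap relative to Euclidean distance over the raw features.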
Pages: 3222-3227
Page count: 6