DLSTM Approach to Video Modeling with Hashing for Large-Scale Video Retrieval

Cited by: 0

Authors:

Zhuang, Naifan [1]

Ye, Jun [1]

Hua, Kien A. [1]

Affiliations:

[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA

Source:

2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2016

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Although Query-by-Example techniques based on Euclidean distance in a multidimensional feature space have proved effective for image databases, this approach cannot be applied effectively to video, since the richness and complexity of video data would make the number of dimensions massive. This issue has been addressed in two recent solutions, namely Deterministic Quantization (DQ) and Dynamic Temporal Quantization (DTQ). DQ divides the video into equal segments and extracts a visual feature vector for each segment. The bag-of-words feature is then encoded by hashing to facilitate approximate nearest neighbor search using Hamming distance. One weakness of this approach is the deterministic segmentation of video data. DTQ improves on this by using dynamic video segmentation to obtain varied-length video segments. As a result, feature vectors extracted from these video segments can better capture the semantic content of the video. To support very large video databases, it is desirable to minimize the number of segments in order to keep the size of the feature representation as small as possible. We achieve this by using only one video segment (i.e., no video data segmentation is necessary at all), with even better retrieval performance. Our scheme models video using differential long short-term memory (DLSTM) recurrent neural networks and obtains a highly compact fixed-size feature representation from the output of the hidden states of the DLSTM. Each of these features is further compressed by hashing it into binary bits via quantization. Experimental results based on two public data sets, UCF101 and MSRActionPairs, indicate that the proposed video modeling technique outperforms DTQ by a significant margin.
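The hashing step described in the abstract (quantizing a fixed-size feature into binary bits, then ranking by Hamming distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the quantizer, so thresholding each dimension at zero is an assumption, and the four-dimensional feature values are made-up stand-ins for DLSTM hidden-state outputs. The function names `binarize`, `hamming`, and `search` are hypothetical.

```python
from typing import List, Sequence, Tuple


def binarize(feature: Sequence[float], threshold: float = 0.0) -> int:
    """Quantize a real-valued feature into a binary code packed into an int.

    Assumption: one bit per dimension, set when the value exceeds `threshold`.
    The paper's actual quantization rule is not given in the abstract.
    """
    code = 0
    for value in feature:
        code = (code << 1) | (1 if value > threshold else 0)
    return code


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two binary codes differ."""
    return bin(a ^ b).count("1")


def search(query_code: int, database: Sequence[int], k: int = 1) -> List[Tuple[int, int]]:
    """Return the k closest database entries as (distance, index) pairs."""
    scored = [(hamming(query_code, code), idx) for idx, code in enumerate(database)]
    scored.sort()
    return scored[:k]


# Toy features standing in for per-video DLSTM hidden states (illustrative only).
video_features = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.7, 0.4, 0.6, -0.1],
]
codes = [binarize(f) for f in video_features]
query = binarize([0.8, -0.1, 0.5, -0.9])
print(search(query, codes, k=2))
```

Packing the bits into an integer keeps each video's code compact and reduces the distance computation to an XOR followed by a bit count, which is what makes Hamming-distance search attractive for large databases.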

Citation

Pages: 3222-3227

Page count: 6

50 records in total

[21] Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search
Sauter, Loris
Parian, Mahnaz Amiri
Gasser, Ralph
Heller, Silvan
Rossetto, Luca
Schuldt, Heiko
[J]. MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 760 - 765
[22] Large-scale video copy retrieval with temporal-concentration SIFT
Zhu, Yingying
Huang, Xiaoyan
Huang, Qiang
Tian, Qi
[J]. NEUROCOMPUTING, 2016, 187 : 83 - 91
[23] Large-Scale Video Retrieval via Deep Local Convolutional Features
Zhang, Chen
Hu, Bin
Suo, Yucong
Zou, Zhiqiang
Ji, Yimu
[J]. ADVANCES IN MULTIMEDIA, 2020, 2020
[24] An Adaptive Search Path Traverse for Large-scale Video Frame Retrieval
Diep Thi-Ngoc Nguyen
Kiyoki, Yasushi
[J]. INFORMATION MODELLING AND KNOWLEDGE BASES XXVI, 2014, 272 : 324 - 342
[25] TEMPORAL AGGREGATION FOR LARGE-SCALE QUERY-BY-IMAGE VIDEO RETRIEVAL
Araujo, Andre
Chaves, Jason
Angst, Roland
Girod, Bernd
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1518 - 1522
[26] A supervised deep convolutional based bidirectional long short term memory video hashing for large scale video retrieval applications
Anuranji, R.
Srimathi, H.
[J]. DIGITAL SIGNAL PROCESSING, 2020, 102
[27] Large-scale image retrieval with supervised sparse hashing
Xu, Yan
Shen, Fumin
Xu, Xing
Gao, Lianli
Wang, Yuan
Tan, Xiao
[J]. NEUROCOMPUTING, 2017, 229 : 45 - 53
[28] Large-scale image retrieval with Sparse Embedded Hashing
Ding, Guiguang
Zhou, Jile
Guo, Yuchen
Lin, Zijia
Zhao, Sicheng
Han, Jungong
[J]. NEUROCOMPUTING, 2017, 257 : 24 - 36
[29] Modeling large-scale live video streaming client behavior
Thiago Guarnieri
Idilio Drago
Ítalo Cunha
Breno Almeida
Jussara M. Almeida
Alex B. Vieira
[J]. Multimedia Systems, 2021, 27 : 1101 - 1124
[30] Modeling large-scale live video streaming client behavior
Guarnieri, Thiago
Drago, Idilio
Cunha, Italo
Almeida, Breno
Almeida, Jussara M.
Vieira, Alex B.
[J]. MULTIMEDIA SYSTEMS, 2021, 27 (06) : 1101 - 1124
