DLSTM Approach to Video Modeling with Hashing for Large-Scale Video Retrieval

被引：0

作者：

Zhuang, Naifan ^{[1
]}

Ye, Jun ^{[1
]}

Hua, Kien A. ^{[1
]}

机构：

[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA

来源：

2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although Query-by-Example techniques based on Euclidean distance in a multidimensional feature space have proved to be effective for image databases, this approach cannot be effectively applied to video since the number of dimensions would be massive due to the richness and complexity of video data. The above issue has been addressed in two recent solutions, namely Deterministic Quantization (DQ) and Dynamic Temporal Quantization (DTQ). DQ divides the video into equal segments and extracts a visual feature vector for each segment. The bag-of-word feature is then encoded by hashing to facilitate approximate nearest neighbor search using Hamming distance. One weakness of this approach is the deterministic segmentation of video data. DTQ improves on this by using dynamic video segmentation to obtain varied-length video segments. As a result, feature vectors extracted from these video segments can better capture the semantic content of the video. To support very large video databases, it is desirable to minimize the number of segments in order to keep the size of the feature representation as small as possible. We achieve this by using only one video segment (i.e., no video data segmentation is even necessary) with even better retrieval performance. Our scheme models video using differential long short-term memory (DLSTM) recurrent neural networks and obtains a highly compact fixed-size feature representation with the output of hidden states of the DLSTM. Each of these features are further compressed by hashing them into binary bits via quantization. Experimental results based on two public data sets, UCF101 and MSRActionPairs, indicate that the proposed video modeling technique outperforms DTQ by a significant margin.

引用

页码：3222 / 3227

页数：6

共 50 条

[41] Efficient indexing and retrieval of large-scale geo-tagged video databases
Lu, Ying
Shahabi, Cyrus
Kim, Seon Ho
[J]. GEOINFORMATICA, 2016, 20 (04) : 829 - 857
[42] Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval
Jiang, Chen
Huang, Kaiming
He, Sifeng
Yang, Xudong
Zhang, Wei
Zhang, Xiaobo
Cheng, Yuan
Yang, Lei
Wang, Qing
Xu, Furong
Pan, Tan
Chu, Wei
[J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1618 - 1626
[43] Temporal Aggregation of Visual Features for Large-Scale Image-to-Video Retrieval
Garcia, Noa
[J]. ICMR '18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2018, : 489 - 492
[44] Large-scale video monitoring system
Kobayashi, Kazuaki
[J]. NEC Technical Journal, 2010, 5 (03): : 39 - 42
[45] A gradual approach to knowledge distillation in deep supervised hashing for large-scale image retrieval
Hussain, Abid
li, Heng-Chao
Hussain, Mehboob
Ali, Muqadar
Abbas, Shaheen
Ali, Danish
Rehman, Amir
[J]. Computers and Electrical Engineering, 2024, 120
[46] Stochastic Non-linear Hashing for Near-Duplicate Video Retrieval using Deep Feature applicable to Large-scale Datasets
Byun, Sung-Woo
Lee, Seok-Pil
[J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (08): : 4300 - 4314
[47] Compact binary hashing for efficient large-scale image retrieval
Irie, Go
[J]. Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2015, 69 (02): : 124 - 130
[48] An Enhanced Deep Hashing Method for Large-Scale Image Retrieval
Chen, Cong
Tong, Weiqin
Ding, Xuehai
Zhi, Xiaoli
[J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 382 - 393
[49] Online Supervised Sketching Hashing for Large-Scale Image Retrieval
Weng, Zhenyu
Zhu, Yuesheng
[J]. IEEE ACCESS, 2019, 7 : 88369 - 88379
[50] Spatial pyramid deep hashing for large-scale image retrieval
Zhao, Wanqing
Luo, Hangzai
Peng, Jinye
Fan, Jianping
[J]. NEUROCOMPUTING, 2017, 243 : 166 - 173

← 1 2 3 4 5 →