Semantic Video Retrieval using Deep Learning Techniques

Cited by: 0

Authors:

Yasin, Danish

Sohail, Ashbal

Siddiqi, Imran

Affiliation:

Source:

PROCEEDINGS OF 2020 17TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST) | 2020

Keywords:

Semantic Retrieval; Deep Convolutional Neural Networks (CNNs); Long Short-Term Memory Networks (LSTMs); IMAGE

DOI:

10.1109/ibcast47879.2020.9044601

Chinese Library Classification (CLC) number:

T [Industrial Technology]

Discipline classification code:

08

Abstract:

Content-based video retrieval has been an active research area for many decades. Unlike tag-based search engines, which rely on user-assigned annotations to retrieve the desired content, content-based retrieval systems match the actual content of a video against the provided query to fetch the required set of videos. Thanks to recent advances in deep learning, the traditional pipeline of content-based systems (pre-processing, segmentation, object classification, action recognition, etc.) is being replaced by end-to-end trainable systems that are not only effective and robust but also avoid the complex processing of conventional image-based techniques. The present study exploits these developments to build a semantic video retrieval system that accepts natural language queries and retrieves the relevant videos. The queries considered in the current study concern key individuals appearing in certain scenarios. Persons appearing in a video are recognized by tuning FaceNet to our set of images, while caption generation is used to make sense of the scenario within a given video frame. The outputs of the two modules are combined to generate a description of the frame. During the retrieval phase, natural language queries are provided to the system, and word embeddings are employed to find words similar to those appearing in the query text. For a given query, the system returns all videos in which the queried individuals and scenarios appear. A preliminary experimental study on a collection of 50 videos reported promising retrieval results.
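
The retrieval phase described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy three-dimensional word vectors, the similarity threshold of 0.95, the index layout, and the names EMBEDDINGS, INDEX, expand and retrieve are all assumptions made for illustration. The face recognition and captioning modules are not modelled; the sketch assumes their outputs are already stored as per-frame descriptions.

```python
import math

# Toy 3-d word vectors (illustrative values only, not taken from the paper).
EMBEDDINGS = {
    "speaking": [0.9, 0.1, 0.0],
    "talking": [0.85, 0.15, 0.05],
    "walking": [0.1, 0.9, 0.0],
    "running": [0.15, 0.85, 0.1],
    "podium": [0.0, 0.1, 0.9],
}

# Hypothetical index: video id -> frame descriptions, each combining the
# recognized persons with the generated caption for that frame.
INDEX = {
    "video_a": [{"persons": {"alice"}, "caption": "a person talking at a podium"}],
    "video_b": [{"persons": {"alice"}, "caption": "a person running in a park"}],
    "video_c": [{"persons": {"bob"}, "caption": "a person speaking to a crowd"}],
}


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def expand(word, threshold=0.95):
    """Return the word plus all vocabulary words at least `threshold` similar."""
    if word not in EMBEDDINGS:
        return {word}
    vec = EMBEDDINGS[word]
    similar = {w for w, u in EMBEDDINGS.items() if cosine(vec, u) >= threshold}
    return similar | {word}


def retrieve(person, scenario_word, index=INDEX):
    """Return ids of videos with a frame showing `person` in a matching scenario."""
    terms = expand(scenario_word)
    hits = []
    for video_id, frames in index.items():
        for frame in frames:
            caption_words = set(frame["caption"].split())
            if person in frame["persons"] and terms & caption_words:
                hits.append(video_id)
                break  # one matching frame is enough for this video
    return sorted(hits)


print(retrieve("alice", "speaking"))
```

In this sketch the query word "speaking" is expanded to include "talking", so a video whose caption only mentions "talking" is still returned for the queried person. A real system would load pretrained embeddings and tokenize full natural language queries rather than taking a single person and a single scenario word.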

Pages: 338-343

Page count: 6
