Semantic Video Retrieval using Deep Learning Techniques

Authors
Yasin, Danish
Sohail, Ashbal
Siddiqi, Imran
Keywords
Semantic Retrieval; Deep Convolutional Neural Networks (CNNs); Long Short-Term Memory Networks (LSTMs); Image
DOI
10.1109/ibcast47879.2020.9044601
CLC number
T [Industrial Technology]
Subject classification code
08
Abstract
Content-based video retrieval has been an active research area for many decades. Unlike tag-based search engines, which rely on user-assigned annotations to retrieve the desired content, content-based retrieval systems match the actual content of a video against the provided query to fetch the required set of videos. Thanks to recent advances in deep learning, the traditional pipeline of content-based systems (pre-processing, segmentation, object classification, action recognition, etc.) is being replaced by end-to-end trainable systems, which are not only effective and robust but also avoid the complex processing of conventional image-based techniques. The present study exploits these developments to build a semantic video retrieval system that accepts natural language queries and retrieves the relevant videos. The queries considered in the current study concern key individuals appearing in certain scenarios. Persons appearing in a video are recognized by tuning FaceNet to our set of images, while caption generation is used to interpret the scenario within a given video frame. The outputs of the two modules are combined to generate a description of the frame. During the retrieval phase, natural language queries are provided to the system, and word embeddings are employed to find words similar to those appearing in the query text. For a given query, the system returns all videos in which the queried individuals and scenarios appear. A preliminary experimental study on a collection of 50 videos reported promising retrieval results.
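The retrieval phase described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy embedding vectors, the similarity threshold, the index layout, and the names `EMBEDDINGS`, `INDEX`, `expand`, and `retrieve` are all assumptions made for illustration. A real system would presumably load pretrained word vectors and build the index from the face recognition and captioning outputs.

```python
import math

# Hypothetical toy word embeddings; a real system would load pretrained vectors.
EMBEDDINGS = {
    "speaking": [0.9, 0.1, 0.0],
    "talking": [0.85, 0.2, 0.05],
    "walking": [0.1, 0.9, 0.1],
    "podium": [0.0, 0.1, 0.95],
}

# Hypothetical index: video id -> frame descriptions, each combining the
# recognized persons with the words of the generated caption.
INDEX = {
    "video_01": [{"persons": {"alice"}, "caption": {"talking", "podium"}}],
    "video_02": [{"persons": {"alice"}, "caption": {"walking"}}],
    "video_03": [{"persons": {"bob"}, "caption": {"speaking"}}],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand(word, threshold=0.9):
    """Return the word plus all vocabulary words at least `threshold` similar."""
    if word not in EMBEDDINGS:
        return {word}
    target = EMBEDDINGS[word]
    similar = {w for w, vec in EMBEDDINGS.items() if cosine(target, vec) >= threshold}
    return similar | {word}


def retrieve(person, scenario_word):
    """Return ids of videos with a frame showing `person` in a matching scenario."""
    terms = expand(scenario_word)
    return sorted(
        video_id
        for video_id, frames in INDEX.items()
        if any(person in f["persons"] and terms & f["caption"] for f in frames)
    )


if __name__ == "__main__":
    print(retrieve("alice", "speaking"))
```

In this sketch the query word "speaking" is expanded to include "talking", so a frame captioned with a similar word still matches even though the exact query word never appears in its description.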
Pages: 338-343
Page count: 6