Semantic Video Retrieval using Deep Learning Techniques

被引:0
|
作者
Yasin, Danish
Sohail, Ashbal
Siddiqi, Imran
机构
关键词
Semantic Retrieval; Deep Convolutional Neural Networks (CNNs); Long-Short Term Memory Networks (LSTMs); IMAGE;
D O I
10.1109/ibcast47879.2020.9044601
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Content based video retrieval has been an active research area for many decades. Unlike tagged-based search engines which rely on user-assigned annotations to retrieve the desired content, content based retrieval systems match the actual content of video with the provided query to fetch the required set of videos. Thanks to the recent advancements in deep learning, the traditional pipeline of content based systems (pre-processing, segmentation, object classification, action recognition etc.) is being replaced by end-to-end trainable systems which are not only effective and robust but also avoid the complex processing in the conventional image based techniques. The present study exploits these developments to develop a semantic video retrieval system accepting natural language queries and retrieving the relevant videos. We focus on key individuals appearing in certain scenarios as queries in the current study. Persons appearing in a video are recognized by tuning FaceNet to our set of images while caption generation is exploited to make sense of the scenario within a given video frame. The outputs of the two modules are combined to generate a description of the frame. During the retrieval phase, natural language queries are provided to the system and the concept of word embeddings is employed to find similar words to those appearing in the query text. For a given query, all videos where the queried individuals and scenarios have appeared are returned by the system. The preliminary experimental study on a collection of 50 videos reported promising retrieval results.
引用
收藏
页码:338 / 343
页数:6
相关论文
共 50 条
  • [1] Deep Learning Based Semantic Video Indexing and Retrieval
    Podlesnaya, Anna
    Podlesnyy, Sergey
    [J]. PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 359 - 372
  • [2] A Survey on Near Duplicate Video Retrieval Using Deep Learning Techniques and Framework
    Phalke, Dhanashree Ajay
    Jahirabadkar, Sunita
    [J]. 2020 IEEE PUNE SECTION INTERNATIONAL CONFERENCE (PUNECON), 2020, : 124 - 128
  • [3] A survey on deep learning techniques for image and video semantic segmentation
    Garcia-Garcia, Alberto
    Orts-Escolano, Sergio
    Oprea, Sergiu
    Villena-Martinez, Victor
    Martinez-Gonzalez, Pablo
    Garcia-Rodriguez, Jose
    [J]. APPLIED SOFT COMPUTING, 2018, 70 : 41 - 65
  • [4] Large-scale semantic web image retrieval using bimodal deep learning techniques
    Huang, Changqin
    Xu, Haijiao
    Xie, Liang
    Zhu, Jia
    Xu, Chunyan
    Tang, Yong
    [J]. INFORMATION SCIENCES, 2018, 430 : 331 - 348
  • [5] Survey on semantic segmentation using deep learning techniques
    Lateef, Fahad
    Ruichek, Yassine
    [J]. NEUROCOMPUTING, 2019, 338 : 321 - 348
  • [6] Video retrieval using semantic data
    Del Bimbo, A
    [J]. STATE-OF-THE-ART IN CONTENT-BASED IMAGE AND VIDEO RETRIEVAL, 2001, 22 : 279 - 295
  • [7] Towards a Semantic Video Analysis using Deep Learning and Ontology
    Bornia, Jemai
    Mahmoudi, Sidi Ahmed
    Frihida, Ali
    Manneback, Pierre
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES AND APPLICATIONS (CLOUDTECH), 2018,
  • [8] Describing video scenarios using deep learning techniques
    Huang, Yin-Fu
    Shih, Li-Ping
    Tsai, Chia-Hsin
    Shen, Guan-Ting
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (06) : 2465 - 2490
  • [9] A Survey on Image Semantic Segmentation Using Deep Learning Techniques
    Cheng, Jieren
    Li, Hua
    Li, Dengbo
    Hua, Shuai
    Sheng, Victor S.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1941 - 1957
  • [10] Semantic Segmentation of Herbarium Specimens Using Deep Learning Techniques
    Hussein, Burhan Rashid
    Malik, Owais Ahmed
    Ong, Wee-Hong
    Slik, Johan Willem Frederik
    [J]. COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 321 - 330