Visual versus Textual Embedding for Video Retrieval

被引:1
|
作者
Francis, Danny [1 ]
Pidou, Paul [1 ]
Merialdo, Bernard [1 ]
Huet, Benoit [1 ]
机构
[1] EURECOM, Campus SophiaTech,450 Route Chappes, F-06410 Biot, France
关键词
D O I
10.1007/978-3-319-70353-4_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper compares several approaches of natural language access to video databases. We present two main strategies. The first one is visual, and consists in comparing keyframes with images retrieved from Google Images. The second one is textual and consists in generating a text-based description of the keyframes, and comparing these descriptions with the query. We study the effect of several parameters and find out that substantial improvement is possible by choosing the right strategy for a given topic. Finally we investigate a method for choosing the right approach for a given topic.
引用
收藏
页码:386 / 395
页数:10
相关论文
共 50 条
  • [21] Nonlinear embedding neural codes for visual instance retrieval
    Li, Yang
    Miao, Zhuang
    Wang, Jiabao
    Zhang, Yafei
    NEUROCOMPUTING, 2018, 275 : 1275 - 1281
  • [22] Deep Unsupervised Embedding for Remote Sensing Image Retrieval Using Textual Cues
    Rahhal, Mohamad M. Al
    Bazi, Yakoub
    Abdullah, Taghreed
    Mekhalfi, Mohamed L.
    Zuair, Mansour
    APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 14
  • [23] Combination of Visual and Textual Similarity Retrieval from Medical Documents
    Eggel, Ivan
    Mueller, Henning
    MEDICAL INFORMATICS IN A UNITED AND HEALTHY EUROPE, 2009, 150 : 841 - 845
  • [24] A multi-embedding neural model for incident video retrieval
    Chiang, Ting-Hui
    Tseng, Yi-Chun
    Tseng, Yu-Chee
    PATTERN RECOGNITION, 2022, 130
  • [25] Sign Language Video Retrieval with Free-Form Textual Queries
    Duarte, Amanda
    Albanie, Samuel
    Giro-i-Nieto, Xavier
    Varol, Gul
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 14074 - 14084
  • [26] VIVA: visual information retrieval in video archives
    Muehling, Markus
    Korfhage, Nikolaus
    Pustu-Iren, Kader
    Bars, Joanna
    Knapp, Mario
    Bellafkir, Hicham
    Vogelbacher, Markus
    Schneider, Daniel
    Hoerth, Angelika
    Ewerth, Ralph
    Freisleben, Bernd
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2022, 23 (04) : 319 - 333
  • [27] Audio visual cues for video indexing and retrieval
    Muneesawang, Paisarn
    Amin, Tahir
    Guan, Ling
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3331 : 642 - 649
  • [28] Visual Information Retrieval in Endoscopic Video Archives
    Carlos, Jennifer Roldan
    Lux, Mathias
    Giro-i-Nieto, Xavier
    Munoz, Pia
    Anagnostopoulos, Nektarios
    2015 13TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2015,
  • [29] Composition and retrieval of visual information for video databases
    Cheng, PJ
    Yang, WP
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2001, 12 (06): : 627 - 656
  • [30] Audio visual cues for video indexing and retrieval
    Muneesawang, P
    Amin, T
    Guan, L
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2004, PT 1, PROCEEDINGS, 2004, 3331 : 642 - 649