Visual versus Textual Embedding for Video Retrieval

Cited by: 1
Authors
Francis, Danny [1]
Pidou, Paul [1]
Merialdo, Bernard [1]
Huet, Benoit [1]
Affiliations
[1] EURECOM, Campus SophiaTech, 450 Route Chappes, F-06410 Biot, France
DOI
10.1007/978-3-319-70353-4_33
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper compares several approaches to natural-language access to video databases. We present two main strategies. The first is visual: it compares keyframes with images retrieved from Google Images. The second is textual: it generates a text-based description of each keyframe and compares these descriptions with the query. We study the effect of several parameters and find that substantial improvement is possible by choosing the right strategy for a given topic. Finally, we investigate a method for choosing the right approach for a given topic.
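The two strategies in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the embedding vectors are hypothetical toy values, cosine similarity and max-aggregation over web images are assumptions chosen for illustration, and the web images are assumed to have been fetched using the query text. The example only shows how the visual and textual strategies can rank the same keyframes differently.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def visual_score(keyframe_vec, web_image_vecs):
    """Visual strategy (assumed form): best match between a keyframe
    and any of the example images retrieved for the query."""
    return max(cosine(keyframe_vec, w) for w in web_image_vecs)


def textual_score(caption_vec, query_vec):
    """Textual strategy (assumed form): similarity between the embedding of a
    generated keyframe description and the embedding of the query."""
    return cosine(caption_vec, query_vec)


def rank(scores):
    """Return keyframe ids sorted from highest to lowest score."""
    return sorted(scores, key=scores.get, reverse=True)


# Hypothetical toy embeddings; real systems would use learned features.
keyframes = {
    "kf_a": {"visual": [1.0, 0.0], "caption": [0.0, 1.0]},
    "kf_b": {"visual": [0.0, 1.0], "caption": [1.0, 0.0]},
}
web_image_vecs = [[0.9, 0.1], [1.0, 0.0]]  # stand-ins for web image features
query_vec = [1.0, 0.2]                     # stand-in for the query text embedding

visual_scores = {k: visual_score(v["visual"], web_image_vecs) for k, v in keyframes.items()}
textual_scores = {k: textual_score(v["caption"], query_vec) for k, v in keyframes.items()}

visual_ranking = rank(visual_scores)
textual_ranking = rank(textual_scores)
print("visual ranking: ", visual_ranking)
print("textual ranking:", textual_ranking)
```

With these toy values the two strategies order the keyframes in opposite ways, which is the kind of disagreement that makes a per-topic choice of strategy worthwhile.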
Pages: 386-395
Page count: 10