Visual versus Textual Embedding for Video Retrieval

Cited by: 1

Authors:

Francis, Danny [1]

Pidou, Paul [1]

Merialdo, Bernard [1]

Huet, Benoit [1]

Affiliations:

[1] EURECOM, Campus SophiaTech, 450 Route Chappes, F-06410 Biot, France

Source:

ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017) | 2017 / Vol. 10617

Keywords:

DOI:

10.1007/978-3-319-70353-4_33

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper compares several approaches to natural-language access to video databases. We present two main strategies. The first is visual: it compares keyframes with images retrieved from Google Images. The second is textual: it generates a text-based description of each keyframe and compares these descriptions with the query. We study the effect of several parameters and find that substantial improvement is possible by choosing the right strategy for a given topic. Finally, we investigate a method for choosing the right approach for a given topic.
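The two strategies can be pictured with a small sketch. This is an illustration under stated assumptions, not the authors' implementation: it assumes keyframes, web images, captions and the query have already been mapped to feature vectors by some unspecified model, and it uses cosine similarity with a best-match (max) aggregation, neither of which is given in the abstract. All function and variable names are hypothetical.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def visual_scores(keyframes: Dict[str, Vector],
                  web_images: List[Vector]) -> Dict[str, float]:
    """Visual strategy (assumed form): score each keyframe by its best
    match among the example images retrieved for the query."""
    return {
        name: max((cosine(feat, img) for img in web_images), default=0.0)
        for name, feat in keyframes.items()
    }


def textual_scores(captions: Dict[str, Vector],
                   query: Vector) -> Dict[str, float]:
    """Textual strategy (assumed form): score each keyframe by the
    similarity between its generated description and the query."""
    return {name: cosine(vec, query) for name, vec in captions.items()}


def rank(scores: Dict[str, float]) -> List[str]:
    """Keyframe names sorted from highest to lowest score."""
    return sorted(scores, key=scores.get, reverse=True)


# Toy, made-up vectors chosen so that the two strategies disagree.
keyframe_features = {"kf1": [1.0, 0.0], "kf2": [0.0, 1.0]}
web_image_features = [[0.9, 0.1]]
caption_embeddings = {"kf1": [0.2, 0.8], "kf2": [0.7, 0.3]}
query_embedding = [1.0, 0.0]

visual_ranking = rank(visual_scores(keyframe_features, web_image_features))
textual_ranking = rank(textual_scores(caption_embeddings, query_embedding))
print(visual_ranking, textual_ranking)
```

On this toy data the two rankings differ, which is the situation in which choosing a strategy per topic, as the abstract describes, would matter.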

Pages: 386-395

Page count: 10
