Visual versus Textual Embedding for Video Retrieval

Cited by: 1

Authors:

Francis, Danny [1]

Pidou, Paul [1]

Merialdo, Bernard [1]

Huet, Benoit [1]

Affiliation:

[1] EURECOM, Campus SophiaTech, 450 Route Chappes, F-06410 Biot, France

Source:

ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS (ACIVS 2017) | 2017, Vol. 10617

Keywords:

DOI:

10.1007/978-3-319-70353-4_33

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper compares several approaches to natural-language access to video databases. We present two main strategies. The first is visual and consists of comparing keyframes with images retrieved from Google Images. The second is textual and consists of generating text-based descriptions of the keyframes and comparing these descriptions with the query. We study the effect of several parameters and find that substantial improvement is possible by choosing the right strategy for a given topic. Finally, we investigate a method for choosing the right approach for a given topic.


Pages: 386-395

Page count: 10

