Vibro: Video Browsing with Semantic and Visual Image Embeddings

被引：6

作者：

Schall, Konstantin ^{[1
]}

Hezel, Nico ^{[1
]}

Jung, Klaus ^{[1
]}

Barthel, Kai Uwe ^{[1
]}

机构：

[1] Univ Appl Sci, HTW Berlin, Visual Comp Grp, Wilhelminenhofstr 75, D-12459 Berlin, Germany

来源：

MULTIMEDIA MODELING, MMM 2023, PT I | 2023年 / 13833卷

关键词：

Content-based video retrieval; Exploration; Visualization; Image browsing; Visual and textual co-embeddings;

D O I：

10.1007/978-3-031-27077-2_56

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Vibro represents a powerful tool for interactive video retrieval and browsing and is the winner of the Video Browser Showdown 2022. Following the saying of "never change a winning system" we did not change any of the underlying concepts nor added any new features. Instead, we focused on improving the three existing cornerstones of the software, which are text-to-image search, image-to-image search and browsing results with 2D sorted maps. The changes to these three parts are summarized in this paper, and in addition, an overview of the AVS-mode of vibro is given.

引用

页码：665 / 670

页数：6

共 50 条

[41] LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
Sun, Siqi
Chen, Yen-Chun
Li, Linjie
Wang, Shuohang
Fang, Yuwei
Liu, Jingjing
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 982 - 997
[42] VisualFlamenco:: Dependable, interactive image browsing based on visual properties
Mueller, Wolfgang
Zech, Markus
Henrich, Andreas
Blank, Daniel
2008 INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING, 2008, : 552 - 559
[43] Visual Semantic Role Labeling for Video Understanding
Sadhu, Arka
Gupta, Tanmay
Yatskar, Mark
Nevatia, Ram
Kembhavi, Aniruddha
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5585 - 5596
[44] Generating Semantic Visual Templates for video databases
Chen, W
Chang, SF
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1337 - 1340
[45] Semantic video labeling by developmental visual agents
Gori, Marco
Lippi, Marco
Maggini, Marco
Melacci, Stefano
COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 146 : 9 - 26
[46] Learning semantic visual concepts from video
Liu, JC
Bhanu, B
16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 1061 - 1064
[47] A Review of Video Retrieval Based on Image and Video Semantic Understanding
Haseyama, Miki
Ogawa, Takahiro
Yagi, Nobuyuki
ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2013, 1 (01): : 2 - 9
[48] Semantic image and video in broad domains indexing
Worring, Marcel
Schreiber, Guus
IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (05) : 909 - 911
[49] Multimodal Analogy-Based Image Retrieval by Improving Semantic Embeddings
Ota, Kosuke
Shirai, Keiichiro
Miyao, Hidetoshi
Maruyama, Minoru
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2022, 26 (06) : 995 - 1003
[50] Scalable Nonlinear Embeddings for Semantic Category-based Image Retrieval
Sharma, Gaurav
Schiele, Bernt
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1296 - 1304

← 1 2 3 4 5 →