Vibro: Video Browsing with Semantic and Visual Image Embeddings

Cited by: 6
Authors
Schall, Konstantin [1 ]
Hezel, Nico [1 ]
Jung, Klaus [1 ]
Barthel, Kai Uwe [1 ]
Affiliations
[1] Univ Appl Sci, HTW Berlin, Visual Comp Grp, Wilhelminenhofstr 75, D-12459 Berlin, Germany
Keywords
Content-based video retrieval; Exploration; Visualization; Image browsing; Visual and textual co-embeddings
DOI
10.1007/978-3-031-27077-2_56
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Vibro is a powerful tool for interactive video retrieval and browsing and the winner of the Video Browser Showdown 2022. Following the saying "never change a winning system", we neither changed any of the underlying concepts nor added any new features. Instead, we focused on improving the three existing cornerstones of the software: text-to-image search, image-to-image search, and browsing results with 2D sorted maps. This paper summarizes the changes to these three parts and gives an overview of Vibro's AVS mode.
Pages: 665-670
Page count: 6