Semantic Reasoning in Zero Example Video Event Retrieval

被引:13
|
作者
de Boer, Maaike H. T. [1 ,2 ]
Lu, Yi-Jie [3 ]
Zhang, Hao [3 ]
Schutte, Klamer [4 ]
Ngo, Chong-Wah [3 ]
Kraaij, Wessel [5 ,6 ]
机构
[1] TNO, Intelligent Imaging Dept, Oude Waalsdorperweg 63, NL-2597 AK The Hague, Netherlands
[2] Radboud Univ Nijmegen, Data Sci Dept, Fac Sci, Toernooiveld 212, NL-6525 EC Nijmegen, Netherlands
[3] City Univ Hong Kong, 83 Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
[4] TNO, Oude Waalsdorperweg 63, NL-2597 AK The Hague, Netherlands
[5] TNO, Anna Van Buerenpl 1 & Niels Bohrweg 1, NL-2595 DA The Hague, Netherlands
[6] Leiden Univ, NL-2333 CA Leiden, Netherlands
关键词
Content-based visual information retrieval; multimedia event detection; zero shot; semantics;
D O I
10.1145/3131288
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Searching in digital video data for high-level events, such as a parade or a car accident, is challenging when the query is textual and lacks visual example images or videos. Current research in deep neural networks is highly beneficial for the retrieval of high-level events using visual examples, but without examples it is still hard to (1) determine which concepts are useful to pre-train (Vocabulary challenge) and (2) which pre-trained concept detectors are relevant for a certain unseen high-level event (Concept Selection challenge). In our article, we present our Semantic Event Retrieval Systemwhich (1) shows the importance of high-level concepts in a vocabulary for the retrieval of complex and generic high-level events and (2) uses a novel concept selection method (i-w2v) based on semantic embeddings. Our experiments on the international TRECVID Multimedia Event Detection benchmark show that a diverse vocabulary including high-level concepts improves performance on the retrieval of high-level events in videos and that our novel method outperforms a knowledge-based concept selection method.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [21] Video Question Answering With Semantic Disentanglement and Reasoning
    Liu, Jin
    Wang, Guoxiang
    Xie, Jialong
    Zhou, Fengyu
    Xu, Huijuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3663 - 3673
  • [22] Semantic Information Retrieval Study Based on Knowledge Reasoning
    You, Ming-ying
    Liang, Lei
    Peng, Jun
    Chen, Cheng-yuan
    FUZZY INFORMATION AND ENGINEERING, VOLUME 2, 2009, 62 : 271 - 280
  • [23] A Review of Video Retrieval Based on Image and Video Semantic Understanding
    Haseyama, Miki
    Ogawa, Takahiro
    Yagi, Nobuyuki
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2013, 1 (01): : 2 - 9
  • [24] Semantic-based surveillance video retrieval
    Hu, Weiming
    Xie, Dan
    Fu, Zhouyu
    Zeng, Wenrong
    Maybank, Steve
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (04) : 1168 - 1181
  • [25] Content-based video retrieval by example video clip
    Dimitrova, N
    AbdelMottaleb, M
    STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES V, 1997, 3022 : 59 - 70
  • [26] Semantic modeling approach for video retrieval by content
    Ardizzone, Edoardo
    Hacid, Mohand-Said
    International Conference on Multimedia Computing and Systems -Proceedings, 1999, 2 : 158 - 162
  • [27] Video Retrieval System for Bridging the Semantic Gap
    Jung, Min Young
    Park, Sung Han
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2009, E92D (12): : 2516 - 2519
  • [28] A semantic modeling approach for video retrieval by content
    Ardizzone, E
    Hacid, MS
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 2, 1999, : 158 - 162
  • [29] Semantic video retrieval using audio analysis
    Bakker, EM
    Lew, MS
    IMAGE AND VIDEO RETRIEVAL, 2002, 2383 : 271 - 277
  • [30] Detecting automobiles and people for semantic video retrieval
    Visser, R
    Sebe, N
    Lew, MS
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2002, : 733 - 736