Semantic Reasoning in Zero Example Video Event Retrieval

被引:13
|
作者
de Boer, Maaike H. T. [1 ,2 ]
Lu, Yi-Jie [3 ]
Zhang, Hao [3 ]
Schutte, Klamer [4 ]
Ngo, Chong-Wah [3 ]
Kraaij, Wessel [5 ,6 ]
机构
[1] TNO, Intelligent Imaging Dept, Oude Waalsdorperweg 63, NL-2597 AK The Hague, Netherlands
[2] Radboud Univ Nijmegen, Data Sci Dept, Fac Sci, Toernooiveld 212, NL-6525 EC Nijmegen, Netherlands
[3] City Univ Hong Kong, 83 Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
[4] TNO, Oude Waalsdorperweg 63, NL-2597 AK The Hague, Netherlands
[5] TNO, Anna Van Buerenpl 1 & Niels Bohrweg 1, NL-2595 DA The Hague, Netherlands
[6] Leiden Univ, NL-2333 CA Leiden, Netherlands
关键词
Content-based visual information retrieval; multimedia event detection; zero shot; semantics;
D O I
10.1145/3131288
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Searching in digital video data for high-level events, such as a parade or a car accident, is challenging when the query is textual and lacks visual example images or videos. Current research in deep neural networks is highly beneficial for the retrieval of high-level events using visual examples, but without examples it is still hard to (1) determine which concepts are useful to pre-train (Vocabulary challenge) and (2) which pre-trained concept detectors are relevant for a certain unseen high-level event (Concept Selection challenge). In our article, we present our Semantic Event Retrieval Systemwhich (1) shows the importance of high-level concepts in a vocabulary for the retrieval of complex and generic high-level events and (2) uses a novel concept selection method (i-w2v) based on semantic embeddings. Our experiments on the international TRECVID Multimedia Event Detection benchmark show that a diverse vocabulary including high-level concepts improves performance on the retrieval of high-level events in videos and that our novel method outperforms a knowledge-based concept selection method.
引用
下载
收藏
页数:17
相关论文
共 50 条
  • [1] Dual Encoding for Zero-Example Video Retrieval
    Dong, Jianfeng
    Li, Xirong
    Xu, Chaoxi
    Ji, Shouling
    He, Yuan
    Yang, Gang
    Wang, Xun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9338 - 9347
  • [2] Semantic Event Retrieval from Surveillance Video Databases
    Chen, Xin
    Zhang, Chengcui
    ISM: 2008 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, 2008, : 625 - 630
  • [3] Exploiting Visual Semantic Reasoning for Video-Text Retrieval
    Feng, Zerun
    Zeng, Zhimin
    Guo, Caili
    Li, Zheng
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1005 - 1011
  • [4] Integrated semantic-syntactic video event modeling for search and retrieval
    Ekin, A
    Tekalp, AM
    Mehrotra, R
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 141 - 144
  • [5] EXPLORING AUDIO SEMANTIC CONCEPTS FOR EVENT-BASED VIDEO RETRIEVAL
    Wang, Yipei
    Rawat, Shourabh
    Metze, Florian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] Semantic retrieval of video
    Xiong, ZY
    Zhou, XS
    Tian, Q
    Rui, Y
    Huang, TS
    IEEE SIGNAL PROCESSING MAGAZINE, 2006, 23 (02) : 18 - 27
  • [7] The myth of semantic video retrieval
    Dimitrova, N
    ACM COMPUTING SURVEYS, 1995, 27 (04) : 584 - 586
  • [8] On Semantic Similarity in Video Retrieval
    Wray, Michael
    Doughty, Hazel
    Damen, Dima
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3649 - 3659
  • [9] Semantic Video Event Search for Surveillance Video
    Choe, Tae Eun
    Lee, Mun Wai
    Guo, Feng
    Taylor, Geoffrey
    Yu, Li
    Haering, Niels
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [10] Semantic retrieval method based on the fuzzy reasoning
    Cao, Jia-Heng
    Liu, Juan
    Peng, Min
    Shu, Feng-Di
    Wuhan University Journal of Natural Sciences, 2002, 7 (02) : 169 - 173