Robust spatial-temporal deep model for multimedia event detection

被引:6
|
作者
Yu, Litao [1 ]
Sun, Xiaoshuai [1 ]
Huang, Zi [1 ]
机构
[1] Univ Queensland, Brisbane, Qld 4072, Australia
关键词
Multimedia event detection; Deep learning; GRU; NEURAL-NETWORKS;
D O I
10.1016/j.neucom.2016.03.102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The task of multimedia event detection (MED) aims at training a set of models that can automatically detect the most event-relevant videos from large datasets. In this paper, we attempt to build a robust spatial-temporal deep neural network for large-scale video event detection. In our setting, each video follows a multiple instance assumption, where its visual segments contain both spatial and temporal properties of events. Regarding these properties, we try to implement the MED system by a two-step training phase: unsupervised recurrent video reconstruction and supervised fine-tuning. We conduct extensive experiments on the challenging TRECVID MED14 dataset, which indicate that with the consideration of both spatial and temporal information, the detection performance can be further boosted compared with the state-of-the-art MED models. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [1] MULTIMEDIA EVENT DETECTION VIA DEEP SPATIAL-TEMPORAL NEURAL NETWORKS
    Hou, Jingyi
    Wu, Xinxiao
    Yu, Feiwu
    Jia, Yunde
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [2] Spatial-temporal event prediction: A new model
    Liu, H
    Brown, DE
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2933 - 2937
  • [3] Deep spatial-temporal networks for flame detection
    Shahid, Mohammad
    Chien, I-Feng
    Sarapugdi, Wannaporn
    Miao, Lili
    Hua, Kai-Lung
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 35297 - 35318
  • [4] Deep spatial-temporal networks for flame detection
    Mohammad Shahid
    I-Feng Chien
    Wannaporn Sarapugdi
    Lili Miao
    Kai-Lung Hua
    [J]. Multimedia Tools and Applications, 2021, 80 : 35297 - 35318
  • [5] Spatial-Temporal Event Correlation
    Buford, John F.
    Wu, Xiaotao
    Krishnaswamy, Venkatesh
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 2617 - 2622
  • [6] A robust spatial-temporal prediction model for photovoltaic power generation based on deep learning
    Wang, Zun
    Wang, Yashun
    Cao, Shenglei
    Fan, Siyuan
    Zhang, Yanhui
    Liu, Yuning
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2023, 110
  • [7] Early Event Detection Based on Multivariate Spatial-Temporal Fusion
    Liu, Xuying
    Lu, Wenda
    Wang, Jian
    Wang, Xue
    Wang, Qing
    [J]. Computer Engineering and Applications, 2023, 59 (17): : 116 - 122
  • [8] Trajectory-Pooled Spatial-Temporal Architecture of Deep Convolutional Neural Networks for Video Event Detection
    Li, Yonggang
    Ge, Rui
    Ji, Yi
    Gong, Shengrong
    Liu, Chunping
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (09) : 2683 - 2692
  • [9] Deep spatial-temporal structure learning for rumor detection on Twitter
    Huang, Qi
    Zhou, Chuan
    Wu, Jia
    Liu, Luchen
    Wang, Bin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (18): : 12995 - 13005
  • [10] A hybrid spatial-temporal deep learning architecture for lane detection
    Dong, Yongqi
    Patil, Sandeep
    van Arem, Bart
    Farah, Haneen
    [J]. COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2023, 38 (01) : 67 - 86