LSTM-based multi-label video event detection

被引:28
|
作者
Liu, An-An [1 ]
Shao, Zhuang [1 ]
Wong, Yongkang [2 ]
Li, Junnan [3 ]
Su, Yu-Ting [1 ]
Kankanhalli, Mohan [4 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Natl Univ Singapore, Smart Syst Inst, Singapore, Singapore
[3] Natl Univ Singapore, NUS Grad Sch Integrat Sci & Engn, Singapore, Singapore
[4] Natl Univ Singapore, Sch Comp, Singapore, Singapore
基金
中国国家自然科学基金; 新加坡国家研究基金会;
关键词
Concurrent event detections; Recurrent neural network; HISTOGRAMS; RECOGNITION; FLOW;
D O I
10.1007/s11042-017-5532-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since large-scale surveillance videos always contain complex visual events, how to generate video descriptions effectively and efficiently without human supervision has become mandatory. To address this problem, we propose a novel architecture for jointly recognizing multiple events in a given surveillance video, motivated by the sequence to sequence network. The proposed architecture can predict what happens in a video directly without the preprocessing of object detection and tracking. We evaluate several variants of the proposed architecture with different visual features on a novel dataset perpared by our group. Moreover, we compute a wide range of quantitative metrics to evaluate this architecture. We further compare it to the popular Support Vector Machine-based visual event detection method. The comparison results suggest that the proposal method can outperform the traditional computer vision pipelines for visual event detection.
引用
收藏
页码:677 / 695
页数:19
相关论文
共 50 条
  • [1] LSTM-based multi-label video event detection
    An-An Liu
    Zhuang Shao
    Yongkang Wong
    Junnan Li
    Yu-Ting Su
    Mohan Kankanhalli
    Multimedia Tools and Applications, 2019, 78 : 677 - 695
  • [2] LSTM-based Quick Event Detection in Power Systems
    Wang, Boyu
    Li, Yan
    Yang, Jing
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [3] Complex video event detection via pairwise fusion of trajectory and multi-label hypergraphs
    Chen, Xiao-jun
    Zhan, Yong-zhao
    Ke, Jia
    Chen, Xiao-bo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (22) : 15079 - 15100
  • [4] Complex video event detection via pairwise fusion of trajectory and multi-label hypergraphs
    Xiao-jun Chen
    Yong-zhao Zhan
    Jia Ke
    Xiao-bo Chen
    Multimedia Tools and Applications, 2016, 75 : 15079 - 15100
  • [5] Optimizing Football Formation Analysis via LSTM-Based Event Detection
    Orr, Benjamin
    Pan, Ephraim
    Lee, Dah-Jye
    ELECTRONICS, 2024, 13 (20)
  • [6] LSTM-BASED WHISPER DETECTION
    Raeesy, Zeynab
    Gillespie, Kellen
    Ma, Chengyuan
    Drugman, Thomas
    Gu, Jiacheng
    Maas, Roland
    Rastrow, Ariya
    Hoffmeister, Bjorn
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 139 - 144
  • [7] Time Aggregation Operators for Multi-label Audio Event Detection
    Joshi, Pankaj
    Gautam, Digvijaysingh
    Ramakrishnan, Ganesh
    Jyothi, Preethi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3309 - 3313
  • [8] A transductive multi-label learning approach for video concept detection
    Wang, Jingdong
    Zhao, Yinghai
    Wu, Xiuqing
    Hua, Xian-Sheng
    PATTERN RECOGNITION, 2011, 44 (10-11) : 2274 - 2286
  • [9] Multi-Label Vulnerability Detection of Smart Contracts Based on Bi-LSTM and Attention Mechanism
    Qian, Shenyi
    Ning, Haohan
    He, Yaqiong
    Chen, Mengqi
    ELECTRONICS, 2022, 11 (19)
  • [10] Detecting frog calling activity based on acoustic event detection and multi-label learning
    Xie, Jie
    Michael, Towsey
    Zhang, Jinglan
    Roe, Paul
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 627 - 638