Recurrent Compressed Convolutional Networks for Short Video Event Detection

Cited by: 4
Authors
Li, Ping [1]
Xu, Xianghua [1]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
Source
IEEE ACCESS | 2020 / Vol. 8
Funding
National Natural Science Foundation of China;
Keywords
Compressed domain; event analysis; recurrent neural networks; short video event detection; temporal dependency; ACTION RECOGNITION;
DOI
10.1109/ACCESS.2020.3003939
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Short videos are popular information carriers on the Internet, and detecting events from them can benefit a wide range of applications, e.g., video browsing, management, retrieval and recommendation. Existing video analysis methods typically require decoding all frames of a video in advance, which is very costly in time and computation power. Short videos are also often untrimmed, noisy and even incomplete, which adds much difficulty to event analysis. Unlike previous works focusing on actions, we target short video event detection and propose Recurrent Compressed Convolutional Networks (RCCN) for discovering the underlying event patterns within short videos, which may include a large proportion of non-event videos. Instead of using the whole videos, RCCN performs representation learning at much lower cost within the compressed domain, where the encoded motion information reflecting the spatial relations among frames can be easily obtained to capture the dynamic tendency of event videos. This alleviates the information incompleteness problem that frequently emerges in user-generated short videos. In particular, RCCN leverages convolutional networks as the backbone and Long Short-Term Memory components to model the variable-range temporal dependency among untrimmed video frames. RCCN not only learns the common representation shared by the short videos of the same event, but also obtains the discriminative ability to detect dissimilar videos. We benchmark the model performance on a set of short videos generated from the publicly available event detection database YLIMED, and compare RCCN with several baselines and state-of-the-art alternatives. Empirical studies have verified the preferable performance of RCCN.
Pages: 114162 - 114171
Page count: 10
Related papers
50 in total
  • [1] Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
    Cakir, Emre
    Parascandolo, Giambattista
    Heittola, Toni
    Huttunen, Heikki
    Virtanen, Tuomas
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1291 - 1303
  • [2] SOUND EVENT DETECTION VIA DILATED CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Li, Yanxiong
    Liu, Mingle
    Drossos, Konstantinos
    Virtanen, Tuomas
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 286 - 290
  • [3] Smoke Detection on Video Sequences Using Convolutional and Recurrent Neural Networks
    Filonenko, Alexander
    Kurnianggoro, Laksono
    Jo, Kang-Hyun
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2017, PT II, 2017, 10449 : 558 - 566
  • [4] CONVOLUTIONAL GATED RECURRENT NETWORKS FOR VIDEO SEGMENTATION
    Siam, Mennatullah
    Valipour, Sepehr
    Jagersand, Martin
    Ray, Nilanjan
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3090 - 3094
  • [5] Recurrent Fully Convolutional Networks for Video Segmentation
    Valipour, Sepehr
    Siam, Mennatullah
    Jagersand, Martin
    Ray, Nilanjan
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 29 - 36
  • [6] Event detection from MPEG video in the compressed domain
    Yoon, K
    DeMenthon, D
    Doermann, D
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 819 - 822
  • [7] Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
    Adavanne, Sharath
    Politis, Archontis
    Nikunen, Joonas
    Virtanen, Tuomas
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 34 - 48
  • [8] Deep-fake video detection approaches using convolutional - recurrent neural networks
    Suratkar, Shraddha
    Bhiungade, Sayali
    Pitale, Jui
    Soni, Komal
    Badgujar, Tushar
    Kazi, Faruk
    [J]. JOURNAL OF CONTROL AND DECISION, 2023, 10 (02) : 198 - 214
  • [9] Video Anomaly Detection Based on Convolutional Recurrent AutoEncoder
    Wang, Bokun
    Yang, Caiqian
    [J]. SENSORS, 2022, 22 (12)
  • [10] Video Object Detection Using Event-Aware Convolutional Lstm and Object Relation Networks
    Zhang, Chen
    Xia, Zhengyu
    Kim, Joohee
    [J]. ELECTRONICS, 2021, 10 (16)