Learning Temporal Regularity in Video Sequences

被引:775
|
作者
Hasan, Mahmudul [1 ]
Choi, Jonghyun [2 ]
Neumann, Jan [2 ]
Roy-Chowdhury, Amit K. [1 ]
Davis, Larry S. [3 ]
机构
[1] UC Riverside, Riverside, CA 92521 USA
[2] Comcast Labs DC, Washington, DC USA
[3] Univ Maryland, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2016.86
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perceiving meaningful activities in a long video sequence is a challenging problem due to ambiguous definition of 'meaningfulness' as well as clutters in the scene. We approach this problem by learning a generative model for regular motion patterns (termed as regularity) using multiple sources with very limited supervision. Specifically, we propose two methods that are built upon the autoencoders for their ability to work with little to no supervision. We first leverage the conventional handcrafted spatio-temporal local features and learn a fully connected autoencoder on them. Second, we build a fully convolutional feed-forward autoencoder to learn both the local features and the classifiers as an end-to-end learning framework. Our model can capture the regularities from multiple datasets. We evaluate our methods in both qualitative and quantitative ways-showing the learned regularity of videos in various aspects and demonstrating competitive performance on anomaly detection datasets as an application.
引用
收藏
页码:733 / 742
页数:10
相关论文
共 50 条
  • [1] Temporal registration of video sequences
    Cheng, H
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 489 - 492
  • [2] On temporal span scalability of video sequences
    Tang, S
    Bigdeli, A
    Porat, M
    Salcic, Z
    [J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 881 - 884
  • [3] DETECTION OF TEMPORAL INTERPOLATION IN VIDEO SEQUENCES
    Bestagini, P.
    Battaglia, S.
    Milani, S.
    Tagliasacchi, M.
    Tubaro, S.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3033 - 3037
  • [4] Learning Temporal Features for Detection on Maritime Airborne Video Sequences Using Convolutional LSTM
    Cruz, Goncalo
    Bernardino, Alexandre
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (09): : 6565 - 6576
  • [5] Finding the optimal temporal partitioning of video sequences
    Truong, BT
    Venkatesh, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 1183 - 1186
  • [6] Temporal Segmentation of Facial Expressions in Video Sequences
    Xue, Yu
    Mei, Xue
    Bian, Jiali
    Wu, Liang
    Ding, Yao
    [J]. PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 10789 - 10794
  • [7] Temporal Segmentation of Human Actions in Video Sequences
    Maria Carmona, Josep
    Climent, Joan
    [J]. PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 786 - 790
  • [8] Computational approaches to temporal sampling of video sequences
    Liu, Tiecheng
    Kender, John R.
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2007, 3 (02)
  • [9] Temporal alignment of video sequences for watermarking systems
    Delannay, D
    de Roover, C
    Macq, B
    [J]. SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS V, 2003, 5020 : 481 - 492
  • [10] Quality metric for video sequences with temporal scalability
    Feghali, R
    Wang, DM
    Speranza, F
    Vincent, A
    [J]. 2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3297 - 3300