Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning

Cited by: 277
Authors
Ali, Saad [1]
Shah, Mubarak [2]
Affiliations
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
[2] Univ Cent Florida, Comp Vis Lab, Sch Elect Engn & Comp Sci, Harris Corp Engn Ctr, Orlando, FL 32816 USA
Keywords
Action recognition; motion; video analysis; principal component analysis; kinematic features
DOI
10.1109/TPAMI.2008.284
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a set of kinematic features that are derived from the optical flow for human action recognition in videos. The set of kinematic features includes divergence, vorticity, symmetric and antisymmetric flow fields, second and third principal invariants of flow gradient and rate of strain tensor, and third principal invariant of rate of rotation tensor. Each kinematic feature, when computed from the optical flow of a sequence of images, gives rise to a spatiotemporal pattern. It is then assumed that the representative dynamics of the optical flow are captured by these spatiotemporal patterns in the form of dominant kinematic trends or kinematic modes. These kinematic modes are computed by performing Principal Component Analysis (PCA) on the spatiotemporal volumes of the kinematic features. For classification, we propose the use of multiple instance learning (MIL) in which each action video is represented by a bag of kinematic modes. Each video is then embedded into a kinematic-mode-based feature space and the coordinates of the video in that space are used for classification using the nearest neighbor algorithm. The qualitative and quantitative results are reported on the benchmark data sets.
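As a rough illustration of the pipeline described in the abstract, the following sketch computes two of the listed kinematic features (divergence and vorticity) from a dense optical flow sequence and then extracts dominant modes with PCA over time. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `kinematic_features` and `kinematic_modes`, the `(T, H, W)` array layout, the finite-difference gradients, and the choice of treating each frame as one PCA observation are all assumptions made for the example. The multiple instance learning embedding and nearest neighbor classification are not covered.

```python
import numpy as np


def kinematic_features(u, v):
    """Divergence and vorticity of a flow sequence.

    u, v: arrays of shape (T, H, W) holding the horizontal and vertical
    flow components (hypothetical layout chosen for this sketch).
    """
    # Spatial derivatives by finite differences: axis 1 is y, axis 2 is x.
    du_dy, du_dx = np.gradient(u, axis=(1, 2))
    dv_dy, dv_dx = np.gradient(v, axis=(1, 2))
    divergence = du_dx + dv_dy   # local expansion or contraction
    vorticity = dv_dx - du_dy    # local rotation
    return divergence, vorticity


def kinematic_modes(volume, n_modes=3):
    """Dominant spatial modes of a (T, H, W) feature volume via PCA.

    Assumption: each frame is one observation, so the modes are the
    leading right singular vectors of the mean-centered (T, H*W) matrix.
    """
    T = volume.shape[0]
    X = volume.reshape(T, -1)
    X = X - X.mean(axis=0, keepdims=True)
    _, s, vt = np.linalg.svd(X, full_matrices=False)
    modes = vt[:n_modes].reshape((n_modes,) + volume.shape[1:])
    return modes, s[:n_modes]
```

For a rigid rotation field such as `u = -y`, `v = x`, the divergence is zero and the vorticity is constant, which gives a quick sanity check on the sign conventions used here.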
Pages: 288-303
Page count: 16
Related papers
50 in total
  • [1] A Novel Dictionary Learning based Multiple Instance Learning Approach to Action Recognition from Videos
    Roy, Abhinaba
    Banerjee, Biplab
    Murino, Vittorio
    ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 519 - 526
  • [2] Human Action Recognition in Videos Using Hybrid Motion Features
    Liu, Si
    Liu, Jing
    Zhang, Tianzhu
    Lu, Hanqing
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 411 - 421
  • [3] Localized Multiple Kernel Learning for Realistic Human Action Recognition in Videos
    Song, Yan
    Zheng, Yan-Tao
    Tang, Sheng
    Zhou, Xiangdong
    Zhang, Yongdong
    Lin, Shouxun
    Chua, Tat-Seng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (09) : 1193 - 1202
  • [4] Using Deep Multiple Instance Learning for Action Recognition in Still Images
    Bas, Cagdas
    Zalluhoglu, Cemil
    Ikizler-Cinbis, Nazli
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [5] Human action recognition with graph-based multiple-instance learning
    Yi, Yang
    Lin, Maoqing
    PATTERN RECOGNITION, 2016, 53 : 148 - 162
  • [6] Kinematic Features For Human Action Recognition Using Restricted Boltzmann Machines
    Arinaldi, Ahmad
    Fanany, Mohamad Ivan
    2016 4TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2016,
  • [7] MULTIPLE INSTANCE DISCRIMINATIVE DICTIONARY LEARNING FOR ACTION RECOGNITION
    Li, Hongyang
    Chen, Jun
    Xu, Zengmin
    Chen, Huafeng
    Hu, Ruimin
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 2014 - 2018
  • [8] Learning correlations for human action recognition in videos
    Yi, Yun
    Wang, Hanli
    Zhang, Bowen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (18) : 18891 - 18913
  • [9] Learning correlations for human action recognition in videos
    Yun Yi
    Hanli Wang
    Bowen Zhang
    Multimedia Tools and Applications, 2017, 76 : 18891 - 18913
  • [10] Human Action Recognition in Unconstrained Videos Using Deep Learning Techniques
    Priya, G. G. Lakshmi
    Jain, Mrinal
    Perumal, R. Srinivasa
    Mouli, P. V. S. S. R. Chandra
    INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034 : 737 - 744