Joint Segmentation and Classification of Human Actions in Video

Cited by: 0
Authors
Hoai, Minh [1]
Lan, Zhen-Zhong [1]
De la Torre, Fernando [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
DOI: not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Automatic video segmentation and action recognition has been a long-standing problem in computer vision. Much work in the literature treats video segmentation and action recognition as two independent problems; while segmentation is often done without a temporal model of the activity, action recognition is usually performed on pre-segmented clips. In this paper we propose a novel method that avoids the limitations of the above approaches by jointly performing video segmentation and action recognition. Unlike standard approaches based on extensions of dynamic Bayesian networks, our method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition. The classification is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming. Experimental results on honeybee, Weizmann, and Hollywood datasets illustrate the benefits of our approach compared to state-of-the-art methods.
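The abstract's combination of per-segment classification and dynamic programming over segment boundaries can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes one visual-word index per frame, a linear per-class scorer (`W`, `b`) standing in for the multi-class SVM, and an objective that simply sums the best class score of each segment. The function name `segment_and_label` and the length bounds `min_len` and `max_len` are hypothetical.

```python
import numpy as np

def segment_and_label(frame_words, W, b, min_len, max_len):
    """Jointly pick segment boundaries and labels by dynamic programming.

    frame_words: sequence of T visual-word indices, one per frame (assumption).
    W: (K, V) linear class weights over a V-word codebook; b: (K,) biases.
    Returns a list of (start, end_exclusive, label) covering all T frames.
    """
    frame_words = np.asarray(frame_words, dtype=int)
    T = len(frame_words)
    K = W.shape[0]

    # A segment's bag-of-words histogram is a sum of one-hot vectors, so its
    # linear score is the sum of per-frame scores; prefix sums make any
    # segment score an O(K) lookup.
    frame_scores = W[:, frame_words]                      # shape (K, T)
    prefix = np.concatenate(
        [np.zeros((K, 1)), np.cumsum(frame_scores, axis=1)], axis=1
    )                                                     # shape (K, T + 1)

    best = np.full(T + 1, -np.inf)   # best[t]: best total score for frames [0, t)
    best[0] = 0.0
    back = [None] * (T + 1)          # back[t]: (segment start, label) ending at t

    for end in range(1, T + 1):
        for length in range(min_len, min(max_len, end) + 1):
            start = end - length
            if best[start] == -np.inf:
                continue             # no valid segmentation of the prefix
            seg_scores = prefix[:, end] - prefix[:, start] + b
            label = int(np.argmax(seg_scores))
            total = best[start] + seg_scores[label]
            if total > best[end]:
                best[end] = total
                back[end] = (start, label)

    if back[T] is None:
        raise ValueError("no segmentation satisfies the length bounds")

    # Walk the back-pointers from the last frame to recover the segments.
    segments = []
    end = T
    while end > 0:
        start, label = back[end]
        segments.append((start, end, label))
        end = start
    segments.reverse()
    return segments
```

With two classes that each favor one codeword, a call such as `segment_and_label([0, 0, 0, 1, 1, 1], np.array([[1.0, -1.0], [-1.0, 1.0]]), np.zeros(2), 2, 4)` splits the sequence at the point where the dominant word changes. The running time is O(T · max_len · K) under these assumptions.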
Pages: 8