Joint Segmentation and Classification of Human Actions in Video

Cited by: 0
Authors
Minh Hoai [1]
Lan, Zhen-Zhong [1]
De la Torre, Fernando [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
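Source
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2011
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract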
Automatic video segmentation and action recognition has been a long-standing problem in computer vision. Much work in the literature treats video segmentation and action recognition as two independent problems; while segmentation is often done without a temporal model of the activity, action recognition is usually performed on pre-segmented clips. In this paper we propose a novel method that avoids the limitations of the above approaches by jointly performing video segmentation and action recognition. Unlike standard approaches based on extensions of dynamic Bayesian networks, our method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition. The classification is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming. Experimental results on honeybee, Weizmann, and Hollywood datasets illustrate the benefits of our approach compared to state-of-the-art methods.
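The abstract's idea of combining per-segment classification with dynamic-programming inference over segment boundaries can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each frame is summarized by a bag-of-words histogram, that a segment's score for a class is a linear function (weights and bias, as a trained multi-class SVM might supply) of the summed histograms, and that segment lengths are bounded. The names `segment_and_classify`, `frame_hists`, `weights`, `biases`, `min_len`, and `max_len` are hypothetical.

```python
import numpy as np


def segment_and_classify(frame_hists, weights, biases, min_len, max_len):
    """Choose segment boundaries and class labels that maximize the total score.

    frame_hists: (T, D) array, one bag-of-words histogram per frame (assumed input).
    weights:     (K, D) array of per-class linear weights (assumed to be pre-trained).
    biases:      (K,) array of per-class biases.
    Returns (segments, total_score); each segment is (start, end, class_index)
    and covers frames start .. end-1.
    """
    num_frames, dim = frame_hists.shape
    # prefix[t] holds the sum of the histograms of frames 0 .. t-1, so any
    # segment histogram is a difference of two prefix rows.
    prefix = np.vstack([np.zeros(dim), np.cumsum(frame_hists, axis=0)])

    best = np.full(num_frames + 1, -np.inf)  # best[t]: best score for frames 0 .. t-1
    best[0] = 0.0
    back = [None] * (num_frames + 1)         # back[t]: (segment start, class) ending at t

    for end in range(1, num_frames + 1):
        for length in range(min_len, min(max_len, end) + 1):
            start = end - length
            if best[start] == -np.inf:
                continue  # no valid segmentation of frames 0 .. start-1
            class_scores = weights @ (prefix[end] - prefix[start]) + biases
            k = int(np.argmax(class_scores))
            total = best[start] + class_scores[k]
            if total > best[end]:
                best[end] = total
                back[end] = (start, k)

    if back[num_frames] is None:
        raise ValueError("no segmentation satisfies the length constraints")

    # Walk the back-pointers from the last frame to recover the segments.
    segments = []
    end = num_frames
    while end > 0:
        start, k = back[end]
        segments.append((start, end, k))
        end = start
    return segments[::-1], float(best[num_frames])
```

Under these assumptions, the function takes a `(T, D)` histogram array and returns a list of `(start, end, class_index)` triples that cover the whole video, together with the total score. A negative bias acts as a per-segment penalty and discourages over-fragmentation.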
Pages: 8
Related papers
50 in total
  • [41] Statistical Descriptors for Human Actions Classification
    Syrris, Vassilis
    Petridis, Vassilios
    MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3, 2009, : 412 - 415
  • [42] Video Object Segmentation by Hierarchical Localized Classification of Regions
    Zhang, Chenguang
    Ai, Haizhou
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 244 - 248
  • [43] Video analysis for segmentation and classification of players at soccer games
    Gomez C, Angela M.
    Trejos L, Luisa F.
    Osorio A, Estefany
    Calvo S, Andres F.
    Holguin L, Mauricio
    Holguin L, German A.
    2015 10TH COMPUTING COLOMBIAN CONFERENCE (10CCC), 2015, : 331 - 338
  • [44] Temporal segmentation and assignment of successive actions in a long-term video
    Lu, Guoliang
    Kudo, Mineichi
    Toyama, Jun
    PATTERN RECOGNITION LETTERS, 2013, 34 (15) : 1936 - 1944
  • [45] Violence region localization in video and the school violent actions classification
    Ha, Ngo Duong
    Tran, Nhu Y.
    Thuy, Le Nhi Lam
    Shimizu, Ikuko
    Bao, Pham The
    FRONTIERS IN COMPUTER SCIENCE, 2023, 5
  • [46] Deep Learning for Joint Classification and Segmentation of Histopathology Image
    Park, Hyun-Cheol
    Ghimire, Raman
    Poudel, Sahadev
    Lee, Sang-Woong
    JOURNAL OF INTERNET TECHNOLOGY, 2022, 23 (04) : 903 - 910
  • [47] A joint model for lesion segmentation and classification of MS and NMOSD
    Huang, Lan
    Shao, Yangguang
    Yang, Hui
    Guo, Chunjie
    Wang, Yan
    Zhao, Ziqi
    Gong, Yingchun
    FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [48] Joint object segmentation and Behavior classification in image sequences
    Gui, Laura
    Thiran, Jean-Philippe
    Paragios, Nikos
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2024 - +
  • [49] Joint Phoneme Segmentation Inference and Classification using CRFs
    Palaz, Dimitri
    Magimai-Doss, Mathew
    Collobert, Ronan
    2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 587 - 591
  • [50] Joint retina segmentation and classification for early glaucoma diagnosis
    Wang, Jie
    Wang, Zhe
    Li, Fei
    Qu, Guoxiang
    Qiao, Yu
    Lv, Hairong
    Zhang, Xiulan
    BIOMEDICAL OPTICS EXPRESS, 2019, 10 (05) : 2639 - 2656