Joint Segmentation and Classification of Human Actions in Video

被引:0
|
作者
Minh Hoai [1 ]
Lan, Zhen-Zhong [1 ]
De la Torre, Fernando [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2011年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic video segmentation and action recognition has been a long-standing problem in computer vision. Much work in the literature treats video segmentation and action recognition as two independent problems; while segmentation is often done without a temporal model of the activity, action recognition is usually performed on pre-segmented clips. In this paper we propose a novel method that avoids the limitations of the above approaches by jointly performing video segmentation and action recognition. Unlike standard approaches based on extensions of dynamic Bayesian networks, our method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition. The classification is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming. Experimental results on honeybee, Weizmann, and Hollywood datasets illustrate the benefits of our approach compared to state-of-the-art methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Motion and Appearance Nonparametric Joint Entropy for Video Segmentation
    Sylvain Boltz
    Ariane Herbulot
    Eric Debreuve
    Michel Barlaud
    Gilles Aubert
    International Journal of Computer Vision, 2008, 80 : 242 - 259
  • [32] Motion and appearance nonparametric joint entropy for video segmentation
    Boltz, Sylvain
    Herbulot, Ariane
    Debreuve, Eric
    Barlaud, Michel
    Aubert, Gilles
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 80 (02) : 242 - 259
  • [33] Joint Rendering and Segmentation of Free-Viewpoint Video
    Masato Ishii
    Keita Takahashi
    Takeshi Naemura
    EURASIP Journal on Image and Video Processing, 2010
  • [34] Joint Inductive and Transductive Learning for Video Object Segmentation
    Mao, Yunyao
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9650 - 9659
  • [35] Joint Rendering and Segmentation of Free-Viewpoint Video
    Ishii, Masato
    Takahashi, Keita
    Naemura, Takeshi
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2010,
  • [36] Joint Attention Mechanism for Unsupervised Video Object Segmentation
    Yao, Rui
    Xu, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Fang, Liang
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 154 - 165
  • [37] Statistical framework for shot segmentation and classification in sports video
    Yang, Ying
    Lin, Shouxun
    Zhang, Yongdong
    Tang, Sheng
    COMPUTER VISION - ACCV 2007, PT II, PROCEEDINGS, 2007, 4844 : 106 - 115
  • [38] Video object segmentation and tracking using ψ-learning classification
    Liu, Y
    Zheng, YF
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (07) : 885 - 899
  • [39] SVM-based video scene classification and segmentation
    Zhu, Yingying
    Ming, Zhong
    MUE: 2008 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2008, : 407 - 412
  • [40] Segmentation, classification and watermarking for image/video semantic authentication
    Lin, CY
    Tseng, BL
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 359 - 362