Joint Segmentation and Classification of Human Actions in Video

被引:0
|
作者
Minh Hoai [1 ]
Lan, Zhen-Zhong [1 ]
De la Torre, Fernando [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic video segmentation and action recognition has been a long-standing problem in computer vision. Much work in the literature treats video segmentation and action recognition as two independent problems; while segmentation is often done without a temporal model of the activity, action recognition is usually performed on pre-segmented clips. In this paper we propose a novel method that avoids the limitations of the above approaches by jointly performing video segmentation and action recognition. Unlike standard approaches based on extensions of dynamic Bayesian networks, our method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition. The classification is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming. Experimental results on honeybee, Weizmann, and Hollywood datasets illustrate the benefits of our approach compared to state-of-the-art methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Joint Inductive and Transductive Learning for Video Object Segmentation
    Mao, Yunyao
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9650 - 9659
  • [32] Joint Attention Mechanism for Unsupervised Video Object Segmentation
    Yao, Rui
    Xu, Xin
    Zhou, Yong
    Zhao, Jiaqi
    Fang, Liang
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 154 - 165
  • [33] Joint Rendering and Segmentation of Free-Viewpoint Video
    Ishii, Masato
    Takahashi, Keita
    Naemura, Takeshi
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2010,
  • [34] Video object segmentation and tracking using ψ-learning classification
    Liu, Y
    Zheng, YF
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (07) : 885 - 899
  • [35] Statistical framework for shot segmentation and classification in sports video
    Yang, Ying
    Lin, Shouxun
    Zhang, Yongdong
    Tang, Sheng
    [J]. COMPUTER VISION - ACCV 2007, PT II, PROCEEDINGS, 2007, 4844 : 106 - 115
  • [36] Video Object Segmentation by Hierarchical Localized Classification of Regions
    Zhang, Chenguang
    Ai, Haizhou
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 244 - 248
  • [37] SVM-based video scene classification and segmentation
    Zhu, Yingying
    Ming, Zhong
    [J]. MUE: 2008 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2008, : 407 - 412
  • [38] Segmentation, classification and watermarking for image/video semantic authentication
    Lin, CY
    Tseng, BL
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 359 - 362
  • [39] Video analysis for segmentation and classification of players at soccer games
    Gomez C, Angela M.
    Trejos L, Luisa F.
    Osorio A, Estefany
    Calvo S, Andres F.
    Holguin L, Mauricio
    Holguin L, German A.
    [J]. 2015 10TH COMPUTING COLOMBIAN CONFERENCE (10CCC), 2015, : 331 - 338
  • [40] Statistical Descriptors for Human Actions Classification
    Syrris, Vassilis
    Petridis, Vassilios
    [J]. MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3, 2009, : 412 - 415