Joint Segmentation and Classification of Human Actions in Video

被引：0

作者：

Minh Hoai ^{[1
]}

Lan, Zhen-Zhong ^{[1
]}

De la Torre, Fernando ^{[1
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2011年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic video segmentation and action recognition has been a long-standing problem in computer vision. Much work in the literature treats video segmentation and action recognition as two independent problems; while segmentation is often done without a temporal model of the activity, action recognition is usually performed on pre-segmented clips. In this paper we propose a novel method that avoids the limitations of the above approaches by jointly performing video segmentation and action recognition. Unlike standard approaches based on extensions of dynamic Bayesian networks, our method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition. The classification is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming. Experimental results on honeybee, Weizmann, and Hollywood datasets illustrate the benefits of our approach compared to state-of-the-art methods.

引用

页数：8

共 50 条

[31] Motion and Appearance Nonparametric Joint Entropy for Video Segmentation
Sylvain Boltz
Ariane Herbulot
Eric Debreuve
Michel Barlaud
Gilles Aubert
International Journal of Computer Vision, 2008, 80 : 242 - 259
[32] Motion and appearance nonparametric joint entropy for video segmentation
Boltz, Sylvain
Herbulot, Ariane
Debreuve, Eric
Barlaud, Michel
Aubert, Gilles
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 80 (02) : 242 - 259
[33] Joint Rendering and Segmentation of Free-Viewpoint Video
Masato Ishii
Keita Takahashi
Takeshi Naemura
EURASIP Journal on Image and Video Processing, 2010
[34] Joint Inductive and Transductive Learning for Video Object Segmentation
Mao, Yunyao
Wang, Ning
Zhou, Wengang
Li, Houqiang
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9650 - 9659
[35] Joint Rendering and Segmentation of Free-Viewpoint Video
Ishii, Masato
Takahashi, Keita
Naemura, Takeshi
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2010,
[36] Joint Attention Mechanism for Unsupervised Video Object Segmentation
Yao, Rui
Xu, Xin
Zhou, Yong
Zhao, Jiaqi
Fang, Liang
PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 154 - 165
[37] Statistical framework for shot segmentation and classification in sports video
Yang, Ying
Lin, Shouxun
Zhang, Yongdong
Tang, Sheng
COMPUTER VISION - ACCV 2007, PT II, PROCEEDINGS, 2007, 4844 : 106 - 115
[38] Video object segmentation and tracking using ψ-learning classification
Liu, Y
Zheng, YF
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (07) : 885 - 899
[39] SVM-based video scene classification and segmentation
Zhu, Yingying
Ming, Zhong
MUE: 2008 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2008, : 407 - 412
[40] Segmentation, classification and watermarking for image/video semantic authentication
Lin, CY
Tseng, BL
PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 359 - 362

← 1 2 3 4 5 →