Temporal Action Co-segmentation in 3D Motion Capture Data and Videos

被引：2

作者：

Papoutsakis, Konstantinos ^{[1
,2
]}

Panagiotakis, Costas ^{[1
,3
]}

Argyros, Antonis A. ^{[1
,2
]}

机构：

[1] FORTH, Inst Comp Sci, Computat Vis & Robot Lab, Iraklion, Greece

[2] Univ Crete, Dept Comp Sci, Rethimnon, Greece

[3] TEI Crete, Business Adm Dept Agios Nikolaos, Iraklion, Greece

来源：

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年

基金：

欧盟地平线“2020”;

关键词：

D O I：

10.1109/CVPR.2017.231

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Given two action sequences, we are interested in spotting/co-segmenting all pairs of sub-sequences that represent the same action. We propose a totally unsupervised solution to this problem. No a-priori model of the actions is assumed to be available. The number of common sub-sequences may be unknown. The sub-sequences can be located anywhere in the original sequences, may differ in duration and the corresponding actions may be performed by a different person, in different style. We treat this type of temporal action co-segmentation as a stochastic optimization problem that is solved by employing Particle Swarm Optimization (PSO). The objective function that is minimized by PSO capitalizes on Dynamic Time Warping (DTW) to compare two action sub-sequences. Due to the generic problem formulation and solution, the proposed method can be applied to motion capture (i.e., 3D skeletal) data or to conventional RGB videos acquired in the wild. We present extensive quantitative experiments on standard data sets as well as on data sets we introduced in this paper. The obtained results demonstrate that the proposed method achieves a remarkable increase in co-segmentation quality compared to all tested state of the art methods.

引用

页码：2146 / 2155

页数：10

共 50 条

[41] Redundancy Reduction in 3D Facial Motion Capture Data for Animation
Wellein, Daniela I.
Curio, Cristobal
Buelthoff, Heinrich H.
APGV 2007: SYMPOSIUM ON APPLIED PERCEPTION IN GRAPHICS AND VISUALIZATION, PROCEEDINGS, 2007, : 136 - 136
[42] Semantic quantization of 3D human motion capture data through spatial-temporal feature extraction
Jin, Yohan
Prabhakaran, B.
ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2008, 4903 : 318 - 328
[43] Visual discomfort of stereoscopic 3D videos: Influence of 3D motion
Li, Jing
Barkowsky, Marcus
Le Callet, Patrick
DISPLAYS, 2014, 35 (01) : 49 - 57
[44] PRECISE PLAYER SEGMENTATION IN TEAM SPORTS VIDEOS USING CONTRAST-AWARE CO-SEGMENTATION
Tsai, Tsung-Yu
Lin, Yen-Yu
Liao, Hong-Yuan Mark
Jeng, Shyh-Kang
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1826 - 1830
[45] Motion Segmentation and Scene Classification from 3D LIDAR Data
Steinhauser, Dominik
Ruepp, Oliver
Burschka, Darius
2008 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2008, : 940 - 945
[46] Action Recognition in Videos with Spatio-Temporal Fusion 3D Convolutional Neural Networks
Wang, Y.
Shen, X. J.
Chen, H. P.
Sun, J. X.
PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (03) : 580 - 587
[47] Action Recognition in Videos with Spatio-Temporal Fusion 3D Convolutional Neural Networks
Y. Wang
X. J. Shen
H. P. Chen
J. X. Sun
Pattern Recognition and Image Analysis, 2021, 31 : 580 - 587
[48] Efficient Single-View 3D Co-segmentation Using Shape Similarity and Spatial Part Relations
Araslanov, Nikita
Koo, Seongyong
Gall, Juergen
Behnke, Sven
PATTERN RECOGNITION, GCPR 2016, 2016, 9796 : 297 - 308
[49] 3D PET/CT Tumor Co-Segmentation Based on Background Subtraction Hybrid Active Contour Model
Li, Laquan
Jiang, Chuangbo
Wang, Patrick Shen-Pei
Zheng, Shenhai
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (08)
[50] Temporal Deformable Residual Networks for Action Segmentation in Videos
Lei, Peng
Todorovic, Sinisa
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6742 - 6751

← 1 2 3 4 5 →