Temporal Action Co-segmentation in 3D Motion Capture Data and Videos

被引:2
|
作者
Papoutsakis, Konstantinos [1 ,2 ]
Panagiotakis, Costas [1 ,3 ]
Argyros, Antonis A. [1 ,2 ]
机构
[1] FORTH, Inst Comp Sci, Computat Vis & Robot Lab, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Rethimnon, Greece
[3] TEI Crete, Business Adm Dept Agios Nikolaos, Iraklion, Greece
基金
欧盟地平线“2020”;
关键词
D O I
10.1109/CVPR.2017.231
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given two action sequences, we are interested in spotting/co-segmenting all pairs of sub-sequences that represent the same action. We propose a totally unsupervised solution to this problem. No a-priori model of the actions is assumed to be available. The number of common sub-sequences may be unknown. The sub-sequences can be located anywhere in the original sequences, may differ in duration and the corresponding actions may be performed by a different person, in different style. We treat this type of temporal action co-segmentation as a stochastic optimization problem that is solved by employing Particle Swarm Optimization (PSO). The objective function that is minimized by PSO capitalizes on Dynamic Time Warping (DTW) to compare two action sub-sequences. Due to the generic problem formulation and solution, the proposed method can be applied to motion capture (i.e., 3D skeletal) data or to conventional RGB videos acquired in the wild. We present extensive quantitative experiments on standard data sets as well as on data sets we introduced in this paper. The obtained results demonstrate that the proposed method achieves a remarkable increase in co-segmentation quality compared to all tested state of the art methods.
引用
收藏
页码:2146 / 2155
页数:10
相关论文
共 50 条
  • [41] Redundancy Reduction in 3D Facial Motion Capture Data for Animation
    Wellein, Daniela I.
    Curio, Cristobal
    Buelthoff, Heinrich H.
    APGV 2007: SYMPOSIUM ON APPLIED PERCEPTION IN GRAPHICS AND VISUALIZATION, PROCEEDINGS, 2007, : 136 - 136
  • [42] Semantic quantization of 3D human motion capture data through spatial-temporal feature extraction
    Jin, Yohan
    Prabhakaran, B.
    ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2008, 4903 : 318 - 328
  • [43] Visual discomfort of stereoscopic 3D videos: Influence of 3D motion
    Li, Jing
    Barkowsky, Marcus
    Le Callet, Patrick
    DISPLAYS, 2014, 35 (01) : 49 - 57
  • [44] PRECISE PLAYER SEGMENTATION IN TEAM SPORTS VIDEOS USING CONTRAST-AWARE CO-SEGMENTATION
    Tsai, Tsung-Yu
    Lin, Yen-Yu
    Liao, Hong-Yuan Mark
    Jeng, Shyh-Kang
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1826 - 1830
  • [45] Motion Segmentation and Scene Classification from 3D LIDAR Data
    Steinhauser, Dominik
    Ruepp, Oliver
    Burschka, Darius
    2008 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2008, : 940 - 945
  • [46] Action Recognition in Videos with Spatio-Temporal Fusion 3D Convolutional Neural Networks
    Wang, Y.
    Shen, X. J.
    Chen, H. P.
    Sun, J. X.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2021, 31 (03) : 580 - 587
  • [47] Action Recognition in Videos with Spatio-Temporal Fusion 3D Convolutional Neural Networks
    Y. Wang
    X. J. Shen
    H. P. Chen
    J. X. Sun
    Pattern Recognition and Image Analysis, 2021, 31 : 580 - 587
  • [48] Efficient Single-View 3D Co-segmentation Using Shape Similarity and Spatial Part Relations
    Araslanov, Nikita
    Koo, Seongyong
    Gall, Juergen
    Behnke, Sven
    PATTERN RECOGNITION, GCPR 2016, 2016, 9796 : 297 - 308
  • [49] 3D PET/CT Tumor Co-Segmentation Based on Background Subtraction Hybrid Active Contour Model
    Li, Laquan
    Jiang, Chuangbo
    Wang, Patrick Shen-Pei
    Zheng, Shenhai
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (08)
  • [50] Temporal Deformable Residual Networks for Action Segmentation in Videos
    Lei, Peng
    Todorovic, Sinisa
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6742 - 6751