Semantic Decomposition and Recognition of Long and Complex Manipulation Action Sequences

被引:0
|
作者
Eren Erdal Aksoy
Adil Orhan
Florentin Wörgötter
机构
[1] Karlsruhe Institute of Technology,Institute for Anthropomatics and Robotics, High Performance Humanoid Technologies (H²T)
[2] Georg-August-Universität Göttingen,undefined
[3] BCCN,undefined
来源
关键词
Semantic decomposition; Temporal segmentation; Action recognition; Manipulation action; Semantic event chain;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding continuous human actions is a non-trivial but important problem in computer vision. Although there exists a large corpus of work in the recognition of action sequences, most approaches suffer from problems relating to vast variations in motions, action combinations, and scene contexts. In this paper, we introduce a novel method for semantic segmentation and recognition of long and complex manipulation action tasks, such as “preparing a breakfast” or “making a sandwich”. We represent manipulations with our recently introduced “Semantic Event Chain” (SEC) concept, which captures the underlying spatiotemporal structure of an action invariant to motion, velocity, and scene context. Solely based on the spatiotemporal interactions between manipulated objects and hands in the extracted SEC, the framework automatically parses individual manipulation streams performed either sequentially or concurrently. Using event chains, our method further extracts basic primitive elements of each parsed manipulation. Without requiring any prior object knowledge, the proposed framework can also extract object-like scene entities that exhibit the same role in semantically similar manipulations. We conduct extensive experiments on various recent datasets to validate the robustness of the framework.
引用
收藏
页码:84 / 115
页数:31
相关论文
共 50 条
  • [31] Recognition and prediction of manipulation actions using Enriched Semantic Event Chains
    Ziaeetabar, Fatemeh
    Kulvicius, Tomas
    Tamosiunaite, Minija
    Woergoetter, Florentin
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 110 : 173 - 188
  • [32] Human Action Recognition with Extremities as Semantic Posture Representation
    Yu, Elden
    Aggarwal, J. K.
    2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2, 2009, : 457 - 464
  • [33] Action Recognition Based on Learnt Motion Semantic Vocabulary
    Zhao, Qiong
    Lu, Zhiwu
    Ip, Horace H. S.
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 193 - 202
  • [34] Semantic Analysis in Human Action Recognition: A Comprehensive Study
    Zhang, Zhong
    Liu, Shuang
    Liu, Shuaiqi
    Han, Liang
    Shao, Yunxue
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, 2015, 322 : 573 - 580
  • [35] Heterogeneous Semantic Level Features Fusion for Action Recognition
    Cai, Junjie
    Merler, Michele
    Pankanti, Sharath
    Tian, Qi
    ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 307 - 314
  • [36] Learning Semantic Graph with Bayesian Networks for Action Recognition
    Zhang, Runjie
    Zhu, Ziqi
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 144 - 148
  • [37] Ear Recognition by Major Axis and Complex Vector Manipulation
    Su, Ching-Liang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (03): : 1650 - 1669
  • [38] Self-Attention-Masking Semantic Decomposition and Segmentation for Facial Attribute Manipulation
    Xia, Xuan
    Yu, Fengqi
    Li, Nan
    Qu, Yansong
    Zhang, Jiajia
    Zhu, Chengguang
    IEEE ACCESS, 2020, 8 : 36154 - 36165
  • [39] Recognition of manipulation sequences by human hand based on support vector machine
    Matsuo, Kazuya
    Murakami, Kouji
    Hasegawa, Tsutomu
    Kurazurne, Ryo
    IECON 2007: 33RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, CONFERENCE PROCEEDINGS, 2007, : 2801 - 2806
  • [40] Multi sentence description of complex manipulation action videos
    Ziaeetabar, Fatemeh
    Safabakhsh, Reza
    Momtazi, Saeedeh
    Tamosiunaite, Minija
    Woergoetter, Florentin
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)