Semantic Decomposition and Recognition of Long and Complex Manipulation Action Sequences

被引:0
|
作者
Eren Erdal Aksoy
Adil Orhan
Florentin Wörgötter
机构
[1] Karlsruhe Institute of Technology,Institute for Anthropomatics and Robotics, High Performance Humanoid Technologies (H²T)
[2] Georg-August-Universität Göttingen,undefined
[3] BCCN,undefined
来源
关键词
Semantic decomposition; Temporal segmentation; Action recognition; Manipulation action; Semantic event chain;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding continuous human actions is a non-trivial but important problem in computer vision. Although there exists a large corpus of work in the recognition of action sequences, most approaches suffer from problems relating to vast variations in motions, action combinations, and scene contexts. In this paper, we introduce a novel method for semantic segmentation and recognition of long and complex manipulation action tasks, such as “preparing a breakfast” or “making a sandwich”. We represent manipulations with our recently introduced “Semantic Event Chain” (SEC) concept, which captures the underlying spatiotemporal structure of an action invariant to motion, velocity, and scene context. Solely based on the spatiotemporal interactions between manipulated objects and hands in the extracted SEC, the framework automatically parses individual manipulation streams performed either sequentially or concurrently. Using event chains, our method further extracts basic primitive elements of each parsed manipulation. Without requiring any prior object knowledge, the proposed framework can also extract object-like scene entities that exhibit the same role in semantically similar manipulations. We conduct extensive experiments on various recent datasets to validate the robustness of the framework.
引用
收藏
页码:84 / 115
页数:31
相关论文
共 50 条
  • [21] Averaging Video Sequences to Improve Action Recognition
    Gao, Zhen
    Lu, Guoliang
    Yan, Peng
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 89 - 93
  • [22] Curvature: A signature for Action Recognition in Video Sequences
    Chen, He
    Chirikjian, Gregory S.
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3743 - 3750
  • [23] Action recognition by using kernels on aclets sequences
    Brun, Luc
    Percannella, Gennaro
    Saggese, Alessia
    Vento, Mario
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 144 : 3 - 13
  • [24] Action-Conditioned Generation of Bimanual Object Manipulation Sequences
    Razali, Haziq
    Demiris, Yiannis
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2146 - 2154
  • [25] DECOMPOSITION OF COMPLEX SHAPES FOR THEIR STRUCTURAL RECOGNITION
    JAKUBOWSKI, R
    INFORMATION SCIENCES, 1990, 50 (01) : 35 - 71
  • [26] A Variational Graph Autoencoder for Manipulation Action Recognition and Prediction
    Akyol, Gamze
    Sariel, Sanem
    Aksoy, Eren Erdal
    2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 968 - 973
  • [27] Recognition by action: Dissociating visual and semantic routes to action in normal observers
    Rumiati, RI
    Humphreys, GW
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1998, 24 (02) : 631 - 647
  • [28] Action recognition based on dynamic mode decomposition
    Dong, Shuai
    Zhang, Weixi
    Wang, Wei
    Zou, Kun
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (6) : 7159 - 7172
  • [29] Action recognition based on dynamic mode decomposition
    Shuai Dong
    Weixi Zhang
    Wei Wang
    Kun Zou
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 7159 - 7172
  • [30] SEMANTIC SEGMENTATION AND RECOGNITION OF TEMPORAL PATTERNS IN URBAN SAR SEQUENCES
    Che, M.
    Vizziello, A.
    Gamba, P.
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 6878 - 6881