Semantic Decomposition and Recognition of Long and Complex Manipulation Action Sequences

被引:0
|
作者
Eren Erdal Aksoy
Adil Orhan
Florentin Wörgötter
机构
[1] Karlsruhe Institute of Technology,Institute for Anthropomatics and Robotics, High Performance Humanoid Technologies (H²T)
[2] Georg-August-Universität Göttingen,undefined
[3] BCCN,undefined
来源
关键词
Semantic decomposition; Temporal segmentation; Action recognition; Manipulation action; Semantic event chain;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding continuous human actions is a non-trivial but important problem in computer vision. Although there exists a large corpus of work in the recognition of action sequences, most approaches suffer from problems relating to vast variations in motions, action combinations, and scene contexts. In this paper, we introduce a novel method for semantic segmentation and recognition of long and complex manipulation action tasks, such as “preparing a breakfast” or “making a sandwich”. We represent manipulations with our recently introduced “Semantic Event Chain” (SEC) concept, which captures the underlying spatiotemporal structure of an action invariant to motion, velocity, and scene context. Solely based on the spatiotemporal interactions between manipulated objects and hands in the extracted SEC, the framework automatically parses individual manipulation streams performed either sequentially or concurrently. Using event chains, our method further extracts basic primitive elements of each parsed manipulation. Without requiring any prior object knowledge, the proposed framework can also extract object-like scene entities that exhibit the same role in semantically similar manipulations. We conduct extensive experiments on various recent datasets to validate the robustness of the framework.
引用
收藏
页码:84 / 115
页数:31
相关论文
共 50 条
  • [1] Semantic Decomposition and Recognition of Long and Complex Manipulation Action Sequences
    Aksoy, Eren Erdal
    Orhan, Adil
    Woergoetter, Florentin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 122 (01) : 84 - 115
  • [2] Segmentation and Recognition Model for Complex Action Sequences
    Geng, Hongyang
    Huan, Zhan
    Liang, Jiuzhen
    Hou, Zhenjie
    Lv, Shiyun
    Wang, Yunliang
    IEEE SENSORS JOURNAL, 2022, 22 (05) : 4347 - 4358
  • [3] SEMANTIC ACTIVATION AND LEXICAL DECOMPOSITION IN THE RECOGNITION OF COMPLEX KANJI CHARACTERS
    DARCAIS, GBF
    SAITO, HF
    COGNITION IN INDIVIDUAL AND SOCIAL CONTEXTS, 1989, : 101 - 109
  • [4] Semantic Component Decomposition for Face Attribute Manipulation
    Chen, Ying-Cong
    Shen, Xiaohui
    Lin, Zhe
    Lu, Xin
    Pao, I-Ming
    Jia, Jiaya
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9851 - 9859
  • [5] Memory for action sequences in semantic dementia
    Adlam, Anna-Lynne R.
    de Haan, Michelle
    Hodges, John R.
    Patterson, Karalyn
    NEUROPSYCHOLOGIA, 2013, 51 (08) : 1481 - 1487
  • [6] STSD: spatial–temporal semantic decomposition transformer for skeleton-based action recognition
    Hu Cui
    Tessai Hayama
    Multimedia Systems, 2024, 30
  • [7] STSD: spatial-temporal semantic decomposition transformer for skeleton-based action recognition
    Cui, Hu
    Hayama, Tessai
    MULTIMEDIA SYSTEMS, 2024, 30 (01)
  • [8] Semantic Pyramids for Gender and Action Recognition
    Khan, Fahad Shahbaz
    van de Weijer, Joost
    Anwer, Rao Muhammad
    Felsberg, Michael
    Gatta, Carlo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (08) : 3633 - 3645
  • [9] Weakly Semantic Guided Action Recognition
    Yu, Tingzhao
    Wang, Lingfeng
    Da, Cheng
    Gu, Huxiang
    Xiang, Shiming
    Pan, Chunhong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (10) : 2504 - 2517
  • [10] Complex Question Decomposition for Semantic Parsing
    Zhang, Haoyu
    Cai, Jingjing
    Xu, Jianjun
    Wang, Ji
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4477 - 4486