HOIsim: Synthesizing Realistic 3D Human-Object Interaction Data for Human Activity Recognition

Cited by: 5
Authors
Zakour, Marsil [1]
Mellouli, Alaeddine [1]
Chaudhari, Rahul [1]
Affiliations
[1] Tech Univ Munich, Chair Media Technol, Human Assist Ambient Intelligence Grp, Munich, Germany
DOI
10.1109/RO-MAN50785.2021.9515349
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Correct understanding of human activities is critical for meaningful assistance by robots in daily life. The development of perception algorithms and Deep Learning models of human activity requires large-scale sensor datasets. Good real-world activity data is, however, difficult and time-consuming to acquire. Several precisely calibrated and time-synchronized sensors are required, and the annotation and labeling of the collected sensor data is extremely labor-intensive. To address these challenges, we present a 3D activity simulator, "HOIsim", focusing on Human-Object Interactions (HOIs). Using HOIsim, we provide a procedurally generated synthetic dataset of two sample daily-life activities, "lunch" and "breakfast". The dataset contains out-of-the-box ground truth annotations in the form of human and object poses, as well as ground truth activity labels. Furthermore, we introduce methods to meaningfully randomize activity flows and the environment topology. This allows us to generate a large number of random variants of these activities in very little time. Based on an abstraction of the low-level pose data in the form of spatiotemporal graphs of HOIs, we evaluate only the generated Lunch dataset, using two Deep Learning models for activity recognition. The first model, based on recurrent neural networks, achieves an accuracy of 87%, whereas the other, based on transformers, achieves an accuracy of 94.7%.
Pages: 1124-1131
Number of pages: 8