Detecting human-object interactions in videos by modeling the trajectory of objects and human skeleton

被引:2
|
作者
Li, Qiyue [1 ]
Xie, Xuemei [1 ]
Zhang, Chen [1 ]
Zhang, Jin [1 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence Engn, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Human -Object Interaction; Human skeleton; Object trajectory; Graph convolutional networks;
D O I
10.1016/j.neucom.2022.08.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article focuses on the task of detecting human-object interactions (HOI) in videos, with the goal of identifying objects interacting with humans and predicting human-object interaction classes. Two frame-works are proposed which detect human-object interactions in videos by modeling the trajectory of objects and human skeleton. The first framework (knowledge-based spatial-temporal HOI) treats the entire scene to be a HOI graph made up of the human skeleton and objects. It has fewer parameters and a higher possibility for knowledge embedding. The second framework (hierarchical spatial-temporal HOI) constructs a HOI graph after obtaining the feature of the human skeleton and objects. It outperforms the competition in terms of performance and generalization. Experimental results in CAD-120 dataset and SYSU-HOI dataset show that the proposed frameworks are more advanced than the state-of-the-art methods, with smaller parameters and shorter inference time. Such results confirm that the proposed frameworks effectively reduce parameters and inference time while maintaining detection accuracy in HOI videos.(c) 2022 Published by Elsevier B.V.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 50 条
  • [41] Exemplar-Based Recognition of Human-Object Interactions
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Lai, Jianhuang
    Gong, Shaogang
    Xiang, Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 647 - 660
  • [42] Modeling 4D Human-Object Interactions for Joint Event Segmentation, Recognition, and Object Localization
    Wei, Ping
    Zhao, Yibiao
    Zheng, Nanning
    Zhu, Song-Chun
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1165 - 1179
  • [43] Human-Object Interaction Recognition Based on Modeling Context
    Shuyang Li
    Wei Liang
    Qun Zhang
    [J]. Journal of Beijing Institute of Technology, 2017, 26 (02) : 215 - 222
  • [44] Articulated 3D Human-Object Interactions From RGB Videos: An Empirical Analysis of Approaches and Challenges
    Haresh, Sanjay
    Sun, Xiaohao
    Jiang, Hanxiao
    Chang, Angel X.
    Savva, Manolis
    [J]. 2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 312 - 321
  • [45] Novel Anomalous Event Detection based on Human-object Interactions
    Colque, Rensso Mora
    Caetano, Carlos
    de Melo, Victor C.
    Chavez, Guillermo Camara
    Schwartz, William Robson
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 293 - 300
  • [46] SSRT: A Sequential Skeleton RGB Transformer to Recognize Fine-Grained Human-Object Interactions and Action Recognition
    Ghimire, Akash
    Kakani, Vijay
    Kim, Hakil
    [J]. IEEE ACCESS, 2023, 11 : 51930 - 51948
  • [47] Turbo Learning Framework for Human-Object Interactions Recognition and Human Pose Estimation
    Feng, Wei
    Liu, Wentao
    Li, Tong
    Peng, Jing
    Qian, Chen
    Hu, Xiaolin
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 898 - 905
  • [48] Action Prediction Based on Physically Grounded Object Affordances in Human-Object Interactions
    Dutta, Vibekananda
    Zielinska, Teresa
    [J]. 2017 11TH INTERNATIONAL WORKSHOP ON ROBOT MOTION AND CONTROL (ROMOCO), 2017, : 41 - 46
  • [49] NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions
    Jiang, Yuheng
    Jiang, Suyi
    Sun, Guoxing
    Su, Zhuo
    Guo, Kaiwen
    Wu, Minye
    Yu, Jingyi
    Xu, Lan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6145 - 6155
  • [50] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423