Detecting human-object interactions in videos by modeling the trajectory of objects and human skeleton

被引:2
|
作者
Li, Qiyue [1 ]
Xie, Xuemei [1 ]
Zhang, Chen [1 ]
Zhang, Jin [1 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence Engn, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Human -Object Interaction; Human skeleton; Object trajectory; Graph convolutional networks;
D O I
10.1016/j.neucom.2022.08.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article focuses on the task of detecting human-object interactions (HOI) in videos, with the goal of identifying objects interacting with humans and predicting human-object interaction classes. Two frame-works are proposed which detect human-object interactions in videos by modeling the trajectory of objects and human skeleton. The first framework (knowledge-based spatial-temporal HOI) treats the entire scene to be a HOI graph made up of the human skeleton and objects. It has fewer parameters and a higher possibility for knowledge embedding. The second framework (hierarchical spatial-temporal HOI) constructs a HOI graph after obtaining the feature of the human skeleton and objects. It outperforms the competition in terms of performance and generalization. Experimental results in CAD-120 dataset and SYSU-HOI dataset show that the proposed frameworks are more advanced than the state-of-the-art methods, with smaller parameters and shorter inference time. Such results confirm that the proposed frameworks effectively reduce parameters and inference time while maintaining detection accuracy in HOI videos.(c) 2022 Published by Elsevier B.V.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 50 条
  • [31] Human-Object Interaction Recognition by Learning the distances between the Object and the Skeleton Joints
    Meng, Meng
    Drira, Hassen
    Daoudi, Mohamed
    Boonaert, Jacques
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 7, 2015,
  • [32] Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities
    Yao, Bangpeng
    Li Fei-Fei
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 17 - 24
  • [33] NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions
    Zhang, Juze
    Luo, Haimin
    Yang, Hongdi
    Xu, Xinru
    Wu, Qianyang
    Shi, Ye
    Yu, Jingyi
    Xu, Lan
    Wang, Jingya
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8834 - 8845
  • [34] Recognizing Human-Object Interactions via Target Localization
    Cho, Sunyoung
    Park, Jihun
    Shin, Young Sook
    Lee, Sang-ho
    [J]. 2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 836 - 840
  • [35] The MECCANO Dataset: Understanding Human-Object Interactions from Egocentric Videos in an Industrial-like Domain
    Ragusa, Francesco
    Furnari, Antonino
    Livatino, Salvatore
    Farinella, Giovanni Maria
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1568 - 1577
  • [36] An Intelligent Framework for Recognizing Social Human-Object Interactions
    Alarfaj, Mohammed
    Waheed, Manahil
    Ghadi, Yazeed Yasin
    al Shloul, Tamara
    Alsuhibany, Suliman A.
    Jalal, Ahmad
    Park, Jeongmin
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1207 - 1223
  • [37] Predicting the Location of "interactees" in Novel Human-Object Interactions
    Chen, Chao-Yeh
    Grauman, Kristen
    [J]. COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 351 - 367
  • [38] Exemplar-Based Recognition of Human-Object Interactions
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Lai, Jianhuang
    Gong, Shaogang
    Xiang, Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 647 - 660
  • [39] Detection of Generic Human-Object Interactions in Video Streams
    Bruckschen, Lilli
    Amft, Sabrina
    Tanke, Julian
    Gall, Juergen
    Bennewitz, Maren
    [J]. SOCIAL ROBOTICS, ICSR 2019, 2019, 11876 : 108 - 118
  • [40] Detecting Human-Object Interaction via Fabricated Compositional Learning
    Hou, Zhi
    Yu, Baosheng
    Qiao, Yu
    Peng, Xiaojiang
    Tao, Dacheng
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14641 - 14650