Detecting human-object interactions in videos by modeling the trajectory of objects and human skeleton

被引:2
|
作者
Li, Qiyue [1 ]
Xie, Xuemei [1 ]
Zhang, Chen [1 ]
Zhang, Jin [1 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence Engn, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Human -Object Interaction; Human skeleton; Object trajectory; Graph convolutional networks;
D O I
10.1016/j.neucom.2022.08.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article focuses on the task of detecting human-object interactions (HOI) in videos, with the goal of identifying objects interacting with humans and predicting human-object interaction classes. Two frame-works are proposed which detect human-object interactions in videos by modeling the trajectory of objects and human skeleton. The first framework (knowledge-based spatial-temporal HOI) treats the entire scene to be a HOI graph made up of the human skeleton and objects. It has fewer parameters and a higher possibility for knowledge embedding. The second framework (hierarchical spatial-temporal HOI) constructs a HOI graph after obtaining the feature of the human skeleton and objects. It outperforms the competition in terms of performance and generalization. Experimental results in CAD-120 dataset and SYSU-HOI dataset show that the proposed frameworks are more advanced than the state-of-the-art methods, with smaller parameters and shorter inference time. Such results confirm that the proposed frameworks effectively reduce parameters and inference time while maintaining detection accuracy in HOI videos.(c) 2022 Published by Elsevier B.V.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 50 条
  • [1] Explicit Modeling of Human-Object Interactions in Realistic Videos
    Prest, Alessandro
    Ferrari, Vittorio
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (04) : 835 - 848
  • [2] Detecting Human-Object Relationships in Videos
    Ji, Jingwei
    Desai, Rishi
    Niebles, Juan Carlos
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8086 - 8096
  • [3] Detecting and Recognizing Human-Object Interactions
    Gkioxari, Georgia
    Girshick, Ross
    Dollar, Piotr
    He, Kaiming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8359 - 8367
  • [4] Predicting Human-Object Interactions in Egocentric Videos
    Benavent-Lledo, Manuel
    Oprea, Sergiu
    Alejandro Castro-Vargas, John
    Mulero-Perez, David
    Garcia-Rodriguez, Jose
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [5] Skew-Robust Human-Object Interactions in Videos
    Agarwal, Apoorva
    Dabral, Rishabh
    Jain, Arjun
    Ramakrishnan, Ganesh
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5087 - 5096
  • [6] Detecting Subtle Human-Object Interactions Using Kinect
    Ubalde, Sebastian
    Liu, Zicheng
    Mejail, Marta
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 770 - 777
  • [7] Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses
    Yao, Bangpeng
    Fei-Fei, Li
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1691 - 1703
  • [8] Detecting Human-Object Interactions via Functional Generalization
    Bansal, Ankan
    Rambhatla, Sai Saketh
    Shrivastava, Abhinav
    Chellappa, Rama
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10460 - 10469
  • [9] Spatially Conditioned Graphs for Detecting Human-Object Interactions
    Zhang, Frederic Z.
    Campbell, Dylan
    Gould, Stephen
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13299 - 13307
  • [10] Spatially Conditioned Graphs for Detecting Human-Object Interactions
    Zhang, Frederic Z.
    Campbell, Dylan
    Gould, Stephen
    [J]. Proceedings of the IEEE International Conference on Computer Vision, 2021, : 13299 - 13307