Modeling 4D Human-Object Interactions for Event and Object Recognition

被引:66
|
作者
Wei, Ping [1 ,2 ]
Zhao, Yibiao [2 ]
Zheng, Nanning [1 ]
Zhu, Song-Chun [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Univ Calif Los Angeles, Los Angeles, CA USA
关键词
D O I
10.1109/ICCV.2013.406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing the events and objects in the video sequence are two challenging tasks due to the complex temporal structures and the large appearance variations. In this paper, we propose a 4D human-object interaction model, where the two tasks jointly boost each other. Our human-object interaction is defined in 4D space: i) the co-occurrence and geometric constraints of human pose and object in 3D space; ii) the sub-events transition and objects coherence in 1D temporal dimension. We represent the structure of events, sub-events and objects in a hierarchical graph. For an input RGB-depth video, we design a dynamic programming beam search algorithm to: i) segment the video, ii) recognize the events, and iii) detect the objects simultaneously. For evaluation, we built a large-scale multiview 3D event dataset which contains 3815 video sequences and 383,036 RGBD frames captured by the Kinect cameras. The experiment results on this dataset show the effectiveness of our method.
引用
收藏
页码:3272 / 3279
页数:8
相关论文
共 50 条
  • [31] An Intelligent Framework for Recognizing Social Human-Object Interactions
    Alarfaj, Mohammed
    Waheed, Manahil
    Ghadi, Yazeed Yasin
    al Shloul, Tamara
    Alsuhibany, Suliman A.
    Jalal, Ahmad
    Park, Jeongmin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1207 - 1223
  • [32] Detecting Subtle Human-Object Interactions Using Kinect
    Ubalde, Sebastian
    Liu, Zicheng
    Mejail, Marta
    PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 770 - 777
  • [33] Detection of Generic Human-Object Interactions in Video Streams
    Bruckschen, Lilli
    Amft, Sabrina
    Tanke, Julian
    Gall, Juergen
    Bennewitz, Maren
    SOCIAL ROBOTICS, ICSR 2019, 2019, 11876 : 108 - 118
  • [34] Skew-Robust Human-Object Interactions in Videos
    Agarwal, Apoorva
    Dabral, Rishabh
    Jain, Arjun
    Ramakrishnan, Ganesh
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5087 - 5096
  • [35] HICO: A Benchmark for Recognizing Human-Object Interactions in Images
    Chao, Yu-Wei
    Wang, Zhan
    He, Yugeng
    Wang, Jiaxuan
    Deng, Jia
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1017 - 1025
  • [36] Spatially Conditioned Graphs for Detecting Human-Object Interactions
    Zhang, Frederic Z.
    Campbell, Dylan
    Gould, Stephen
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13299 - 13307
  • [37] Detecting Human-Object Interactions via Functional Generalization
    Bansal, Ankan
    Rambhatla, Sai Saketh
    Shrivastava, Abhinav
    Chellappa, Rama
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10460 - 10469
  • [38] Spatially Conditioned Graphs for Detecting Human-Object Interactions
    Zhang, Frederic Z.
    Campbell, Dylan
    Gould, Stephen
    Proceedings of the IEEE International Conference on Computer Vision, 2021, : 13299 - 13307
  • [39] Human-Object Interactions Are More than the Sum of Their Parts
    Baldassano, Christopher
    Beck, Diane M.
    Fei-Fei, Li
    CEREBRAL CORTEX, 2017, 27 (03) : 2276 - 2288
  • [40] Acoustic Signature Recognition Technique for Human-Object Interactions (HOI) in Persistent Surveillance Systems
    Alkilani, Amjad
    Shirkhodaie, Amir
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII, 2013, 8745