Modeling 4D Human-Object Interactions for Event and Object Recognition

被引:66
|
作者
Wei, Ping [1 ,2 ]
Zhao, Yibiao [2 ]
Zheng, Nanning [1 ]
Zhu, Song-Chun [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Univ Calif Los Angeles, Los Angeles, CA USA
关键词
D O I
10.1109/ICCV.2013.406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing the events and objects in the video sequence are two challenging tasks due to the complex temporal structures and the large appearance variations. In this paper, we propose a 4D human-object interaction model, where the two tasks jointly boost each other. Our human-object interaction is defined in 4D space: i) the co-occurrence and geometric constraints of human pose and object in 3D space; ii) the sub-events transition and objects coherence in 1D temporal dimension. We represent the structure of events, sub-events and objects in a hierarchical graph. For an input RGB-depth video, we design a dynamic programming beam search algorithm to: i) segment the video, ii) recognize the events, and iii) detect the objects simultaneously. For evaluation, we built a large-scale multiview 3D event dataset which contains 3815 video sequences and 383,036 RGBD frames captured by the Kinect cameras. The experiment results on this dataset show the effectiveness of our method.
引用
收藏
页码:3272 / 3279
页数:8
相关论文
共 50 条
  • [21] Human-Object Maps for Daily Activity Recognition
    Ishikawa, Haruya
    Ishikawa, Yuchi
    Akizuki, Shuichi
    Aoki, Yoshimitsu
    PROCEEDINGS OF MVA 2019 16TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2019,
  • [22] Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities
    Yao, Bangpeng
    Li Fei-Fei
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 17 - 24
  • [23] Learning to Detect Human-Object Interactions with Knowledge
    Xu, Bingjie
    Wong, Yongkang
    Li, Junnan
    Zhao, Qi
    Kankanhalli, Mohan S.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2019 - 2028
  • [24] Language for Learning Complex Human-Object Interactions
    Patel, Mitesh
    Ek, Carl Henrik
    Kyriazis, Nikolaos
    Argyros, Antonis
    Miro, Jaime Valls
    Kragic, Danica
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4997 - 5002
  • [25] Human-Object Interaction Recognition by Learning the distances between the Object and the Skeleton Joints
    Meng, Meng
    Drira, Hassen
    Daoudi, Mohamed
    Boonaert, Jacques
    2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 7, 2015,
  • [26] Action Prediction Based on Physically Grounded Object Affordances in Human-Object Interactions
    Dutta, Vibekananda
    Zielinska, Teresa
    2017 11TH INTERNATIONAL WORKSHOP ON ROBOT MOTION AND CONTROL (ROMOCO), 2017, : 41 - 46
  • [27] NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions
    Zhang, Juze
    Luo, Haimin
    Yang, Hongdi
    Xu, Xinru
    Wu, Qianyang
    Shi, Ye
    Yu, Jingyi
    Xu, Lan
    Wang, Jingya
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8834 - 8845
  • [28] Recognizing Human-Object Interactions via Target Localization
    Cho, Sunyoung
    Park, Jihun
    Shin, Young Sook
    Lee, Sang-ho
    2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 836 - 840
  • [29] Distillation of human-object interaction contexts for action recognition
    Almushyti, Muna
    Li, Frederick W. B.
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (05)
  • [30] Predicting the Location of "interactees" in Novel Human-Object Interactions
    Chen, Chao-Yeh
    Grauman, Kristen
    COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 351 - 367