Modeling 4D Human-Object Interactions for Event and Object Recognition

被引:66
|
作者
Wei, Ping [1 ,2 ]
Zhao, Yibiao [2 ]
Zheng, Nanning [1 ]
Zhu, Song-Chun [2 ]
机构
[1] Xi An Jiao Tong Univ, Xian, Peoples R China
[2] Univ Calif Los Angeles, Los Angeles, CA USA
关键词
D O I
10.1109/ICCV.2013.406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing the events and objects in the video sequence are two challenging tasks due to the complex temporal structures and the large appearance variations. In this paper, we propose a 4D human-object interaction model, where the two tasks jointly boost each other. Our human-object interaction is defined in 4D space: i) the co-occurrence and geometric constraints of human pose and object in 3D space; ii) the sub-events transition and objects coherence in 1D temporal dimension. We represent the structure of events, sub-events and objects in a hierarchical graph. For an input RGB-depth video, we design a dynamic programming beam search algorithm to: i) segment the video, ii) recognize the events, and iii) detect the objects simultaneously. For evaluation, we built a large-scale multiview 3D event dataset which contains 3815 video sequences and 383,036 RGBD frames captured by the Kinect cameras. The experiment results on this dataset show the effectiveness of our method.
引用
收藏
页码:3272 / 3279
页数:8
相关论文
共 50 条
  • [41] Dangerous Human Event Understanding using Human-Object Interaction Model
    Xu, Zhaozhuo
    Tian, Yuan
    Hu, Xinjue
    Pu, Fangling
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2015, : 150 - 154
  • [42] Semantic Recognition of Human-Object Interactions via Gaussian-Based Elliptical Modeling and Pixel-Level Labeling
    Khalid, Nida
    Ghadi, Yazeed Yasin
    Gochoo, Munkhjargal
    Jalal, Ahmad
    Kim, Kibum
    IEEE Access, 2021, 9 : 111249 - 111266
  • [43] HOIMotion: Forecasting Human Motion during Human-Object Interactions Using Egocentric 3D Object Bounding Boxes
    Hu, Zhiming
    Yin, Zheming
    Haeufle, Daniel
    Schmitt, Syn
    Bulling, Andreas
    IEEE Transactions on Visualization and Computer Graphics, 2024, 30 (11) : 7375 - 7385
  • [44] Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses
    Yao, Bangpeng
    Fei-Fei, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1691 - 1703
  • [45] Semantic Recognition of Human-Object Interactions via Gaussian-Based Elliptical Modeling and Pixel-Level Labeling
    Khalid, Nida
    Ghadi, Yazeed Yasin
    Gochoo, Munkhjargal
    Jalal, Ahmad
    Kim, Kibum
    IEEE ACCESS, 2021, 9 : 111249 - 111266
  • [46] Exploring Predicate Visual Context in Detecting of Human-Object Interactions
    Zhang, Frederic Z.
    Yuan, Yuhui
    Campbell, Dylan
    Zhong, Zhuoyao
    Gould, Stephen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10377 - 10387
  • [47] HUMAN-OBJECT RELATION NETWORK FOR ACTION RECOGNITION IN STILL IMAGES
    Ma, Wentao
    Liang, Shuang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [48] Human-object Interaction Recognition Using Multitask Neural Network
    Yan, Weihao
    Gao, Yue
    Liu, Qiming
    2019 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS 2019), 2019, : 323 - 328
  • [49] THORN: Temporal Human-Object Relation Network for Action Recognition
    Guermal, Mohammed
    Dai, Rui
    Bremond, Francois
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3303 - 3309
  • [50] Recognizing Human-Object Interactions Using Sparse Subspace Clustering
    Bogun, Ivan
    Ribeiro, Eraldo
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PT I, 2013, 8047 : 409 - 416