Turbo Learning Framework for Human-Object Interactions Recognition and Human Pose Estimation

被引:0
|
作者
Feng, Wei [1 ]
Liu, Wentao [1 ,2 ]
Li, Tong [1 ]
Peng, Jing [1 ]
Qian, Chen [1 ]
Hu, Xiaolin [2 ]
机构
[1] SenseTime Grp Ltd, Hong Kong, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-object interactions (HOI) recognition and pose estimation are two closely related tasks. Human pose is an essential cue for recognizing actions and localizing the interacted objects. Meanwhile, human action and their interacted objects' localizations provide guidance for pose estimation. In this paper, we propose a turbo learning framework to perform HOT recognition and pose estimation simultaneously. First, two modules are designed to enforce message passing between the tasks, i.e. pose aware HOT recognition module and HOT guided pose estimation module. Then, these two modules form a closed loop to utilize the complementary information iteratively, which can be trained in an end-to-end manner. The proposed method achieves the state-of-the-art performance on two public benchmarks including Verbs in COCO (V-COCO) and HICO-DET datasets.
引用
收藏
页码:898 / 905
页数:8
相关论文
共 50 条
  • [31] Acoustic Signature Recognition Technique for Human-Object Interactions (HOI) in Persistent Surveillance Systems
    Alkilani, Amjad
    Shirkhodaie, Amir
    [J]. SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII, 2013, 8745
  • [32] Recognizing Human-Object Interactions via Target Localization
    Cho, Sunyoung
    Park, Jihun
    Shin, Young Sook
    Lee, Sang-ho
    [J]. 2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 836 - 840
  • [33] Explicit Modeling of Human-Object Interactions in Realistic Videos
    Prest, Alessandro
    Ferrari, Vittorio
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (04) : 835 - 848
  • [34] InteractGAN: Learning to Generate Human-Object Interaction
    Gao, Chen
    Liu, Si
    Zhu, Defa
    Liu, Quan
    Cao, Jie
    He, Haoqian
    He, Ran
    Yan, Shuicheng
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 165 - 173
  • [35] Lifelong Learning for Human-Object Interaction Detection
    Sun, Bo
    Lu, Sixu
    He, Jun
    Yu, Lejun
    [J]. 2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 582 - 587
  • [36] Distillation of human-object interaction contexts for action recognition
    Almushyti, Muna
    Li, Frederick W. B.
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2022, 33 (05)
  • [37] Deep learning and RGB-D based human action, human-human and human-object interaction recognition: A survey?
    Khaire, Pushpajit
    Kumar, Praveen
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 86
  • [38] Predicting the Location of "interactees" in Novel Human-Object Interactions
    Chen, Chao-Yeh
    Grauman, Kristen
    [J]. COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 351 - 367
  • [39] Detecting Subtle Human-Object Interactions Using Kinect
    Ubalde, Sebastian
    Liu, Zicheng
    Mejail, Marta
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 770 - 777
  • [40] Skew-Robust Human-Object Interactions in Videos
    Agarwal, Apoorva
    Dabral, Rishabh
    Jain, Arjun
    Ramakrishnan, Ganesh
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5087 - 5096