Turbo Learning Framework for Human-Object Interactions Recognition and Human Pose Estimation

被引:0
|
作者
Feng, Wei [1 ]
Liu, Wentao [1 ,2 ]
Li, Tong [1 ]
Peng, Jing [1 ]
Qian, Chen [1 ]
Hu, Xiaolin [2 ]
机构
[1] SenseTime Grp Ltd, Hong Kong, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-object interactions (HOI) recognition and pose estimation are two closely related tasks. Human pose is an essential cue for recognizing actions and localizing the interacted objects. Meanwhile, human action and their interacted objects' localizations provide guidance for pose estimation. In this paper, we propose a turbo learning framework to perform HOT recognition and pose estimation simultaneously. First, two modules are designed to enforce message passing between the tasks, i.e. pose aware HOT recognition module and HOT guided pose estimation module. Then, these two modules form a closed loop to utilize the complementary information iteratively, which can be trained in an end-to-end manner. The proposed method achieves the state-of-the-art performance on two public benchmarks including Verbs in COCO (V-COCO) and HICO-DET datasets.
引用
收藏
页码:898 / 905
页数:8
相关论文
共 50 条
  • [1] Learning to Detect Human-Object Interactions
    Chao, Yu-Wei
    Liu, Yunfan
    Liu, Xieyang
    Zeng, Huayi
    Deng, Jia
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 381 - 389
  • [2] Learning Human-Object Interactions by Attention Aggregation
    Gu, Dongzhou
    Cai, Shuang
    Ma, Shiwei
    [J]. SIXTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2021, 11913
  • [3] Learning to Detect Human-Object Interactions with Knowledge
    Xu, Bingjie
    Wong, Yongkang
    Li, Junnan
    Zhao, Qi
    Kankanhalli, Mohan S.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2019 - 2028
  • [4] Language for Learning Complex Human-Object Interactions
    Patel, Mitesh
    Ek, Carl Henrik
    Kyriazis, Nikolaos
    Argyros, Antonis
    Miro, Jaime Valls
    Kragic, Danica
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4997 - 5002
  • [5] An Intelligent Framework for Recognizing Social Human-Object Interactions
    Alarfaj, Mohammed
    Waheed, Manahil
    Ghadi, Yazeed Yasin
    al Shloul, Tamara
    Alsuhibany, Suliman A.
    Jalal, Ahmad
    Park, Jeongmin
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1207 - 1223
  • [6] Exemplar-Based Recognition of Human-Object Interactions
    Hu, Jian-Fang
    Zheng, Wei-Shi
    Lai, Jianhuang
    Gong, Shaogang
    Xiang, Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (04) : 647 - 660
  • [7] Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities
    Yao, Bangpeng
    Li Fei-Fei
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 17 - 24
  • [8] Modeling 4D Human-Object Interactions for Event and Object Recognition
    Wei, Ping
    Zhao, Yibiao
    Zheng, Nanning
    Zhu, Song-Chun
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3272 - 3279
  • [9] Cascaded Human-Object Interaction Recognition
    Zhou, Tianfei
    Wang, Wenguan
    Qi, Siyuan
    Ling, Haibin
    Shen, Jianbing
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4262 - 4271
  • [10] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423