Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection

被引:13
|
作者
Wu, Xiaoqian [1 ]
Li, Yong-Lu [1 ,2 ]
Liu, Xinpeng [1 ]
Zhang, Junyi [1 ]
Wu, Yuzhe [3 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[3] DongHua Univ, Shanghai, Peoples R China
来源
基金
国家重点研发计划;
关键词
Human-object interaction; Interactiveness learning; Body-part correlations;
D O I
10.1007/978-3-031-19772-7_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-Object Interaction (HOI) detection plays a crucial role in activity understanding. Though significant progress has been made, interactiveness learning remains a challenging problem in HOI detection: existing methods usually generate redundant negative H-O pair proposals and fail to effectively extract interactive pairs. Though interactiveness has been studied in both whole body- and part- level and facilitates the H-O pairing, previous works only focus on the target person once (i.e., in a local perspective) and overlook the information of the other persons. In this paper, we argue that comparing body-parts of multi-person simultaneously can afford us more useful and supplementary interactiveness cues. That said, to learn body-part interactiveness from a global perspective: when classifying a target person's body-part interactiveness, visual cues are explored not only from herself/himself but also from other persons in the image. We construct body-part saliency maps based on self-attention to mine cross-person informative cues and learn the holistic relationships between all the body-parts. We evaluate the proposed method on widely-used benchmarks HICO-DET and VCOCO. With our new perspective, the holistic global-local body-part interactiveness learning achieves significant improvements over state-of-the-art. Our code is available at https://github.com/enlighten0707/ Body-Part-Map-for-Interactiveness.
引用
收藏
页码:121 / 136
页数:16
相关论文
共 31 条
  • [21] Body Part Detection in Smoky Environments with Thermal Camera Using Deep Learning
    Gelfert, Sebastian
    [J]. 2022 22ND INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2022), 2022, : 1508 - 1514
  • [22] Improving Human Body Part Detection using Deep Learning and Motion Consistency
    Ramanathan, Manoj
    Wei-Yun, Yau
    Khwang, Earn Teoh
    [J]. 2016 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2016,
  • [23] Body Part Detection from Neonatal Thermal Images Using Deep Learning
    Beppu, Fumika
    Yoshikawa, Hiroki
    Uchiyama, Akira
    Higashino, Teruo
    Hamada, Keisuke
    Hirakawa, Eiji
    [J]. MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, 2022, 419 : 438 - 450
  • [24] Cross-dataset learning and person-specific normalisation for automatic Action Unit detection
    Baltrusaitis, Tadas
    Mahmoud, Marwa
    Robinson, Peter
    [J]. 2015 11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), VOL. 6, 2015,
  • [25] Holistic-Guided Disentangled Learning With Cross-Video Semantics Mining for Concurrent First-Person and Third-Person Activity Recognition
    Liu, Tianshan
    Zhao, Rui
    Jia, Wenqi
    Lam, Kin-Man
    Kong, Jun
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5211 - 5225
  • [26] Person re-identification based on multi-level feature complementarity of cross-attention with part metric learning
    Zeng Lu
    Guoheng Huang
    Chi-Man Pun
    Lianglun Cheng
    [J]. Multimedia Tools and Applications, 2020, 79 : 21409 - 21439
  • [27] Person re-identification based on multi-level feature complementarity of cross-attention with part metric learning
    Lu, Zeng
    Huang, Guoheng
    Pun, Chi-Man
    Cheng, Lianglun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (29-30) : 21409 - 21439
  • [28] Modal Invariance Feature Learning and Consistent Fine-Grained Information Mining Based Cross-Modal Person Re-identification
    Shi, Linbo
    Li, Huafeng
    Zhang, Yafei
    Xie, Minghong
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (12): : 1064 - 1077
  • [29] Detection of sarcopenia using deep learning-based artificial intelligence body part measure system (AIBMS)
    Gu, Shangzhi
    Wang, Lixue
    Han, Rong
    Liu, Xiaohong
    Wang, Yizhe
    Chen, Ting
    Zheng, Zhuozhao
    [J]. FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [30] CO-DETECTOR: TOWARDS COMPLEX OBJECT DETECTION WITH CROSS-PART FEATURE LEARNING IN REMOTE SENSING
    Yuan, Shuai
    Zheng, Juepeng
    Huang, Yanlong
    Liu, Jierui
    Fu, Haohuan
    Cheung, Ray C. C.
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 1941 - 1944