PPDM++: Parallel Point Detection and Matching for Fast and Accurate HOI Detection

Cited by: 3
Authors
Liao, Yue [1]
Liu, Si [1]
Gao, Yulu [1]
Zhang, Aixi [1]
Li, Zhimin [2]
Wang, Fei [3]
Li, Bo [1]
Affiliations
[1] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[3] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
Proposals; Feature extraction; Task analysis; Detectors; Real-time systems; Matched filters; Bicycles; Human-object interaction detection; visual relationship detection; one-stage detector; dataset;
DOI
10.1109/TPAMI.2024.3386891
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-driven paradigm. Unfortunately, the many non-interactive human-object pairs generated by the first stage are the main obstacle keeping HOI detectors from high efficiency and promising performance. To remedy this, we propose a novel top-down interaction-driven paradigm, detecting interactions first and bridging interactive human-object pairs through interactions. We formulate HOI as a point triplet <human point, interaction point, object point> and design a Parallel Point Detection and Matching (PPDM) framework. We further take advantage of two-stage methods and propose a novel framework, PPDM++, that detects the interactive human-object pairs by PPDM, then extracts region features for each pair to predict actions. The core of PPDM/PPDM++ is to convert the instance-driven bottom-up paradigm to an interaction-driven top-down paradigm, thus avoiding additional computation costs from traversing a tremendous number of non-interactive pairs. Benefiting from the advanced paradigm, PPDM/PPDM++ has achieved significant performance gains with high efficiency. PPDM-DLA-34 has achieved 19.94 mAP with 42 FPS as the first real-time HOI detector, and PPDM++-SwinB achieves 30.1 mAP with 17 FPS on the HICO-DET dataset. We also built an application-oriented database named HOI-A, a supplement to the existing datasets.
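The point-triplet formulation in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the matching rule, so everything below is an assumption made for illustration: each interaction point is assumed to carry two predicted offsets (one toward its human point, one toward its object point), and matching is assumed to pick the detected point nearest to each offset target. The names `nearest` and `match_triplets` are hypothetical and are not taken from the paper or its code.

```python
import math

def nearest(target, candidates):
    """Return the index of the candidate point closest to target."""
    return min(range(len(candidates)),
               key=lambda i: math.dist(target, candidates[i]))

def match_triplets(interactions, humans, objects):
    """Pair each interaction point with one human point and one object point.

    interactions: list of (point, offset_to_human, offset_to_object),
                  each element an (x, y) tuple (assumed representation).
    humans, objects: lists of detected (x, y) center points.
    Returns a list of (human_index, interaction_point, object_index).
    """
    triplets = []
    for (ix, iy), (hdx, hdy), (odx, ody) in interactions:
        # Assumed rule: follow each offset, then snap to the nearest detection.
        h = nearest((ix + hdx, iy + hdy), humans)
        o = nearest((ix + odx, iy + ody), objects)
        triplets.append((h, (ix, iy), o))
    return triplets

humans = [(10, 10), (100, 100)]
objects = [(30, 10), (120, 100)]
interactions = [
    ((20, 10), (-9, 0), (11, 1)),
    ((110, 100), (-10, 0), (9, 0)),
]
triplets = match_triplets(interactions, humans, objects)
```

In this sketch the loop runs once per interaction point rather than once per human-object pair, which mirrors the abstract's point about not traversing non-interactive pairs.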
Pages: 6826-6841
Page count: 16