PPDM plus plus : Parallel Point Detection and Matching for Fast and Accurate HOI Detection

被引:3
|
作者
Liao, Yue [1 ]
Liu, Si [1 ]
Gao, Yulu [1 ]
Zhang, Aixi [1 ]
Li, Zhimin [2 ]
Wang, Fei [3 ]
Li, Bo [1 ]
机构
[1] Beihang Univ, Inst Artificial Intelligence, Beijing 100191, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[3] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230052, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Proposals; Feature extraction; Task analysis; Detectors; Real-time systems; Matched filters; Bicycles; Human-object interaction detection; visual relationship detection; one-stage detector; dataset;
D O I
10.1109/TPAMI.2024.3386891
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-Object Interaction (HOI) detection aims to understand human activities by detecting interaction triplets. Previous HOI detection methods adopt a two-stage instance-driven paradigm. Unfortunately, many non-interactive human-object pairs generated by the first stage are the main obstacle impeding HOI detectors from high efficiency and promising performance. To remedy this, we propose a novel top-down interaction-driven paradigm, detecting interactions first and bridging interactive human-object pairs through interactions. We formulate HOI as a point triplet $< $<human point, interaction point, object point$> $> and design a Parallel Point Detection and Matching (PPDM) framework. We further take advantage of two-stage methods and propose a novel framework, PPDM++, that detects the interactive human-object pairs by PPDM, then extracts region features for each pair to predict actions. The core of PPDM/PPDM++ is to convert the instance-driven bottom-up paradigm to an interaction-driven top-down paradigm, thus avoiding additional computation costs from traversing a tremendous number of non-interactive pairs. Benefiting from the advanced paradigm, PPDM/PPDM++ has achieved significant performance gains with high efficiency. PPDM-DLA-34 has achieved 19.94 mAP with 42 FPS as the first real-time HOI detector, and PPDM++-SwinB achieves 30.1 mAP with 17 FPS on HICO-DET dataset. We also built an application-oriented database named HOI-A, a supplement to the existing datasets.
引用
收藏
页码:6826 / 6841
页数:16
相关论文
共 50 条
  • [1] PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection
    Liao, Yue
    Liu, Si
    Wang, Fei
    Chen, Yanjie
    Qian, Chen
    Feng, Jiashi
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 479 - 487
  • [2] Lite-HRNet Plus: Fast and Accurate Facial Landmark Detection
    Kato, Sota
    Hotta, Kazuhiro
    Hatakeyama, Yuhki
    Konishi, Yoshinori
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1500 - 1504
  • [3] RSDet plus plus : Point-Based Modulated Loss for More Accurate Rotated Object Detection
    Qian, Wen
    Yang, Xue
    Peng, Silong
    Zhang, Xiujuan
    Yan, Junchi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7869 - 7879
  • [4] Local Nontermination Detection for Parallel C plus plus Programs
    Still, Vladimir
    Barnat, Jiri
    SOFTWARE ENGINEERING AND FORMAL METHODS (SEFM 2019), 2019, 11724 : 373 - 390
  • [5] GraphAlign plus plus : An Accurate Feature Alignment by Graph Matching for Multi-Modal 3D Object Detection
    Song, Ziying
    Jia, Caiyan
    Yang, Lei
    Wei, Haiyue
    Liu, Lin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (04) : 2619 - 2632
  • [6] ROM Plus (R): accurate point-of-care detection of ruptured fetal membranes
    McQuivey, Ross W.
    Block, Jon E.
    MEDICAL DEVICES-EVIDENCE AND RESEARCH, 2016, 9 : 69 - 74
  • [7] AtelierM plus plus : a fast and accurate marbling system
    Zhao, Hanli
    Jin, Xiaogang
    Lu, Shufang
    Mao, Xiaoyang
    Shen, Jianbing
    MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 44 (02) : 187 - 203
  • [8] RangeNet plus plus : Fast and Accurate LiDAR Semantic Segmentation
    Milioto, Andres
    Vizzo, Ignacio
    Chley, Jens
    Stachniss, Cyrill
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4213 - 4220
  • [9] MAGSAC plus plus , a fast, reliable and accurate robust estimator
    Barath, Daniel
    Noskova, Jana
    Ivashechkin, Maksym
    Matas, Jiri
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1301 - 1309
  • [10] Fast and Accurate Vanishing Point Detection in Complex Scenes
    Yang, Weibin
    Luo, Xiaosong
    Fang, Bin
    Zhang, Daiming
    Tang, Yuan Yan
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 93 - 98