FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

被引:8
|
作者
Chen, Yilun [1 ]
Yu, Zhiding [3 ]
Chen, Yukang [1 ]
Lan, Shiyi [3 ]
Anandkumar, Anima [2 ,3 ]
Jia, Jiaya [1 ]
Alvarez, Jose M.
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] CALTECH, Pasadena, CA USA
[3] NVIDIA, Santa Clara, CA USA
关键词
D O I
10.1109/ICCV51070.2023.00771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
False negatives (FN) in 3D object detection, e.g., missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies FN in a multi- stage manner and guides the models to focus on excavating difficult instances. For 3D object detection, we instantiate this method as FocalFormer3D, a simple yet effective detector that excels at excavating difficult objects and improving prediction recall. FocalFormer3D features a multi-stage query generation to discover hard objects and a box-level transformer decoder to efficiently distinguish objects from massive object candidates. Experimental results on the nuScenes and Waymo datasets validate the superior performance of FocalFormer3D. The advantage leads to strong performance on both detection and tracking, in both LiDAR and multi-modal settings. Notably, FocalFormer3D achieves a 70.5 mAP and 73.9 NDS on nuScenes detection benchmark, while the nuScenes tracking benchmark shows 72.1 AMOTA, both ranking 1st place on the nuScenes LiDAR leaderboard. Our code is available at https: //github.com/NVlabs/FocalFormer3D.
引用
下载
收藏
页码:8360 / 8371
页数:12
相关论文
共 50 条
  • [31] 3D Object Class Detection in the Wild
    Pepik, Bojan
    Stark, Michael
    Gehler, Peter
    Ritschel, Tobias
    Schiele, Bernt
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [32] A Heterogeneous Approach for 3D Object Detection
    Lü Z.
    Yao Z.
    Jia Y.
    Bao Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (12): : 2748 - 2759
  • [33] Improved Sliding Shapes for Instance Segmentation of Amodal 3D Object
    Lin, Jinhua
    Yao, Yu
    Wang, Yanjie
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (11): : 5555 - 5567
  • [34] Faster 3D Object Detection in RGB-D Image Using 3D Selective Search and Object Pruning
    Liu, Jiang
    Chen, Hongliang
    Li, Jianxun
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 4862 - 4866
  • [35] 3D sketching for 3D object retrieval
    Li, Bo
    Yuan, Juefei
    Ye, Yuxiang
    Lu, Yijuan
    Zhang, Chaoyang
    Tian, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (06) : 9569 - 9595
  • [36] 3D ResNets for 3D Object Classification
    Ioannidou, Anastasia
    Chatzilari, Elisavet
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 495 - 506
  • [37] 3D sketching for 3D object retrieval
    Bo Li
    Juefei Yuan
    Yuxiang Ye
    Yijuan Lu
    Chaoyang Zhang
    Qi Tian
    Multimedia Tools and Applications, 2021, 80 : 9569 - 9595
  • [38] 3D Object Proposals for Accurate Object Class Detection
    Chen, Xiaozhi
    Kundu, Kaustav
    Zhu, Yukun
    Berneshawi, Andrew
    Ma, Huimin
    Fidler, Sanja
    Urtasun, Raquel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [39] Reinforcing LiDAR-Based 3D Object Detection with RGB and 3D Information
    Liu, Wenjian
    Zhou, Yue
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 : 199 - 209
  • [40] MonoSample: Synthetic 3D Data Augmentation Method in Monocular 3D Object Detection
    Qiao, Junchao
    Liu, Biao
    Yang, Jiaqi
    Wang, Baohua
    Xiu, Sanmu
    Du, Xin
    Nie, Xiaobo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (08): : 7326 - 7332