FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

被引:8
|
作者
Chen, Yilun [1 ]
Yu, Zhiding [3 ]
Chen, Yukang [1 ]
Lan, Shiyi [3 ]
Anandkumar, Anima [2 ,3 ]
Jia, Jiaya [1 ]
Alvarez, Jose M.
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] CALTECH, Pasadena, CA USA
[3] NVIDIA, Santa Clara, CA USA
关键词
D O I
10.1109/ICCV51070.2023.00771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
False negatives (FN) in 3D object detection, e.g., missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies FN in a multi- stage manner and guides the models to focus on excavating difficult instances. For 3D object detection, we instantiate this method as FocalFormer3D, a simple yet effective detector that excels at excavating difficult objects and improving prediction recall. FocalFormer3D features a multi-stage query generation to discover hard objects and a box-level transformer decoder to efficiently distinguish objects from massive object candidates. Experimental results on the nuScenes and Waymo datasets validate the superior performance of FocalFormer3D. The advantage leads to strong performance on both detection and tracking, in both LiDAR and multi-modal settings. Notably, FocalFormer3D achieves a 70.5 mAP and 73.9 NDS on nuScenes detection benchmark, while the nuScenes tracking benchmark shows 72.1 AMOTA, both ranking 1st place on the nuScenes LiDAR leaderboard. Our code is available at https: //github.com/NVlabs/FocalFormer3D.
引用
下载
收藏
页码:8360 / 8371
页数:12
相关论文
共 50 条
  • [41] SGM3D: Stereo Guided Monocular 3D Object Detection
    Zhou, Zheyuan
    Du, Liang
    Ye, Xiaoqing
    Zou, Zhikang
    Tan, Xiao
    Zhang, Li
    Xue, Xiangyang
    Feng, Jianfeng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10478 - 10485
  • [42] KPP3D:Key Point Painting for 3D Object Detection
    Wang, Mingming
    Chen, Qingkui
    Fu, Zhibing
    Computer Engineering and Applications, 2023, 59 (17) : 195 - 204
  • [43] RoadSense3D: A Framework for Roadside Monocular 3D Object Detection
    Carta, Salvatore
    Castrillon-Santana, Modesto
    Marras, Mirko
    Mohamed, Sondos
    Podda, Alessandro Sebastian
    Saia, Roberto
    Sau, Marco
    Zimmer, Walter
    ADJUNCT PROCEEDINGS OF THE 32ND ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2024, 2024, : 452 - 459
  • [44] CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited Annotations
    Xia, Qiming
    Deng, Jinhao
    Wen, Chenglu
    Wu, Hai
    Shi, Shaoshuai
    Li, Xin
    Wang, Cheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6231 - 6240
  • [45] GPro3D: Deriving 3D BBox from ground plane in monocular 3D object detection
    Yang, Fan
    Xu, Xinhao
    Chen, Hui
    Guo, Yuchen
    He, Yuwei
    Ni, Kai
    Ding, Guiguang
    NEUROCOMPUTING, 2023, 562
  • [46] IoU Loss for 2D/3D Object Detection
    Zhou, Dingfu
    Fang, Jin
    Song, Xibin
    Guan, Chenye
    Yin, Junbo
    Dai, Yuchao
    Yang, Ruigang
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 85 - 94
  • [47] Enhance the 3D Object Detection With 2D Prior
    Liu, Cheng
    IEEE ACCESS, 2024, 12 : 67161 - 67169
  • [48] 2D Instance-Guided Pseudo-LiDAR Point Cloud for Monocular 3D Object Detection
    Gao, Rui
    Kim, Junoh
    Cho, Kyungeun
    IEEE Access, 2024, 12 : 187813 - 187827
  • [49] 3D OBJECT RETRIEVAL BY 3D CURVE MATCHING
    Feinen, Christian
    Czajkowska, Joanna
    Grzegorzek, Marcin
    Latecki, Longin Jan
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2749 - 2753
  • [50] 3D-RCNN: Instance-level 3D Object Reconstruction via Render-and-Compare
    Kundu, Abhijit
    Li, Yin
    Rehg, James M.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3559 - 3568