FocalFormer3D: Focusing on Hard Instance for 3D Object Detection

被引:8
|
作者
Chen, Yilun [1 ]
Yu, Zhiding [3 ]
Chen, Yukang [1 ]
Lan, Shiyi [3 ]
Anandkumar, Anima [2 ,3 ]
Jia, Jiaya [1 ]
Alvarez, Jose M.
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] CALTECH, Pasadena, CA USA
[3] NVIDIA, Santa Clara, CA USA
关键词
D O I
10.1109/ICCV51070.2023.00771
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
False negatives (FN) in 3D object detection, e.g., missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies FN in a multi- stage manner and guides the models to focus on excavating difficult instances. For 3D object detection, we instantiate this method as FocalFormer3D, a simple yet effective detector that excels at excavating difficult objects and improving prediction recall. FocalFormer3D features a multi-stage query generation to discover hard objects and a box-level transformer decoder to efficiently distinguish objects from massive object candidates. Experimental results on the nuScenes and Waymo datasets validate the superior performance of FocalFormer3D. The advantage leads to strong performance on both detection and tracking, in both LiDAR and multi-modal settings. Notably, FocalFormer3D achieves a 70.5 mAP and 73.9 NDS on nuScenes detection benchmark, while the nuScenes tracking benchmark shows 72.1 AMOTA, both ranking 1st place on the nuScenes LiDAR leaderboard. Our code is available at https: //github.com/NVlabs/FocalFormer3D.
引用
下载
收藏
页码:8360 / 8371
页数:12
相关论文
共 50 条
  • [1] 3D Object Detection and Instance Segmentation from 3D Range and 2D Color Images
    Shen, Xiaoke
    Stamos, Ioannis
    SENSORS, 2021, 21 (04) : 1 - 29
  • [2] DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection
    Peng, Liang
    Wu, Xiaopei
    Yang, Zheng
    Liu, Haifeng
    Cai, Deng
    COMPUTER VISION - ECCV 2022, PT I, 2022, 13661 : 71 - 88
  • [3] 3D Object Detection Incorporating Instance Segmentation and Image Restoration
    HUANG Bo
    HUANG Man
    GAO Yongbin
    YU Yuxin
    JIANG Xiaoyan
    ZHANG Juan
    Wuhan University Journal of Natural Sciences, 2019, 24 (04) : 360 - 368
  • [4] Joint 3D Instance Segmentation and Object Detection for Autonomous Driving
    Zhou, Dingfu
    Fang, Jin
    Song, Xibin
    Liu, Liu
    Yin, Junbo
    Dai, Yuchao
    Li, Hongdong
    Yang, Ruigang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1836 - 1846
  • [5] Monocular 3D object detection with thermodynamic loss and decoupled instance depth
    Liu, Gang
    Xie, Xiaoxiao
    Yu, Qingchen
    CONNECTION SCIENCE, 2024, 36 (01)
  • [6] Shape Prior Guided Instance Disparity Estimation for 3D Object Detection
    Chen, Linghao
    Sun, Jiaming
    Xie, Yiming
    Zhang, Siyu
    Shuai, Qing
    Jiang, Qinhong
    Zhang, Guofeng
    Bao, Hujun
    Zhou, Xiaowei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5529 - 5540
  • [7] A robust 3D unique descriptor for 3D object detection
    Joshi, Piyush
    Rastegarpanah, Alireza
    Stolkin, Rustam
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (03)
  • [8] 3D Object Detection with Pointformer
    Pan, Xuran
    Xia, Zhuofan
    Song, Shiji
    Li, Li Erran
    Huang, Gao
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7459 - 7468
  • [9] A survey of 3D object detection
    Wei Liang
    Pengfei Xu
    Ling Guo
    Heng Bai
    Yang Zhou
    Feng Chen
    Multimedia Tools and Applications, 2021, 80 : 29617 - 29641
  • [10] A survey of 3D object detection
    Liang, Wei
    Xu, Pengfei
    Guo, Ling
    Bai, Heng
    Zhou, Yang
    Chen, Feng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (19) : 29617 - 29641