Joint 3D Proposal Generation and Object Detection from View Aggregation

被引:0
|
作者
Ku, Jason [1 ]
Mozifian, Melissa [1 ]
Lee, Jungwook [1 ]
Harakeh, Ali [1 ]
Waslander, Steven L. [1 ]
机构
[1] Univ Waterloo, Dept Mech & Mechatron Engn, Fac Engn, 200 Univ Ave, Waterloo, ON, Canada
来源
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high resolution feature maps to generate reliable 3D object proposals for multiple object classes in road scenes. Using these proposals, the second stage detection network performs accurate oriented 3D bounding box regression and category classification to predict the extents, orientation, and classification of objects in 3D space. Our proposed architecture is shown to produce state of the art results on the KITTI 3D object detection benchmark [I] while running in real time with a low memory footprint, making it a suitable candidate for deployment on autonomous vehicles. Code is available at: https : / /github com/kujason/avod
引用
收藏
页码:5750 / 5757
页数:8
相关论文
共 50 条
  • [1] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
    Shi, Shaoshuai
    Wang, Xiaogang
    Li, Hongsheng
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 770 - 779
  • [2] 3D Object Detection Based on Proposal Generation Network Utilizing Monocular Images
    ul Haq, Qazi Mazhar
    Haq, Muhamad Amirul
    Ruan, Shanq-Jang
    Liang, Pei-Jung
    Gao, De-Qin
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2022, 11 (05) : 47 - 53
  • [3] Structure Guided Proposal Completion for 3D Object Detection
    Shi, Chao
    Zhang, Chongyang
    Luo, Yan
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 504 - 520
  • [4] ARPNET: attention region proposal network for 3D object detection
    Yangyang Ye
    Chi Zhang
    Xiaoli Hao
    Science China Information Sciences, 2019, 62
  • [5] ARPNET: attention region proposal network for 3D object detection
    Yangyang YE
    Chi ZHANG
    Xiaoli HAO
    ScienceChina(InformationSciences), 2019, 62 (12) : 44 - 52
  • [6] ARPNET: attention region proposal network for 3D object detection
    Ye, Yangyang
    Zhang, Chi
    Hao, Xiaoli
    SCIENCE CHINA-INFORMATION SCIENCES, 2019, 62 (12)
  • [7] Learning View-Model Joint Relevance for 3D Object Retrieval
    Lu, Ke
    He, Ning
    Xue, Jian
    Dong, Jiyang
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (05) : 1449 - 1459
  • [8] Multi-view 3D object retrieval leveraging the aggregation of view and instance attentive features
    Lin, Dongyun
    Li, Yiqun
    Cheng, Yi
    Prasad, Shitala
    Nwe, Tin Lay
    Dong, Sheng
    Guo, Aiyuan
    KNOWLEDGE-BASED SYSTEMS, 2022, 247
  • [9] 3D model based object class detection in an arbitrary view
    Yan, Pingkun
    Khan, Saad M.
    Shah, Mubarak
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 1673 - 1678
  • [10] Viewpoint Equivariance for Multi-View 3D Object Detection
    Chen, Dian
    Li, Jie
    Guizilini, Vitor
    Ambrus, Rares
    Gaidon, Adrien
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9213 - 9222