3D detection transformer: Set prediction of objects using point clouds

Cited by: 0
Authors
Thon, Tan [1, 2]
Lim, Joanne Mun-Yee [1]
Jinn, Foo Ji [1]
Muniandy, Ramachandran [2]
Affiliations
[1] Monash Univ Malaysia, Sch Engn, Dept Elect & Robot Engn, Jalan Lagoon Selatan, Subang Jaya 47500, Selangor, Malaysia
[2] Asia Mobil Technol SDN BHD, Tower 3,Jalan Pengaturcara U1-51A, Shah Alam 40150, Selangor, Malaysia
Keywords
Deep learning; 3D object detection; Point clouds; Transformers; Single-stage detector;
DOI
10.1016/j.cviu.2023.103808
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Object detection in 3D scenes relies on two main methods: proposal-based detection (two-stage detectors) or anchor-based detection (single-stage detectors), similar to approaches for object detection in 2D. In this paper, we propose the 3DeTR framework, which produces 3D detections without the use of anchors or proposals, allowing the entire neural network to be trained end to end. Raw point cloud scenes are augmented and fed into a distance-and-reflectiveness-based feature extractor to produce representative points. Then, a transformer encoder-decoder module learns the local object relations and global context to generate detections in parallel, which are passed to a set-based loss function that maps each prediction uniquely to the set of ground truth labels. The model's architecture produces 3D detections by regressing directly against the set of ground truths without the need for anchors or proposals, which are bottlenecks for object detection performance. We tested the framework on the KITTI Vision Benchmark Suite 3D object detection dataset, achieving results on par with the state of the art: 80.37 AP on the Cars (Moderate) class and 47.92 AP on the Pedestrians (Moderate) class.
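The unique mapping between predictions and ground truth labels mentioned in the abstract can be illustrated with a small bipartite matching sketch. This is not the paper's loss: the abstract does not give the cost terms, so the cost below (L1 distance between box centers plus a class-mismatch penalty) and the names `match_cost` and `match_predictions` are assumptions made for illustration. The search is exhaustive, which is only practical for a handful of boxes; set-prediction detectors typically use the Hungarian algorithm for this step.

```python
from itertools import permutations


def match_cost(pred, truth):
    """Hypothetical pairwise cost (an assumption, not the paper's loss):
    L1 distance between box centers plus a penalty of 1.0 on class mismatch."""
    center_dist = sum(abs(p - t) for p, t in zip(pred["center"], truth["center"]))
    class_penalty = 0.0 if pred["label"] == truth["label"] else 1.0
    return center_dist + class_penalty


def match_predictions(preds, truths):
    """Find the one-to-one assignment of predictions to ground truths with
    minimum total cost. assignment[j] is the index of the prediction matched
    to truths[j]. Exhaustive search, so it assumes len(preds) >= len(truths)
    and very small sets."""
    best_assignment, best_cost = None, float("inf")
    for candidate in permutations(range(len(preds)), len(truths)):
        cost = sum(match_cost(preds[i], truths[j]) for j, i in enumerate(candidate))
        if cost < best_cost:
            best_assignment, best_cost = candidate, cost
    return best_assignment, best_cost


# Toy scene: three parallel predictions, two ground truth boxes.
preds = [
    {"center": (10.0, 2.0, 0.5), "label": "Pedestrian"},
    {"center": (0.2, 0.1, 0.0), "label": "Car"},
    {"center": (30.0, -4.0, 1.0), "label": "Car"},
]
truths = [
    {"center": (0.0, 0.0, 0.0), "label": "Car"},
    {"center": (10.0, 2.0, 0.0), "label": "Pedestrian"},
]

assignment, total_cost = match_predictions(preds, truths)
print(assignment, round(total_cost, 3))
```

In this toy scene the third prediction is left unmatched. In DETR-style training such leftover predictions are usually supervised toward a "no object" class, though whether 3DeTR does the same is not stated in the abstract.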
Pages: 10