3D detection transformer: Set prediction of objects using point clouds

被引：0

作者：

Thon, Tan ^{[1
,2
]}

Lim, Joanne Mun-Yee ^{[1
]}

Jinn, Foo Ji ^{[1
]}

Muniandy, Ramachandran ^{[2
]}

机构：

[1] Monash Univ Malaysia, Sch Engn, Dept Elect & Robot Engn, Jalan Lagoon Selatan, Subang Jaya 47500, Selangor, Malaysia

[2] Asia Mobil Technol SDN BHD, Tower 3,Jalan Pengaturcara U1-51A, Shah Alam 40150, Selangor, Malaysia

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2023年 / 236卷

关键词：

Deep learning; 3D object detection; Point clouds; Transformers; Single-stage detector;

D O I：

10.1016/j.cviu.2023.103808

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object detection in 3D scenes rely on two main methods: detection based on proposals (two-stage detectors) or detections based on anchors (single-stage detectors), similar to approaches for object detection in 2D. In this paper, we propose the 3DeTR framework that produces 3D detections without the use of anchors or proposals, allowing training of the entire neural network in an end-to-end manner. Raw point cloud scenes are augmented and input into distance-and-reflectiveness-based feature extractor to produce representative points. Then, a transformer encoder-decoder module learns the local object relations and global context to generate parallel detections, which are then passed to a set-based loss function to map predictions to the set of ground truth labels uniquely. The model's architecture produces 3D detections by regressing directly with the set of ground truths without the need for anchors or proposals, which are bottlenecks for object detection performances. We tested the framework on the KITTI Vision Benchmark Suite 3D object detection dataset, achieving results on par with the state-of-the-art: 80.37 AP on Cars (Moderate) class and 47.92 AP on Pedestrians (Moderate) class.

引用

页数：10

共 50 条

[31] 3D Siamese Transformer Network for Single Object Tracking on Point Clouds
Hui, Le
Wang, Lingpeng
Tang, Linghua
Lan, Kaihao
Xie, Jin
Yang, Jian
[J]. COMPUTER VISION - ECCV 2022, PT II, 2022, 13662 : 293 - 310
[32] MATNet: Semantic segmentation of 3D point clouds with multiscale adaptive transformer
Zheng, Yufei
Lu, Jian
Chen, Xiaogai
Zhang, Kaibing
Zhou, Jian
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2024, 119
[33] RGB-D Images for Objects Recognition using 3D Point Clouds and RANSAC Plane Fitting
Jalal, Ahmad
Sarwar, M. Zeeshan
Kim, Kibum
[J]. PROCEEDINGS OF 2021 INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGIES (IBCAST), 2021, : 518 - 523
[34] 3D Detection for Occluded Vehicles From Point Clouds
Zhao, Kun
Liu, Li
Meng, Yu
Liu, Hao
Gu, Qing
[J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2022, 14 (05) : 59 - 71
[35] Hole Boundary Detection of a Surface of 3D point clouds
Van Sinh Nguyen
Trong Hai Trinh
Manh Ha Tran
[J]. 2015 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP), 2015, : 124 - 129
[36] TriplClust: An Algorithm for Curve Detection in 3D Point Clouds
Dalitz, Christoph
Wilberg, Jens
Aymans, Lukas
[J]. IMAGE PROCESSING ON LINE, 2019, 9 : 26 - 46
[37] Road Junction Detection from 3D Point Clouds
Habermann, Danilo
Vido, Carlos E. O.
Osorio, Fernando S.
Ramos, Fabio
[J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4934 - 4940
[38] Hypergraph Representation for Detecting 3D Objects From Noisy Point Clouds
Jiang, Ping
Deng, Xiaoheng
Wang, Leilei
Chen, Zailiang
Zhang, Shichao
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7016 - 7029
[39] Recognizing geometric primitives in 3D point clouds of mechanical CAD objects
Romanengo, Chiara
Raffo, Andrea
Biasotti, Silvia
Falcidieno, Bianca
[J]. COMPUTER-AIDED DESIGN, 2023, 157
[40] Comparison and classification of 3D objects surface point clouds on the example of feet
Grimmer, Rainer
Eskofier, Bjoern
Schlarb, Heiko
Hornegger, Joachim
[J]. MACHINE VISION AND APPLICATIONS, 2011, 22 (02) : 235 - 243

← 1 2 3 4 5 →