PillarNet plus plus : Pillar-Based 3-D Object Detection With Multiattention

被引：2

作者：

Guo, Dongbing ^{[1
,2
]}

Yang, Guohui ^{[3
]}

Wang, Chunhui ^{[1
]}

机构：

[1] Harbin Inst Technol, Natl Key Lab Tunable Laser Technol, Harbin 150001, Peoples R China

[2] Shanxi Data Technol Co Ltd, Taiyuan 030032, Peoples R China

[3] Harbin Inst Technol, Sch Elect & Informat Engn, Dept Microwave Engn, Harbin 150001, Peoples R China

来源：

IEEE SENSORS JOURNAL | 2023年 / 23卷 / 22期

关键词：

3-D object detection; autonomous driving; light detection and ranging (LiDAR); multiattention;

D O I：

10.1109/JSEN.2023.3323368

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Light detection and ranging (LiDAR)-based 3-D object detection constitutes a fundamental component of autonomous driving technology. In this research, we propose a novel approach called PillarNet++ to tackle the challenges associated with fine-grained information loss during point cloud encoding and the inadequate interaction or incomplete fusion of feature maps across different scales in subsequent feature extraction stages, resulting in a decrease in partial occlusion and long-distance 3-D object detection accuracy, leading to false and missed detections. The PillarNet++ method primarily comprises two modules: the multiattention-pillar-encoding (MAPE) module and the pseudo-image-split-multibranch-feature-pyramid-network (PSMB-FPN) module. The MAPE module enhances the information extraction capability in nonempty pillars by integrating max pooling and average pooling, by fusion of the pointwise, channelwise, and pillarwise attention; the MAPE module can adaptively focus on the important information and suppress the secondary point clouds. In addition, the stacked MAPE modules can refine pillars and extract finer features. On the other hand, the PSMB-FPN module splits the pseudo-image along the channel dimension and subsequently performs MB-FPN feature extraction and fusion on each channel, facilitating the interaction of multiscale and multilevel feature maps and improving prediction accuracy. Experimental results on the KITTI 3-D object detection benchmark show that the PillarNet++ method has the best performance among single-stage object detection algorithms and even exceeds most two-stage methods.

引用

页码：27733 / 27743

页数：11

共 50 条

[31] Object Detection for Chinese Traditional Costume Images Based GRP-DSOD plus plus Network
Zhao, Haiying
Yang, Ting
Hou, Xiaogang
Zhu, Hui
Yang, Zhuoyu
IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 18 - 31
[32] RSDet plus plus : Point-Based Modulated Loss for More Accurate Rotated Object Detection
Qian, Wen
Yang, Xue
Peng, Silong
Zhang, Xiujuan
Yan, Junchi
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7869 - 7879
[33] Pseudo-Stereo plus plus : Cycled Generative Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
Elhagry, Ahmed
Dai, Hang
El Saddik, Abdulmotaleb
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7428 - 7435
[34] Geometry-based automatic object localization and 3-D pose detection
Magnor, MA
FIFTH IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION, PROCEEDINGS, 2002, : 144 - 147
[35] Diversity Knowledge Distillation for LiDAR-Based 3-D Object Detection
Ning, Kanglin
Liu, Yanfei
Su, Yanzhao
Jiang, Ke
IEEE SENSORS JOURNAL, 2023, 23 (11) : 11181 - 11193
[36] 3-D Object Detection With Balanced Prediction Based on Contrastive Point Loss
Tong, Jiaxun
Liu, Kaiqi
Bai, Xia
Li, Wei
IEEE SENSORS JOURNAL, 2024, 24 (04) : 4969 - 4977
[37] A symbolic representation for 3-D object feature detection
Neal, PJ
Shapiro, LG
15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 221 - 224
[38] A 3-D object recognition system based on a rapid 3-D vision system
Choi, S
Park, H
Kim, S
Park, S
Won, S
Jecing, H
NEW TECHNOLOGIES FOR AUTOMATION OF METALLURGICAL INDUSTRY 2003, 2004, : 269 - 274
[39] PV-RCNN plus plus : Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
Shi, Shaoshuai
Jiang, Li
Deng, Jiajun
Wang, Zhe
Guo, Chaoxu
Shi, Jianping
Wang, Xiaogang
Li, Hongsheng
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
[40] PSA-Det3D: Pillar set abstraction for 3D object detection
Huang, Zhicong
Zheng, Zhijie
Zhao, Jingwen
Hu, Haifeng
Wang, Zixin
Chen, Dihu
PATTERN RECOGNITION LETTERS, 2023, 168 : 138 - 145

← 1 2 3 4 5 →