PillarNet plus plus : Pillar-Based 3-D Object Detection With Multiattention

被引:2
|
作者
Guo, Dongbing [1 ,2 ]
Yang, Guohui [3 ]
Wang, Chunhui [1 ]
机构
[1] Harbin Inst Technol, Natl Key Lab Tunable Laser Technol, Harbin 150001, Peoples R China
[2] Shanxi Data Technol Co Ltd, Taiyuan 030032, Peoples R China
[3] Harbin Inst Technol, Sch Elect & Informat Engn, Dept Microwave Engn, Harbin 150001, Peoples R China
关键词
3-D object detection; autonomous driving; light detection and ranging (LiDAR); multiattention;
D O I
10.1109/JSEN.2023.3323368
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Light detection and ranging (LiDAR)-based 3-D object detection constitutes a fundamental component of autonomous driving technology. In this research, we propose a novel approach called PillarNet++ to tackle the challenges associated with fine-grained information loss during point cloud encoding and the inadequate interaction or incomplete fusion of feature maps across different scales in subsequent feature extraction stages, resulting in a decrease in partial occlusion and long-distance 3-D object detection accuracy, leading to false and missed detections. The PillarNet++ method primarily comprises two modules: the multiattention-pillar-encoding (MAPE) module and the pseudo-image-split-multibranch-feature-pyramid-network (PSMB-FPN) module. The MAPE module enhances the information extraction capability in nonempty pillars by integrating max pooling and average pooling, by fusion of the pointwise, channelwise, and pillarwise attention; the MAPE module can adaptively focus on the important information and suppress the secondary point clouds. In addition, the stacked MAPE modules can refine pillars and extract finer features. On the other hand, the PSMB-FPN module splits the pseudo-image along the channel dimension and subsequently performs MB-FPN feature extraction and fusion on each channel, facilitating the interaction of multiscale and multilevel feature maps and improving prediction accuracy. Experimental results on the KITTI 3-D object detection benchmark show that the PillarNet++ method has the best performance among single-stage object detection algorithms and even exceeds most two-stage methods.
引用
收藏
页码:27733 / 27743
页数:11
相关论文
共 50 条
  • [31] Object Detection for Chinese Traditional Costume Images Based GRP-DSOD plus plus Network
    Zhao, Haiying
    Yang, Ting
    Hou, Xiaogang
    Zhu, Hui
    Yang, Zhuoyu
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 18 - 31
  • [32] RSDet plus plus : Point-Based Modulated Loss for More Accurate Rotated Object Detection
    Qian, Wen
    Yang, Xue
    Peng, Silong
    Zhang, Xiujuan
    Yan, Junchi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7869 - 7879
  • [33] Pseudo-Stereo plus plus : Cycled Generative Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving
    Elhagry, Ahmed
    Dai, Hang
    El Saddik, Abdulmotaleb
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7428 - 7435
  • [34] Geometry-based automatic object localization and 3-D pose detection
    Magnor, MA
    FIFTH IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION, PROCEEDINGS, 2002, : 144 - 147
  • [35] Diversity Knowledge Distillation for LiDAR-Based 3-D Object Detection
    Ning, Kanglin
    Liu, Yanfei
    Su, Yanzhao
    Jiang, Ke
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 11181 - 11193
  • [36] 3-D Object Detection With Balanced Prediction Based on Contrastive Point Loss
    Tong, Jiaxun
    Liu, Kaiqi
    Bai, Xia
    Li, Wei
    IEEE SENSORS JOURNAL, 2024, 24 (04) : 4969 - 4977
  • [37] A symbolic representation for 3-D object feature detection
    Neal, PJ
    Shapiro, LG
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 221 - 224
  • [38] A 3-D object recognition system based on a rapid 3-D vision system
    Choi, S
    Park, H
    Kim, S
    Park, S
    Won, S
    Jecing, H
    NEW TECHNOLOGIES FOR AUTOMATION OF METALLURGICAL INDUSTRY 2003, 2004, : 269 - 274
  • [39] PV-RCNN plus plus : Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
    Shi, Shaoshuai
    Jiang, Li
    Deng, Jiajun
    Wang, Zhe
    Guo, Chaoxu
    Shi, Jianping
    Wang, Xiaogang
    Li, Hongsheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 131 (02) : 531 - 551
  • [40] PSA-Det3D: Pillar set abstraction for 3D object detection
    Huang, Zhicong
    Zheng, Zhijie
    Zhao, Jingwen
    Hu, Haifeng
    Wang, Zixin
    Chen, Dihu
    PATTERN RECOGNITION LETTERS, 2023, 168 : 138 - 145