3D-MAN: 3D Multi-frame Attention Network for Object Detection

Cited by: 59
Authors
Yang, Zetong [1]
Zhou, Yin [2]
Chen, Zhifeng [3]
Ngiam, Jiquan [3]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Waymo LLC, Mountain View, CA USA
[3] Google Res, Brain Team, Mountain View, CA USA
DOI
10.1109/CVPR46437.2021.00190
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
3D object detection is an important module in autonomous driving and robotics. However, many existing methods focus on using single frames to perform 3D detection, and do not fully utilize information from multiple frames. In this paper, we present 3D-MAN: a 3D multi-frame attention network that effectively aggregates features from multiple perspectives and achieves state-of-the-art performance on Waymo Open Dataset. 3D-MAN first uses a novel fast single-frame detector to produce box proposals. The box proposals and their corresponding feature maps are then stored in a memory bank. We design a multi-view alignment and aggregation module, using attention networks, to extract and aggregate the temporal features stored in the memory bank. This effectively combines the features coming from different perspectives of the scene. We demonstrate the effectiveness of our approach on the large-scale complex Waymo Open Dataset, achieving state-of-the-art results compared to published single-frame and multi-frame methods.
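The memory-bank and attention steps described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `MemoryBank` class, the `attend` function, the two-dimensional feature vectors, and the use of plain scaled dot-product attention are all assumptions made for illustration, and the paper's actual alignment and aggregation module is more involved. The sketch only shows the general idea of keeping proposal features from recent frames and fusing them by attention weights.

```python
import math
from collections import deque


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, memory):
    """Scaled dot-product attention of one query vector over stored vectors.

    Hypothetical helper: the stored vectors act as both keys and values.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in memory]
    weights = softmax(scores)
    dim = len(query)
    fused = [sum(w * vec[i] for w, vec in zip(weights, memory)) for i in range(dim)]
    return fused, weights


class MemoryBank:
    """Hypothetical store of proposal features for the last `max_frames` frames."""

    def __init__(self, max_frames):
        # deque with maxlen drops the oldest frame automatically
        self.frames = deque(maxlen=max_frames)

    def add_frame(self, proposal_features):
        self.frames.append(list(proposal_features))

    def all_features(self):
        # Flatten the per-frame lists into one list of feature vectors
        return [feat for frame in self.frames for feat in frame]


bank = MemoryBank(max_frames=2)
bank.add_frame([[1.0, 0.0], [0.0, 1.0]])
bank.add_frame([[0.9, 0.1]])
bank.add_frame([[0.8, 0.2], [0.1, 0.9]])  # evicts the oldest frame

# Fuse stored features for a (made-up) current-frame proposal feature
fused, weights = attend([1.0, 0.0], bank.all_features())
```

In this toy setup the stored features most similar to the query receive the largest attention weights, so the fused vector leans toward proposals that look like the current one across the retained frames.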
Pages: 1863-1872
Page count: 10
Related papers
50 in total
  • [1] Multi-frame Attention Network for Left Ventricle Segmentation in 3D Echocardiography
    Ahn, Shawn S.
    Ta, Kevinminh
    Thorn, Stephanie
    Langdon, Jonathan
    Sinusas, Albert J.
    Duncan, James S.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 348 - 357
  • [2] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence
    Lee, Junhyung
    Koh, Junho
    Lee, Youngwoo
    Choi, Jun Won
    arXiv, 2022
  • [3] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence
    Lee, Junhyung
    Koh, Junho
    Lee, Youngwoo
    Choi, Jun Won
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9238 - 9244
  • [4] 3D Object Detection With Multi-Frame RGB-Lidar Feature Alignment
    Ercelik, Emec
    Yurtsever, Ekim
    Knoll, Alois
    IEEE ACCESS, 2021, 9 : 143138 - 143149
  • [5] TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection
    Luo, Zhipeng
    Zhang, Gongjie
    Zhou, Changqing
    Liu, Tianrui
    Lu, Shijian
    Pan, Liang
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 4219 - 4228
  • [6] Multi-Sensor Fusion 3D Object Detection Based on Multi-Frame Information
    Wu S.
    Geng J.
    Wu C.
    Yan Z.
    Chen K.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (12) : 1282 - 1289
  • [7] Boosting Single-Frame 3D Object Detection by Simulating Multi-Frame Point Clouds
    Zheng, Wu
    Jiang, Li
    Lu, FanBin
    Ye, Yangyang
    Fu, Chi-Wing
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4848 - 4856
  • [8] MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection
    Chen, Xuesong
    Shi, Shaoshuai
    Zhu, Benjin
    Cheung, Ka Chun
    Xu, Hang
    Li, Hongsheng
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 680 - 697
  • [9] Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
    Zhang, Yifan
    Zhu, Zhiyu
    Hou, Junhui
    Wu, Dapeng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10614 - 10628
  • [10] Multi-frame fusion of undersampled 3D imagery
    Cain, Stephen C.
    UNCONVENTIONAL IMAGING AND WAVEFRONT SENSING 2012, 2012, 8520