3D-MAN: 3D Multi-frame Attention Network for Object Detection

Cited by: 59

Authors:

Yang, Zetong [1]

Zhou, Yin [2]

Chen, Zhifeng [3]

Ngiam, Jiquan [3]

Affiliations:

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] Waymo LLC, Mountain View, CA USA

[3] Google Res, Brain Team, Mountain View, CA USA

Source:

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021

Keywords:

DOI:

10.1109/CVPR46437.2021.00190

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

3D object detection is an important module in autonomous driving and robotics. However, many existing methods focus on using single frames to perform 3D detection, and do not fully utilize information from multiple frames. In this paper, we present 3D-MAN: a 3D multi-frame attention network that effectively aggregates features from multiple perspectives and achieves state-of-the-art performance on the Waymo Open Dataset. 3D-MAN first uses a novel fast single-frame detector to produce box proposals. The box proposals and their corresponding feature maps are then stored in a memory bank. We design a multi-view alignment and aggregation module, using attention networks, to extract and aggregate the temporal features stored in the memory bank. This effectively combines the features coming from different perspectives of the scene. We demonstrate the effectiveness of our approach on the large-scale, complex Waymo Open Dataset, achieving state-of-the-art results compared to published single-frame and multi-frame methods.
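
The pipeline described in the abstract (per-frame proposals, a memory bank, and attention-based aggregation) can be illustrated with a minimal sketch. This is not the authors' implementation: the names `MemoryBank`, `attend`, and `aggregate`, the two-dimensional feature vectors, and the use of plain scaled dot-product attention are all assumptions chosen for illustration, and the single-frame detector is replaced by hand-written feature lists.

```python
import math
from collections import deque


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, keys, values):
    """Scaled dot-product attention for a single query vector."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    dim = len(values[0])
    # Weighted sum of the value vectors, one output dimension at a time.
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]


class MemoryBank:
    """Hypothetical store for proposal features of the most recent frames."""

    def __init__(self, max_frames=4):
        # A bounded deque drops the oldest frame automatically.
        self.frames = deque(maxlen=max_frames)

    def add(self, proposal_features):
        self.frames.append(proposal_features)

    def all_features(self):
        return [feat for frame in self.frames for feat in frame]


def aggregate(current_proposals, bank):
    """Refine each current proposal by attending over all stored features."""
    stored = bank.all_features()
    return [attend(q, stored, stored) for q in current_proposals]


bank = MemoryBank(max_frames=2)
bank.add([[1.0, 0.0], [0.0, 1.0]])  # made-up features from an earlier frame
bank.add([[0.9, 0.1]])              # made-up features from the current frame
fused = aggregate([[0.9, 0.1]], bank)
```

Each entry of `fused` is a convex combination of the stored features, weighted toward the stored proposals most similar to the query. The paper's alignment and aggregation module is more elaborate than this single attention step.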

Pages: 1863-1872

Number of pages: 10

50 items in total

[1] Multi-frame Attention Network for Left Ventricle Segmentation in 3D Echocardiography
Ahn, Shawn S.
Ta, Kevinminh
Thorn, Stephanie
Langdon, Jonathan
Sinusas, Albert J.
Duncan, James S.
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT I, 2021, 12901 : 348 - 357
[2] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence
Lee, Junhyung
Koh, Junho
Lee, Youngwoo
Choi, Jun Won
arXiv, 2022
[3] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence
Lee, Junhyung
Koh, Junho
Lee, Youngwoo
Choi, Jun Won
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023 : 9238 - 9244
[4] 3D Object Detection With Multi-Frame RGB-Lidar Feature Alignment
Ercelik, Emec
Yurtsever, Ekim
Knoll, Alois
IEEE ACCESS, 2021, 9 : 143138 - 143149
[5] TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection
Luo, Zhipeng
Zhang, Gongjie
Zhou, Changqing
Liu, Tianrui
Lu, Shijian
Pan, Liang
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023 : 4219 - 4228
[6] Multi-Sensor Fusion 3D Object Detection Based on Multi-Frame Information
Wu S.
Geng J.
Wu C.
Yan Z.
Chen K.
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (12) : 1282 - 1289
[7] Boosting Single-Frame 3D Object Detection by Simulating Multi-Frame Point Clouds
Zheng, Wu
Jiang, Li
Lu, FanBin
Ye, Yangyang
Fu, Chi-Wing
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022 : 4848 - 4856
[8] MPPNet: Multi-frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection
Chen, Xuesong
Shi, Shaoshuai
Zhu, Benjin
Cheung, Ka Chun
Xu, Hang
Li, Hongsheng
COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 680 - 697
[9] Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection
Zhang, Yifan
Zhu, Zhiyu
Hou, Junhui
Wu, Dapeng
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 10614 - 10628
[10] Multi-frame fusion of undersampled 3D imagery
Cain, Stephen C.
UNCONVENTIONAL IMAGING AND WAVEFRONT SENSING 2012, 2012, 8520
