A Coarse-to-Fine Framework for Point Voxel Transformer

被引：0

作者：

Bai, Zhuhua ^{[1
]}

Meng, Fantong ^{[1
]}

Li, Weiqing ^{[1
]}

Kang, Renke ^{[1
]}

Yang, Guolin ^{[1
]}

Dong, Zhigang ^{[1
]}

机构：

[1] Dalian Univ Technol, Dalian, Peoples R China

来源：

PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024 | 2024年

关键词：

3D vision; PVT; Coarse-to-Fine; Coarse-grained; Important Voxel Identification; Fine-grained;

D O I：

10.1109/CSCWD61410.2024.10580279

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

To effectively solve the problem that the input point clouds in the traditional point voxel transformer model (PVT) appear to be quite redundant in spatial dimensions, which causes massive computation and memory costs, we propose a novel coarse-to-fine point voxel transformer framework(CF-PVT) to relieve computation and memory burden while retaining performance. Our CF-PVT implements network inference in a two-stage manner. In the coarse inference stage, the input point cloud is split into coarse-grained voxels for economic computation. If it cannot be identified well, important voxels containing rich information are identified by the Important Voxel Identification Module and further split into fine-grained voxels. We conduct extensive experiments on traditional classification and segmentation tasks. The experiments demonstrate that our CF-PVT framework is highly effective. For example, while maintaining similar accuracy, CF-PVT reduces 60.1% FLOPs, and 68.9% latency of PVT1 on the ModelNet40 dataset.

引用

页码：205 / 211

页数：7

共 50 条

[21] Coarse-to-fine face detection
Fleuret, F
Geman, D
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 41 (1-2) : 85 - 107
[22] Coarse-to-fine manifold learning
Castro, R
Willett, R
Nowak, R
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 992 - 995
[23] Coarse-to-Fine Grained Classification
Huo, Yuqi
Lu, Yao
Niu, Yulei
Lu, Zhiwu
Wen, Ji-Rong
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1033 - 1036
[24] Coarse-to-fine dynamic programming
Raphael, C
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (12) : 1379 - 1390
[25] Coarse-to-Fine Face Detection
Francois Fleuret
Donald Geman
International Journal of Computer Vision, 2001, 41 : 85 - 107
[26] Coarse-to-Fine Nutrition Prediction
Wang, Binglu
Bu, Tianci
Hu, Zaiyi
Yang, Le
Zhao, Yongqiang
Li, Xuelong
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 (26) : 3651 - 3662
[27] ON THE COARSE-TO-FINE STRATEGY IN STEREOMATCHING
PRAZDNY, K
BULLETIN OF THE PSYCHONOMIC SOCIETY, 1987, 25 (02) : 92 - 94
[28] Coarse-to-Fine Point Cloud Registration with SE(3)-Equivariant Representations
Lin, Cheng-Wei
Chen, Tung-, I
Lee, Hsin-Ying
Chen, Wen-Chin
Hsu, Winston H.
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2833 - 2840
[29] Fast clustering method of LiDAR point clouds from coarse-to-fine
Guo, Dongbing
Qi, Baoling
Wang, Chunhui
INFRARED PHYSICS & TECHNOLOGY, 2023, 129
[30] Coarse-to-Fine Segmentation on LiDAR Point Clouds in Spherical Coordinate and Beyond
Li, You
Le Bihan, Clement
Pourtau, Txomin
Ristorcelli, Thomas
Ibanez-Guzman, Javier
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 14588 - 14601

← 1 2 3 4 5 →