A Coarse-to-Fine Framework for Point Voxel Transformer

被引:0
|
作者
Bai, Zhuhua [1 ]
Meng, Fantong [1 ]
Li, Weiqing [1 ]
Kang, Renke [1 ]
Yang, Guolin [1 ]
Dong, Zhigang [1 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
关键词
3D vision; PVT; Coarse-to-Fine; Coarse-grained; Important Voxel Identification; Fine-grained;
D O I
10.1109/CSCWD61410.2024.10580279
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
To effectively solve the problem that the input point clouds in the traditional point voxel transformer model (PVT) appear to be quite redundant in spatial dimensions, which causes massive computation and memory costs, we propose a novel coarse-to-fine point voxel transformer framework(CF-PVT) to relieve computation and memory burden while retaining performance. Our CF-PVT implements network inference in a two-stage manner. In the coarse inference stage, the input point cloud is split into coarse-grained voxels for economic computation. If it cannot be identified well, important voxels containing rich information are identified by the Important Voxel Identification Module and further split into fine-grained voxels. We conduct extensive experiments on traditional classification and segmentation tasks. The experiments demonstrate that our CF-PVT framework is highly effective. For example, while maintaining similar accuracy, CF-PVT reduces 60.1% FLOPs, and 68.9% latency of PVT1 on the ModelNet40 dataset.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [21] Coarse-to-fine face detection
    Fleuret, F
    Geman, D
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 41 (1-2) : 85 - 107
  • [22] Coarse-to-fine manifold learning
    Castro, R
    Willett, R
    Nowak, R
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 992 - 995
  • [23] Coarse-to-Fine Grained Classification
    Huo, Yuqi
    Lu, Yao
    Niu, Yulei
    Lu, Zhiwu
    Wen, Ji-Rong
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1033 - 1036
  • [24] Coarse-to-fine dynamic programming
    Raphael, C
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (12) : 1379 - 1390
  • [25] Coarse-to-Fine Face Detection
    Francois Fleuret
    Donald Geman
    International Journal of Computer Vision, 2001, 41 : 85 - 107
  • [26] Coarse-to-Fine Nutrition Prediction
    Wang, Binglu
    Bu, Tianci
    Hu, Zaiyi
    Yang, Le
    Zhao, Yongqiang
    Li, Xuelong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 (26) : 3651 - 3662
  • [27] ON THE COARSE-TO-FINE STRATEGY IN STEREOMATCHING
    PRAZDNY, K
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1987, 25 (02) : 92 - 94
  • [28] Coarse-to-Fine Point Cloud Registration with SE(3)-Equivariant Representations
    Lin, Cheng-Wei
    Chen, Tung-, I
    Lee, Hsin-Ying
    Chen, Wen-Chin
    Hsu, Winston H.
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2833 - 2840
  • [29] Fast clustering method of LiDAR point clouds from coarse-to-fine
    Guo, Dongbing
    Qi, Baoling
    Wang, Chunhui
    INFRARED PHYSICS & TECHNOLOGY, 2023, 129
  • [30] Coarse-to-Fine Segmentation on LiDAR Point Clouds in Spherical Coordinate and Beyond
    Li, You
    Le Bihan, Clement
    Pourtau, Txomin
    Ristorcelli, Thomas
    Ibanez-Guzman, Javier
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 14588 - 14601