A Coarse-to-Fine Framework for Point Voxel Transformer

被引:0
|
作者
Bai, Zhuhua [1 ]
Meng, Fantong [1 ]
Li, Weiqing [1 ]
Kang, Renke [1 ]
Yang, Guolin [1 ]
Dong, Zhigang [1 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
关键词
3D vision; PVT; Coarse-to-Fine; Coarse-grained; Important Voxel Identification; Fine-grained;
D O I
10.1109/CSCWD61410.2024.10580279
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
To effectively solve the problem that the input point clouds in the traditional point voxel transformer model (PVT) appear to be quite redundant in spatial dimensions, which causes massive computation and memory costs, we propose a novel coarse-to-fine point voxel transformer framework(CF-PVT) to relieve computation and memory burden while retaining performance. Our CF-PVT implements network inference in a two-stage manner. In the coarse inference stage, the input point cloud is split into coarse-grained voxels for economic computation. If it cannot be identified well, important voxels containing rich information are identified by the Important Voxel Identification Module and further split into fine-grained voxels. We conduct extensive experiments on traditional classification and segmentation tasks. The experiments demonstrate that our CF-PVT framework is highly effective. For example, while maintaining similar accuracy, CF-PVT reduces 60.1% FLOPs, and 68.9% latency of PVT1 on the ModelNet40 dataset.
引用
收藏
页码:205 / 211
页数:7
相关论文
共 50 条
  • [31] Coarse-to-fine adjustment for multi-platform point cloud fusion
    Zhao, Xin
    Li, Jianping
    Li, Yuhao
    Yang, Bisheng
    Sun, Sihan
    Lin, Yongfeng
    Dong, Zhen
    PHOTOGRAMMETRIC RECORD, 2024, 39 (188): : 807 - 830
  • [32] Shape recognition with coarse-to-fine point correspondence under image deformations
    Tang, Huixuan
    Wei, Hui
    COGNITIVE SYSTEMS, 2007, 4429 : 130 - +
  • [33] Multiple Granularity Modeling: A Coarse-to-Fine Framework for Fine-grained Action Analysis
    Ni, Bingbing
    Paramathayalan, Vignesh R.
    Li, Teng
    Moulin, Pierre
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (01) : 28 - 43
  • [34] Multiple Granularity Modeling: A Coarse-to-Fine Framework for Fine-grained Action Analysis
    Bingbing Ni
    Vignesh R. Paramathayalan
    Teng Li
    Pierre Moulin
    International Journal of Computer Vision, 2016, 120 : 28 - 43
  • [35] CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration
    Yu, Hao
    Li, Fu
    Saleh, Mahdi
    Busam, Benjamin
    Ilic, Slobodan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [36] A Coarse-to-Fine Framework for Cloud Removal in Remote Sensing Image Sequence
    Zhang, Yongjun
    Wen, Fei
    Gao, Zhi
    Ling, Xiao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (08): : 5963 - 5974
  • [37] A coarse-to-fine point completion network with details compensation and structure enhancement
    Yongwei Miao
    Chengyu Jing
    Weihao Gao
    Xudong Zhang
    Scientific Reports, 14
  • [38] A Coarse-to-Fine Framework for the 2021 Kidney and Kidney Tumor Segmentation Challenge
    Zhao, Zhongchen
    Chen, Huai
    Wang, Lisheng
    KIDNEY AND KIDNEY TUMOR SEGMENTATION, KITS 2021, 2022, 13168 : 53 - 58
  • [39] Beyond feature integration: a coarse-to-fine framework for cascade correlation tracking
    Dongdong Li
    Gongjian Wen
    Yangliu Kuai
    Fatih Porikli
    Machine Vision and Applications, 2019, 30 : 519 - 528
  • [40] A Two-Stage Coarse-to-Fine Brain Tumor Segmentation Framework
    Chen H.
    Qin Z.-G.
    Ding Y.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2020, 49 (04): : 590 - 596