An end-to-end dynamic point cloud geometry compression in latent space

被引:1
|
作者
Jiang, Zhaoyi [1 ,4 ]
Wang, Guoliang [1 ,4 ]
Tam, Gary K. L. [2 ,4 ]
Song, Chao [1 ,4 ]
Li, Frederick W. B. [3 ,4 ]
Yang, Bailin [1 ,4 ]
机构
[1] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Swansea Univ, Dept Comp Sci, Skewen, Wales
[3] Univ Durham, Dept Comp Sci, Durham, England
[4] Zhejiang Gongshang Univ, Sch Stat & Math, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic point clouds compression; Geometry encoding; Latent scene flow; Deep entropy model;
D O I
10.1016/j.displa.2023.102528
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic point clouds are widely used for 3D data representation in various applications such as immersive and mixed reality, robotics and autonomous driving. However, their irregularity and large scale make efficient compression and transmission a challenge. Existing methods require high bitrates to encode point clouds since temporal correlation is not well considered. This paper proposes an end-to-end dynamic point cloud compression network that operates in latent space, resulting in more accurate motion estimation and more effective motion compensation. Specifically, a multi-scale motion estimation network is introduced to obtain accurate motion vectors. Motion information computed at a coarser level is upsampled and warped to the finer level based on cost volume analysis for motion compensation. Additionally, a residual compression network is designed to mitigate the effects of noise and inaccurate predictions by encoding latent residuals, resulting in smaller conditional entropy and better results. The proposed method achieves an average 12.09% and 14.76% (D2) BD-Rate gain over state-of-the-art Deep Dynamic Point Cloud Compression (D-DPCC) in experimental results. Compared to V-PCC, our framework showed an average improvement of 81.29% (D1) and 77.57% (D2).
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Lossy Point Cloud Geometry Compression via End-to-End Learning
    Wang, Jianqiang
    Zhu, Hao
    Liu, Haojie
    Ma, Zhan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4909 - 4923
  • [2] MULTI-SCALE END-TO-END LEARNING FOR POINT CLOUD GEOMETRY COMPRESSION
    Xu, Yiqun
    Yin, Qian
    Wang, Shanshe
    Zhang, Xinfeng
    Ma, Siwei
    Gao, Wen
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2107 - 2111
  • [3] An End-to-End ConvLSTM-based Method for Point Cloud Streaming Compression
    Long, Junhua
    Feng, Mingjian
    Li, Boyang
    Ling, Yehua
    Wu, Chenghao
    Huang, Kai
    Cui, Mingyue
    2024 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS, ICARM 2024, 2024, : 382 - 387
  • [4] CCAG: End-to-End Point Cloud Registration
    Wang, Yong
    Zhou, Pengbo
    Geng, Guohua
    An, Li
    Liu, Yangyang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 435 - 442
  • [5] END-TO-END LEARNED IMAGE COMPRESSION WITH CONDITIONAL LATENT SPACE MODELING FOR ENTROPY CODING
    Yesilyurt, Aziz Berkay
    Kamisli, Fatih
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 501 - 505
  • [6] A survey on end-to-end point cloud learning
    Tang, Xikai
    Huang, Fangzheng
    Li, Chao
    Ban, Dayan
    IET IMAGE PROCESSING, 2023, 17 (05) : 1307 - 1321
  • [7] End-to-end point cloud registration with transformer
    Wang, Yong
    Zhou, Pengbo
    Geng, Guohua
    An, Li
    Zhang, Qi
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 58 (01)
  • [8] POINT CLOUD GEOMETRY SCALABLE CODING WITH A SINGLE END-TO-END DEEP LEARNING MODEL
    Guarda, Andre F. R.
    Rodrigues, Nuno M. M.
    Pereira, Fernando
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3354 - 3358
  • [9] AN END-TO-END SYSTEM FOR REAL-TIME DYNAMIC POINT CLOUD VISUALIZATION
    Hofer, Hansjoerg
    Seitner, Florian
    Gelautz, Margrit
    2018 INTERNATIONAL CONFERENCE ON 3D IMMERSION (IC3D), 2018,
  • [10] REGTR: End-to-end Point Cloud Correspondences with Transformers
    Yew, Zi Jian
    Lee, Gim Hee
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6667 - 6676