An end-to-end dynamic point cloud geometry compression in latent space

被引:1
|
作者
Jiang, Zhaoyi [1 ,4 ]
Wang, Guoliang [1 ,4 ]
Tam, Gary K. L. [2 ,4 ]
Song, Chao [1 ,4 ]
Li, Frederick W. B. [3 ,4 ]
Yang, Bailin [1 ,4 ]
机构
[1] Zhejiang Gongshang Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Swansea Univ, Dept Comp Sci, Skewen, Wales
[3] Univ Durham, Dept Comp Sci, Durham, England
[4] Zhejiang Gongshang Univ, Sch Stat & Math, Hangzhou 310018, Peoples R China
基金
中国国家自然科学基金;
关键词
Dynamic point clouds compression; Geometry encoding; Latent scene flow; Deep entropy model;
D O I
10.1016/j.displa.2023.102528
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic point clouds are widely used for 3D data representation in various applications such as immersive and mixed reality, robotics and autonomous driving. However, their irregularity and large scale make efficient compression and transmission a challenge. Existing methods require high bitrates to encode point clouds since temporal correlation is not well considered. This paper proposes an end-to-end dynamic point cloud compression network that operates in latent space, resulting in more accurate motion estimation and more effective motion compensation. Specifically, a multi-scale motion estimation network is introduced to obtain accurate motion vectors. Motion information computed at a coarser level is upsampled and warped to the finer level based on cost volume analysis for motion compensation. Additionally, a residual compression network is designed to mitigate the effects of noise and inaccurate predictions by encoding latent residuals, resulting in smaller conditional entropy and better results. The proposed method achieves an average 12.09% and 14.76% (D2) BD-Rate gain over state-of-the-art Deep Dynamic Point Cloud Compression (D-DPCC) in experimental results. Compared to V-PCC, our framework showed an average improvement of 81.29% (D1) and 77.57% (D2).
引用
收藏
页数:11
相关论文
共 50 条
  • [41] End-to-End Cloud Application Cloning With Ditto
    Liang, Hmingyu
    Gan, Yu
    Li, Yueying
    Torres, Carlos
    Dhanotia, Abhishek
    Ketkar, Mahesh
    Delimitrou, Christina
    IEEE MICRO, 2024, 44 (04) : 34 - 43
  • [42] LATENT SPACE SLICING FOR ENHANCED ENTROPY MODELING IN LEARNING-BASED POINT CLOUD GEOMETRY COMPRESSION
    Frank, Nicolas
    Lazzarotto, Davi
    Ebrahimi, Touradj
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4878 - 4882
  • [43] End-to-End Cell Recognition by Point Annotation
    Shui, Zhongyi
    Zhang, Shichuan
    Zhu, Chenglu
    Wang, Bingchuan
    Chen, Pingyi
    Zheng, Sunyi
    Yang, Lin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT IV, 2022, 13434 : 109 - 118
  • [44] VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
    Zhou, Yin
    Tuzel, Oncel
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4490 - 4499
  • [45] Multi-UAVs End-to-End Distributed Trajectory Generation Over Point Cloud Data
    Marino, Antonio
    Pacchierotti, Claudio
    Giordano, Paolo Robuffo
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (09): : 7629 - 7636
  • [46] End-to-End Mesh Reconstruction from Partial Point Cloud based on Continuous Implicit Function
    Yu, Jiawei
    Huang, Xiaoshui
    Chen, Tao
    Yao, Yazhou
    Wang, Qiong
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [47] The Effect of Impactor Geometry on End-to-End Pecan Cracking
    Jackson, Mark W.
    Langston, Cody M.
    Madsen, Leah E.
    Davis, R. Benjamin
    AGRIENGINEERING, 2024, 6 (03): : 2470 - 2480
  • [48] Differentiable Product Quantization for End-to-End Embedding Compression
    Chen, Ting
    Li, Lala
    Sun, Yizhou
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [49] Towards End-to-End Image Compression and Analysis with Transformers
    Bai, Yuanchao
    Yang, Xu
    Liu, Xianming
    Jiang, Junjun
    Wang, Yaowei
    Ji, Xiangyang
    Gao, Wen
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 104 - 112
  • [50] End-to-end video compression for surveillance and conference videos
    Wang, Shenhao
    Zhao, Yu
    Gao, Han
    Ye, Mao
    Li, Shuai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42713 - 42730