Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction

被引:0
|
作者
Wang, Likang [1 ]
Gong, Yue [3 ]
Wang, Qirui [3 ]
Zhou, Kaixuan [4 ,5 ]
Chen, Lei [1 ,2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Data Sci & Analyt Thrust, Guangzhou, Peoples R China
[3] Huawei Technol, Distributed & Parallel Software Lab, Shenzhen, Peoples R China
[4] Huawei Technol, Riemann Lab, Shenzhen, Peoples R China
[5] Huawei Technol, Fundamental Software Innovat Lab, Shenzhen, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a real-time monocular 3D video reconstruction approach named Flora for reconstructing delicate and complete 3D scenes from RGB video sequences in an end-to-end manner. Specifically, we introduce a novel method with two main contributions. Firstly, the proposed feature aggregation module retains both color and reliability in a dual-frequency form. Secondly, the loss compensation module solves missing structure by correcting losses for falsely pruned voxels. The dual-frequency feature aggregation module enhances reconstruction quality in both precision and recall, and the loss compensation module benefits the recall. Notably, both proposed contributions achieve great results with negligible inferencing overhead. Our state-of-the-art experimental results on real-world datasets demonstrate Flora's leading performance in both effectiveness and efficiency. The code is available at https://github.com/NoOneUST/Flora.
引用
收藏
页码:2599 / 2607
页数:9
相关论文
共 50 条
  • [21] DenseMatch: a dataset for real-time 3D reconstruction
    Lombardi, Marco
    Savardi, Mattia
    Signoroni, Alberto
    [J]. DATA IN BRIEF, 2021, 39
  • [22] Real-Time Active Multiview 3D Reconstruction
    Ide, Kai
    Sikora, Thomas
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTER VISION IN REMOTE SENSING, 2012, : 203 - 208
  • [23] Real-time 3D cone beam reconstruction
    Stsepankou, D
    Kornmesser, K
    Hesser, J
    Männer, R
    [J]. 2004 IEEE NUCLEAR SCIENCE SYMPOSIUM CONFERENCE RECORD, VOLS 1-7, 2004, : 3648 - 3652
  • [24] Real-time monocular 3D perception with ORB-Features
    Ji, Babing
    Cao, Qixin
    [J]. INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2018, 45 (06): : 776 - 783
  • [25] Real-time 3D shape reconstruction, dynamic 3D mesh deformation, and high fidelity visualization for 3D video
    Matsuyama, T
    Wu, X
    Takai, T
    Nobuhara, S
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (03) : 393 - 434
  • [26] Real-Time 3-D Measurement Based on Dual-Frequency Hierarchical and Time-Interleaved Fringe Projection
    Wei, Zhimi
    Cao, Yiping
    Li, Chengmeng
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [27] Cross-Dimensional Refined Learning for Real-Time 3D Visual Perception from Monocular Video
    Hong, Ziyang
    Yue, C. Patrick
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2161 - 2170
  • [28] Real-time 2D to 3D video conversion
    Ideses, Ianir
    Yaroslavsky, Leonid P.
    Fishbain, Barak
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2007, 2 (01) : 3 - 9
  • [29] Real-time 2D to 3D video conversion
    Ianir Ideses
    Leonid P. Yaroslavsky
    Barak Fishbain
    [J]. Journal of Real-Time Image Processing, 2007, 2 : 3 - 9
  • [30] Corrective 3D Reconstruction of Lips from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Wu, Chenglei
    Bradley, Derek
    Perez, Patrick
    Beeler, Thabo
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):