Flora: Dual-Frequency LOss-Compensated ReAl-Time Monocular 3D Video Reconstruction

被引:0
|
作者
Wang, Likang [1 ]
Gong, Yue [3 ]
Wang, Qirui [3 ]
Zhou, Kaixuan [4 ,5 ]
Chen, Lei [1 ,2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Data Sci & Analyt Thrust, Guangzhou, Peoples R China
[3] Huawei Technol, Distributed & Parallel Software Lab, Shenzhen, Peoples R China
[4] Huawei Technol, Riemann Lab, Shenzhen, Peoples R China
[5] Huawei Technol, Fundamental Software Innovat Lab, Shenzhen, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we propose a real-time monocular 3D video reconstruction approach named Flora for reconstructing delicate and complete 3D scenes from RGB video sequences in an end-to-end manner. Specifically, we introduce a novel method with two main contributions. Firstly, the proposed feature aggregation module retains both color and reliability in a dual-frequency form. Secondly, the loss compensation module solves missing structure by correcting losses for falsely pruned voxels. The dual-frequency feature aggregation module enhances reconstruction quality in both precision and recall, and the loss compensation module benefits the recall. Notably, both proposed contributions achieve great results with negligible inferencing overhead. Our state-of-the-art experimental results on real-world datasets demonstrate Flora's leading performance in both effectiveness and efficiency. The code is available at https://github.com/NoOneUST/Flora.
引用
收藏
页码:2599 / 2607
页数:9
相关论文
共 50 条
  • [1] NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video
    Sun, Jiaming
    Xie, Yiming
    Chen, Linghao
    Zhou, Xiaowei
    Bao, Hujun
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15593 - 15602
  • [2] Real-Time 3D Pose Reconstruction of Human Body from Monocular Video Sequences
    Zhu, LiangJia
    Hwang, Jenq-Neng
    Chen, Chih-Chang
    Lin, Ming-Hui
    Yen, Chen-Lan
    [J]. ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 717 - +
  • [3] Real-time 3D features reconstruction through monocular vision
    Liverani, Alfredo
    Leali, Francesco
    Pellicciari, Marcello
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2010, 4 (02): : 103 - 112
  • [4] Real-Time 3D Reconstruction Method Based on Monocular Vision
    Jia, Qingyu
    Chang, Liang
    Qiang, Baohua
    Zhang, Shihao
    Xie, Wu
    Yang, Xianyi
    Sun, Yangchang
    Yang, Minghao
    [J]. SENSORS, 2021, 21 (17)
  • [5] Real-time active 3D shape reconstruction for 3D video
    Wu, X
    Matsuyama, T
    [J]. ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 186 - 191
  • [6] Detailed Real-Time Urban 3D Reconstruction from Video
    M. Pollefeys
    D. Nistér
    J.-M. Frahm
    A. Akbarzadeh
    P. Mordohai
    B. Clipp
    C. Engels
    D. Gallup
    S.-J. Kim
    P. Merrell
    C. Salmi
    S. Sinha
    B. Talton
    L. Wang
    Q. Yang
    H. Stewénius
    R. Yang
    G. Welch
    H. Towles
    [J]. International Journal of Computer Vision, 2008, 78 : 143 - 167
  • [7] Detailed real-time urban 3D reconstruction from video
    Pollefeys, M.
    Nister, D.
    Frahm, J. -M.
    Akbarzadeh, A.
    Mordohai, P.
    Clipp, B.
    Engels, C.
    Gallup, D.
    Kim, S. -J.
    Merrell, P.
    Salmi, C.
    Sinha, S.
    Talton, B.
    Wang, L.
    Yang, Q.
    Stewenius, H.
    Yang, R.
    Welch, G.
    Towles, H.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2008, 78 (2-3) : 143 - 167
  • [8] Real-Time 3-D Measurement With Dual-Frequency Fringes by Deep Learning
    Shen, Siyuan
    Lu, Rongsheng
    Wan, Dahang
    Yin, Jiajie
    He, Pan
    [J]. IEEE SENSORS JOURNAL, 2024, 24 (10) : 16576 - 16586
  • [9] Mobile3DRecon: Real-time Monocular 3D Reconstruction on a Mobile Phone
    Yang, Xingbin
    Zhou, Liyang
    Jiang, Hanqing
    Tang, Zhongliang
    Wang, Yuanbo
    Bao, Hujun
    Zhang, Guofeng
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (12) : 3446 - 3456
  • [10] Near real-time 3D reconstruction from InIm video stream
    Chaikalis, D.
    Passalis, G.
    Sgouros, N.
    Maroulis, D.
    Theoharis, T.
    [J]. IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2008, 5112 : 336 - 347