3D scene reconstruction from monocular spherical video with motion parallax

被引:1
|
作者
Tanaka, Kenji [1 ]
机构
[1] Fprime Project, Washington, DC 20546 USA
关键词
Computing methodologies; Artificial intelligence; Computer vision; Computer vision problems;
D O I
10.1109/ISMAR-Adjunct57072.2022.00043
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we describe a method to capture nearly entirely spherical (360 degree) depth information using two adjacent frames from a single spherical video with motion parallax. After illustrating a spherical depth information retrieval using two spherical cameras, we demonstrate monocular spherical stereo by using stabilized first-person video footage. Experiments demonstrated that the depth information was retrieved on up to 97% of the entire sphere in solid angle. At a speed of 30 km/h, we were able to estimate the depth of an object located over 30 m from the camera. We also reconstructed the 3D structures (point cloud) using the obtained depth data and confirmed the structures can be clearly observed. We can apply this method to 3D structure retrieval of surrounding environments such as 1) previsualization, location hunting/planning of a film, 2) real scene/computer graphics synthesis and 3) motion capture. Thanks to its simplicity, this method can be applied to various videos. As there is no pre-condition other than to be a 360 video with motion parallax, we can use any 360 videos including those on the Internet to reconstruct the surrounding environments. The cameras can be lightweight enough to be mounted on a drone. We also demonstrated such applications.
引用
收藏
页码:191 / 197
页数:7
相关论文
共 50 条
  • [1] 3D Reconstruction of Human Motion and Skeleton from Uncalibrated Monocular Video
    Chen, Yen-Lin
    Chai, Jinxiang
    [J]. COMPUTER VISION - ACCV 2009, PT I, 2010, 5994 : 71 - 82
  • [2] Towards robust 3D reconstruction of human motion from monocular video
    Chen, Cheng
    Zhuang, Yueting
    Xiao, Jun
    [J]. Advances in Artificial Reality and Tele-Existence, Proceedings, 2006, 4282 : 594 - 603
  • [3] NeuralRecon: Real-Time Coherent 3D Scene Reconstruction From Monocular Video
    Chen, Xi
    Sun, Jiaming
    Xie, Yiming
    Bao, Hujun
    Zhou, Xiaowei
    [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 46 (12) : 7542 - 7555
  • [4] MotioNet: 3D Human motion reconstruction from monocular video with skeleton consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    [J]. 1600, Association for Computing Machinery (40):
  • [5] MotioNet: 3D Human Motion Reconstruction from Monocular Video with Skeleton Consistency
    Shi, Mingyi
    Aberman, Kfir
    Aristidou, Andreas
    Komura, Taku
    Lischinski, Dani
    Cohen-Or, Daniel
    Chen, Baoquan
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (01):
  • [6] Corrective 3D Reconstruction of Lips from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Wu, Chenglei
    Bradley, Derek
    Perez, Patrick
    Beeler, Thabo
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (06):
  • [7] 3D Motion and Skeleton Construction from Monocular Video
    Azmi, Nik Mohammad Wafiy
    Albakri, Ikmal Faiq
    Suaib, Norhaida Mohd
    Rahim, Mohd Shafry Mohd
    Yu, Hongchuan
    [J]. COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 75 - 84
  • [8] Monocular 3D scene reconstruction at absolute scale
    Woehler, Christian
    d'Angelo, Pablo
    Krueger, Lars
    Kuhl, Annika
    Gross, Horst-Michael
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2009, 64 (06) : 529 - 540
  • [9] Reconstruction of Personalized 3D Face Rigs from Monocular Video
    Garrido, Pablo
    Zollhoefer, Michael
    Casas, Dan
    Valgaerts, Levi
    Varanasi, Kiran
    Perez, Patrick
    Theobalt, Christian
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (03):
  • [10] Joint Semantic Segmentation and 3D Reconstruction from Monocular Video
    Kundu, Abhijit
    Li, Yin
    Dellaert, Frank
    Li, Fuxin
    Rehg, James M.
    [J]. COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 703 - 718