Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models

Cited by: 1
Authors
Yu, Jason J. [1, 2]
Forghani, Fereshteh [1]
Derpanis, Konstantinos G. [1, 2]
Brubaker, Marcus A. [1, 2]
Affiliations
[1] York Univ, Toronto, ON, Canada
[2] Vector Inst AI, Toronto, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC)
DOI
10.1109/ICCV51070.2023.00653
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Novel view synthesis from a single input image is a challenging task, where the goal is to generate a new view of a scene from a desired camera pose that may be separated by a large motion. The highly uncertain nature of this synthesis task due to unobserved elements within the scene (i.e. occlusion) and outside the field-of-view makes the use of generative models appealing to capture the variety of possible outputs. In this paper, we propose a novel generative model capable of producing a sequence of photorealistic images consistent with a specified camera trajectory, and a single starting image. Our approach is centred on an autoregressive conditional diffusion-based model capable of interpolating visible scene elements, and extrapolating unobserved regions in a view, in a geometrically consistent manner. Conditioning is limited to an image capturing a single camera view and the (relative) pose of the new camera view. To measure the consistency over a sequence of generated views, we introduce a new metric, the thresholded symmetric epipolar distance (TSED), to measure the number of consistent frame pairs in a sequence. While previous methods have been shown to produce high quality images and consistent semantics across pairs of views, we show empirically with our metric that they are often inconsistent with the desired camera poses. In contrast, we demonstrate that our method produces both photorealistic and view-consistent imagery. Additional material is available on our project page: https://yorkucvil.github.io/Photoconsistent-NVS/.
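The abstract names the thresholded symmetric epipolar distance (TSED) but does not spell out how it is computed. The following is a minimal sketch, not the authors' implementation. It assumes that the symmetric epipolar distance (SED) of a match is the average of the two point-to-epipolar-line distances, that a frame pair counts as consistent when it has at least `t_matches` feature matches and their median SED is below `t_error` pixels, and that both views share the intrinsics `K`. The function names and the default threshold values are hypothetical and chosen only for illustration; the feature matches `x1`, `x2` are assumed to come from any off-the-shelf matcher.

```python
import numpy as np

def skew(t):
    """Return the 3x3 cross-product matrix [t]_x of a 3-vector t."""
    return np.array([[0.0, -t[2], t[1]],
                     [t[2], 0.0, -t[0]],
                     [-t[1], t[0], 0.0]])

def fundamental_from_pose(K, R, t):
    """Fundamental matrix F with x2^T F x1 = 0, for the relative pose
    X2 = R @ X1 + t and shared intrinsics K (an assumption of this sketch)."""
    K_inv = np.linalg.inv(K)
    return K_inv.T @ skew(t) @ R @ K_inv

def point_line_distance(lines, pts_h):
    """Distance from homogeneous points (N, 3, last coordinate 1)
    to the lines (N, 3) given as (a, b, c) with a*x + b*y + c = 0."""
    num = np.abs(np.sum(lines * pts_h, axis=1))
    return num / np.linalg.norm(lines[:, :2], axis=1)

def symmetric_epipolar_distance(F, x1, x2):
    """Per-match SED in pixels for matched points x1, x2 of shape (N, 2)."""
    x1h = np.hstack([x1, np.ones((len(x1), 1))])
    x2h = np.hstack([x2, np.ones((len(x2), 1))])
    d2 = point_line_distance(x1h @ F.T, x2h)  # x2 against the line F x1
    d1 = point_line_distance(x2h @ F, x1h)    # x1 against the line F^T x2
    return 0.5 * (d1 + d2)                    # averaging is an assumed convention

def pair_is_consistent(F, x1, x2, t_error=2.0, t_matches=10):
    """Assumed TSED test for one frame pair (thresholds are placeholders)."""
    if len(x1) < t_matches:
        return False
    return float(np.median(symmetric_epipolar_distance(F, x1, x2))) < t_error
```

Under these assumptions, a sequence-level score is the number (or fraction) of adjacent generated frame pairs for which `pair_is_consistent` returns true, with `F` built from the camera poses that were requested during generation.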
Pages: 7071-7081
Number of pages: 11