PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models

被引:0
|
作者
Chen, Jiacheng [1 ]
Deng, Ruizhi [1 ]
Furukawa, Yasutaka [1 ]
机构
[1] Simon Fraser Univ, Burnaby, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents PolyDiffuse, a novel structured reconstruction algorithm that transforms visual sensor data into polygonal shapes with Diffusion Models (DM), an emerging machinery amid exploding generative AI, while formulating reconstruction as a generation process conditioned on sensor data. The task of structured reconstruction poses two fundamental challenges to DM: 1) A structured geometry is a "set" (e.g., a set of polygons for a floorplan geometry), where a sample of N elements has N! different but equivalent representations, making the denoising highly ambiguous; and 2) A "reconstruction" task has a single solution, where an initial noise needs to be chosen carefully, while any initial noise works for a generation task. Our technical contribution is the introduction of a Guided Set Diffusion Model where 1) the forward diffusion process learns guidance networks to control noise injection so that one representation of a sample remains distinct from its other permutation variants, thus resolving denoising ambiguity; and 2) the reverse denoising process reconstructs polygonal shapes, initialized and directed by the guidance networks, as a conditional generation process subject to the sensor data. We have evaluated our approach for reconstructing two types of polygonal shapes: floorplan as a set of polygons and HD map for autonomous cars as a set of polylines. Through extensive experiments on standard benchmarks, we demonstrate that PolyDiffuse significantly advances the current state of the art and enables broader practical applications. The code and data are available on our project page: https://poly-diffuse.github.io.
引用
收藏
页数:26
相关论文
共 50 条
  • [21] Zero-Shot Medical Image Translation via Frequency-Guided Diffusion Models
    Li, Yunxiang
    Shao, Hua-Chieh
    Liang, Xiao
    Chen, Liyuan
    Li, Ruiqi
    Jiang, Steve
    Wang, Jing
    Zhang, You
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (03) : 980 - 993
  • [22] Highly accelerated MRI via implicit neural representation guided posterior sampling of diffusion models
    Chu, Jiayue
    Du, Chenhe
    Lin, Xiyue
    Zhang, Xiaoqun
    Wang, Lihui
    Zhang, Yuyao
    Wei, Hongjiang
    MEDICAL IMAGE ANALYSIS, 2025, 100
  • [23] Semi-Supervised CT Denoising via Text-Guided Mamba Diffusion Models
    Su, Bo
    Hu, Xiangyun
    Xu, Jiabo
    Deng, Kai
    Zha, Yunfei
    Wan, Jun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [24] Point set silhouettes via local reconstruction
    Olson, Matt
    Dyer, Ramsay
    Zhang, Hao
    Sheffer, Alla
    COMPUTERS & GRAPHICS-UK, 2011, 35 (03): : 500 - 509
  • [25] MR image reconstruction via guided filter
    Heyan Huang
    Hang Yang
    Kang Wang
    Medical & Biological Engineering & Computing, 2018, 56 : 635 - 648
  • [26] MR image reconstruction via guided filter
    Huang, Heyan
    Yang, Hang
    Wang, Kang
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2018, 56 (04) : 635 - 648
  • [27] A framework for the construction of level set methods for shape optimization and reconstruction
    Burger, M
    INTERFACES AND FREE BOUNDARIES, 2003, 5 (03) : 301 - 329
  • [28] iShapEditing: Intelligent Shape Editing with Diffusion Models
    Li, Jing
    Zhang, Juyong
    Chen, Falai
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [29] Shape reconstruction using the level set method for microwave applications
    Hajihashemi, Mohammad Reza
    El-Shenawee, Magda
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2008, 7 : 92 - 96
  • [30] Positional diffusion: Graph-based diffusion models for set ordering
    Giuliari, Francesco
    Scarpellini, Gianluca
    Fiorini, Stefano
    James, Stuart
    Morerio, Pietro
    Wang, Yiming
    Del Bue, Alessio
    PATTERN RECOGNITION LETTERS, 2024, 186 : 272 - 278