Text2City: One-Stage Text-Driven Urban Layout Regeneration

Authors
Qin, Yiming [1 ,2 ]
Zhao, Nanxuan [3 ]
Sheng, Bin [1 ]
Lau, Rynson W. H. [2 ]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] City Univ Hong Kong, Hong Kong, Peoples R China
[3] Adobe Res, San Jose, CA USA
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Regenerating urban layouts is an essential part of urban regeneration. In this paper, we propose a new task called text-driven urban layout regeneration, which gives users an intuitive input modality, text, for specifying the regeneration instead of designing complex rules. Given the target region to be regenerated, we propose a one-stage text-driven urban layout regeneration model, Text2City, which jointly and progressively regenerates the urban layout (i.e., road and building layouts) based on textual layout descriptions and the surrounding context (i.e., urban layouts and functions of the surrounding regions). Text2City first extracts road and building attributes from the textual layout description to guide the regeneration. It includes a novel one-stage joint regenerator network based on conditional denoising diffusion probabilistic models (DDPMs) and prior knowledge exchange. To harmonize the regenerated layouts through joint optimization, we propose an interactive & enhanced guidance module for self-enhancement and prior knowledge exchange between road and building layouts during the regeneration. We also design a series of constraints at the attribute, geometry, and pixel levels to ensure rational urban layout generation. To train our model, we build a large-scale dataset containing urban layouts and layout descriptions, covering 147K regions. Qualitative and quantitative evaluations show that our proposed method outperforms the baseline methods in regenerating desirable urban layouts that meet the textual descriptions.
Pages: 4578-4586
Number of pages: 9