Text2City: One-Stage Text-Driven Urban Layout Regeneration

Cited by: 0

Authors:

Qin, Yiming ^{[1,2]}

Zhao, Nanxuan ^{[3]}

Sheng, Bin ^{[1]}

Lau, Rynson W. H. ^{[2]}

Affiliations:

[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

[2] City Univ Hong Kong, Hong Kong, Peoples R China

[3] Adobe Res, San Jose, CA USA

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5 | 2024

Funding:

National Natural Science Foundation of China

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Regenerating urban layouts is an essential part of urban regeneration. In this paper, we propose a new task, text-driven urban layout regeneration, which gives users an intuitive input modality (text) for specifying the regeneration, instead of requiring them to design complex rules. Given the target region to be regenerated, we propose a one-stage text-driven urban layout regeneration model, Text2City, which jointly and progressively regenerates the urban layout (i.e., road and building layouts) based on textual layout descriptions and the surrounding context (i.e., the urban layouts and functions of the surrounding regions). Text2City first extracts road and building attributes from the textual layout description to guide the regeneration. It includes a novel one-stage joint regenerator network based on conditioned denoising diffusion probabilistic models (DDPMs) and prior knowledge exchange. To harmonize the regenerated layouts through joint optimization, we propose an interactive & enhanced guidance module for self-enhancement and prior knowledge exchange between road and building layouts during regeneration. We also design a series of constraints at the attribute, geometry, and pixel levels to ensure rational urban layout generation. To train our model, we build a large-scale dataset of urban layouts and layout descriptions, covering 147K regions. Qualitative and quantitative evaluations show that our method outperforms the baseline methods in regenerating desirable urban layouts that match the textual descriptions.
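
The abstract names conditioned DDPMs as the basis of the regenerator but gives no formulation. As a rough illustration only, the sketch below shows the generic DDPM forward (noising) step, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps, applied to a target region while the surrounding context is left untouched. It is not the Text2City implementation: the linear beta schedule, the rasterized grid layout, the binary mask, and all function names are assumptions made for this example.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed schedule)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    out, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        out.append(prod)
    return out


def noise_target_region(layout, mask, t, alpha_bars, rng):
    """Sample x_t ~ q(x_t | x_0) inside the masked target region only.

    layout: 2D list of floats, a hypothetical rasterized layout (x_0)
    mask:   2D list of 0/1, where 1 marks cells to be regenerated
    Cells outside the mask (the surrounding context) are returned unchanged.
    """
    signal = math.sqrt(alpha_bars[t])
    sigma = math.sqrt(1.0 - alpha_bars[t])
    noisy = []
    for row, mask_row in zip(layout, mask):
        noisy.append([
            (signal * x + sigma * rng.gauss(0.0, 1.0)) if m else x
            for x, m in zip(row, mask_row)
        ])
    return noisy


if __name__ == "__main__":
    rng = random.Random(0)
    alpha_bars = cumulative_alphas(linear_beta_schedule(1000))
    layout = [[1.0, 0.0], [0.0, 1.0]]   # toy 2x2 layout
    mask = [[1, 0], [0, 0]]             # regenerate the top-left cell only
    print(noise_target_region(layout, mask, 999, alpha_bars, rng))
```

In a conditional setup of this kind, a denoising network would be trained to reverse these steps given the text attributes and the unmasked context; that network is outside the scope of this sketch.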


Pages: 4578-4586

Page count: 9
