Text-Guided Synthesis of Crowd Animation

被引：1

作者：

Ji, Xuebo ^{[1
]}

Pan, Zherong ^{[2
]}

Gao, Xifeng ^{[2
]}

Pan, Jia ^{[1
]}

机构：

[1] Univ Hong Kong, Ctr Transformat Garment Prod TransGP, Hong Kong, Peoples R China

[2] LightSpeed Studios, Seattle, WA USA

来源：

PROCEEDINGS OF SIGGRAPH 2024 CONFERENCE PAPERS | 2024年

关键词：

Diffusion Model; Multi-Agent Navigation; Collision Avoidance; Crowd Simulation;

D O I：

10.1145/3641519.3657516

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Creating vivid crowd animations is core to immersive virtual environments in digital games. This work focuses on tackling the challenges of the crowd behavior generation problem. Existing approaches are labor-intensive, relying on practitioners to manually craft the complex behavior systems. We propose a machine learning approach to synthesize diversified dynamic crowd animation scenarios for a given environment based on a text description input. We first train two conditional diffusion models that generate text-guided agent distribution fields and velocity fields. Assisted by local navigation algorithms, the fields are then used to control multiple groups of agents. We further employ Large-Language Model (LLM) to canonicalize the general script into a structured sentence for more stable training and better scalability. To train our diffusion models, we devise a constructive method to generate random environments and crowd animations. We show that our trained diffusion models can generate crowd animations for both unseen environments and novel scenario descriptions. Our method paves the way towards automatic generating of crowd behaviors for virtual environments. Code and data for this paper are available at: https://github.com/MLZG/Text-Crowd.git.

引用

页数：11

共 50 条

[1] LivePhoto: Real Image Animation with Text-Guided Motion Control
Chen, Xi
Liu, Zhiheng
Chen, Mengting
Feng, Yutong
Liu, Yu
Shen, Yujun
Zhao, Hengshuang
COMPUTER VISION-ECCV 2024, PT XVIII, 2025, 15076 : 475 - 491
[2] Text-Guided Synthesis of Eulerian Cinemagraphs
Mahapatra, Aniruddha
Siarohin, Aliaksandr
Lee, Hsin-Ying
Tulyakov, Sergey
Zhu, Jun-Yan
ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (06):
[3] Text-Guided Customizable Image Synthesis and Manipulation
Zhang, Zhiqiang
Fu, Chen
Weng, Wei
Zhou, Jinjia
APPLIED SCIENCES-BASEL, 2022, 12 (20):
[4] Text-Guided Synthesis of Masked Face Images
Anjali, T.
Masilamani, V.
ACM Transactions on Multimedia Computing, Communications and Applications, 2024, 21 (01)
[5] Text-Guided Sketch-to-Photo Image Synthesis
Osahor, Uche
Nasrabadi, Nasser M.
IEEE ACCESS, 2022, 10 : 98278 - 98289
[6] Text-Guided Image Inpainting
Zhang, Zijian
Zhao, Zhou
Zhang, Zhu
Huai, Baoxing
Yuan, Jing
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4079 - 4087
[7] IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
Wu, Ronghuan
Su, Wanchao
Ma, Kede
Liao, Jing
ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (06):
[8] Benchmarking Robustness to Text-Guided Corruptions
Mofayezi, Mohammadreza
Medghalchi, Yasamin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 779 - 786
[9] Text-Guided Vector Graphics Customization
Zhang, Peiying
Zhao, Nanxuan
Liao, Jing
PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
[10] Text-Guided Automated Self Assessment
Pirnay-Dummer, Pablo
Ifenthaler, Dirk
MULTIPLE PERSPECTIVES ON PROBLEM SOLVING AND LEARNING IN THE DIGITAL AGE, 2011, : 217 - 225

← 1 2 3 4 5 →