Text-Guided Synthesis of Crowd Animation

被引:1
|
作者
Ji, Xuebo [1 ]
Pan, Zherong [2 ]
Gao, Xifeng [2 ]
Pan, Jia [1 ]
机构
[1] Univ Hong Kong, Ctr Transformat Garment Prod TransGP, Hong Kong, Peoples R China
[2] LightSpeed Studios, Seattle, WA USA
关键词
Diffusion Model; Multi-Agent Navigation; Collision Avoidance; Crowd Simulation;
D O I
10.1145/3641519.3657516
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Creating vivid crowd animations is core to immersive virtual environments in digital games. This work focuses on tackling the challenges of the crowd behavior generation problem. Existing approaches are labor-intensive, relying on practitioners to manually craft the complex behavior systems. We propose a machine learning approach to synthesize diversified dynamic crowd animation scenarios for a given environment based on a text description input. We first train two conditional diffusion models that generate text-guided agent distribution fields and velocity fields. Assisted by local navigation algorithms, the fields are then used to control multiple groups of agents. We further employ Large-Language Model (LLM) to canonicalize the general script into a structured sentence for more stable training and better scalability. To train our diffusion models, we devise a constructive method to generate random environments and crowd animations. We show that our trained diffusion models can generate crowd animations for both unseen environments and novel scenario descriptions. Our method paves the way towards automatic generating of crowd behaviors for virtual environments. Code and data for this paper are available at: https://github.com/MLZG/Text-Crowd.git.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] LivePhoto: Real Image Animation with Text-Guided Motion Control
    Chen, Xi
    Liu, Zhiheng
    Chen, Mengting
    Feng, Yutong
    Liu, Yu
    Shen, Yujun
    Zhao, Hengshuang
    COMPUTER VISION-ECCV 2024, PT XVIII, 2025, 15076 : 475 - 491
  • [2] Text-Guided Synthesis of Eulerian Cinemagraphs
    Mahapatra, Aniruddha
    Siarohin, Aliaksandr
    Lee, Hsin-Ying
    Tulyakov, Sergey
    Zhu, Jun-Yan
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (06):
  • [3] Text-Guided Customizable Image Synthesis and Manipulation
    Zhang, Zhiqiang
    Fu, Chen
    Weng, Wei
    Zhou, Jinjia
    APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [4] Text-Guided Synthesis of Masked Face Images
    Anjali, T.
    Masilamani, V.
    ACM Transactions on Multimedia Computing, Communications and Applications, 2024, 21 (01)
  • [5] Text-Guided Sketch-to-Photo Image Synthesis
    Osahor, Uche
    Nasrabadi, Nasser M.
    IEEE ACCESS, 2022, 10 : 98278 - 98289
  • [6] Text-Guided Image Inpainting
    Zhang, Zijian
    Zhao, Zhou
    Zhang, Zhu
    Huai, Baoxing
    Yuan, Jing
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4079 - 4087
  • [7] IconShop: Text-Guided Vector Icon Synthesis with Autoregressive Transformers
    Wu, Ronghuan
    Su, Wanchao
    Ma, Kede
    Liao, Jing
    ACM TRANSACTIONS ON GRAPHICS, 2023, 42 (06):
  • [8] Benchmarking Robustness to Text-Guided Corruptions
    Mofayezi, Mohammadreza
    Medghalchi, Yasamin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 779 - 786
  • [9] Text-Guided Vector Graphics Customization
    Zhang, Peiying
    Zhao, Nanxuan
    Liao, Jing
    PROCEEDINGS OF THE SIGGRAPH ASIA 2023 CONFERENCE PAPERS, 2023,
  • [10] Text-Guided Automated Self Assessment
    Pirnay-Dummer, Pablo
    Ifenthaler, Dirk
    MULTIPLE PERSPECTIVES ON PROBLEM SOLVING AND LEARNING IN THE DIGITAL AGE, 2011, : 217 - 225