Motion to Dance Music Generation using Latent Diffusion Model

被引:2
|
作者
Tan, Vanessa [1 ]
Nam, JungHyun [1 ]
Nam, Juhan [1 ]
Noh, Junyong [1 ]
机构
[1] KAIST GSCT, Daejeon, South Korea
基金
新加坡国家研究基金会;
关键词
3D motion to music; music generation; latent diffusion model;
D O I
10.1145/3610543.3626164
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The role of music in games and animation, particularly in dance content, is essential for creating immersive and entertaining experiences. Although recent studies have made strides in generating dance music from videos, their practicality in integrating music into games and animation remains limited. In this context, we present a method capable of generating plausible dance music from 3D motion data and genre labels. Our approach leverages a combination of a UNET-based latent diffusion model and a pre-trained VAE model. To evaluate the performance of the proposed model, we employ evaluation metrics to assess various audio properties, including beat alignment, audio quality, motion-music correlation, and genre score. The quantitative results show that our approach outperforms previous methods. Furthermore, we demonstrate that our model can generate audio that seamlessly fits to in-the-wild motion data. This capability enables us to create plausible dance music that complements dynamic movements of characters and enhances overall audiovisual experience in interactive media. Examples from our proposed model are available at this link: https://dmdproject.github.io/.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer
    Li, Buyu
    Zhao, Yongchi
    Shi, Zhelun
    Sheng, Lu
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1272 - 1279
  • [22] Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model
    Song, Wenfeng
    Jin, Xingliang
    Li, Shuai
    Chen, Chenglizhao
    Hao, Aimin
    Hou, Xia
    Li, Ning
    Qin, Hong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 821 - 830
  • [23] External Validation of a Latent Diffusion Model for Brain Imaging Generation
    Boscutti, Andrea
    Mwangi, Benson
    Wu, Mon-Ju
    Zunta-Soares, Giovana B.
    Hasan, Khader
    Sores, Jair C.
    NEUROPSYCHOPHARMACOLOGY, 2024, 49 : 86 - 87
  • [24] Music-Driven Dance Generation
    Qi, Yu
    Liu, Yazhou
    Sun, Quansen
    IEEE ACCESS, 2019, 7 : 166540 - 166550
  • [25] Dance Dance Generation: Motion Transfer for Internet Videos
    Zhou, Yipin
    Wang, Zhaowen
    Fang, Chen
    Bui, Trung
    Berg, Tamara L.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1208 - 1216
  • [26] Artificial intelligence using a latent diffusion model enables the generation of diverse and potent antimicrobial peptides
    Wang, Yeji
    Song, Minghui
    Liu, Fujing
    Liang, Zhen
    Hong, Rui
    Dong, Yuemei
    Luan, Huaizu
    Fu, Xiaojie
    Yuan, Wenchang
    Fang, Wenjie
    Li, Gang
    Lou, Hongxiang
    Chang, Wenqiang
    SCIENCE ADVANCES, 2025, 11 (06):
  • [27] Latent Diffusion for Language Generation
    Lovelace, Justin
    Kishore, Varsha
    Wan, Chao
    Shekhtman, Eliot
    Weinberger, Kilian Q.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [28] Automatic Generation of Dance and Facial Expressions Linked to Music using HMM
    Sato, Taiki
    Osana, Yuko
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3999 - 4006
  • [29] PLay: Parametrically Conditioned Layout Generation using Latent Diffusion
    Cheng, Chin-Yi
    Huang, Forrest
    Li, Gang
    Li, Yang
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [30] Diffusion of Ecstasy in the Electronic Dance Music Scene
    Palamar, Joseph J.
    SUBSTANCE USE & MISUSE, 2020, 55 (13) : 2243 - 2250