Music Conditioned Generation for Human-Centric Video

被引:0
|
作者
Zhao, Zimeng [1 ]
Zuo, Binghui [1 ]
Wang, Yangang [1 ]
机构
[1] Southeast Univ, Sch Automat, Key Lab Measurement & Control Complex Syst Engn, Minist Educ, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiple signal classification; Generative adversarial networks; Correlation; Visualization; Training; Task analysis; Feature extraction; Video generation; signal processing; cross-modal learning; human-centric;
D O I
10.1109/LSP.2024.3358978
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Music and human-centric video are two fundamental signals across languages. Correlation analysis between the two is currently used in choreography and film accompaniment. This letter explores this correlation in a new task: human-centric video generation from a start-end image pair and transitional music. Existing human-centric generation methods are not competent for this task because they require frame-wise pose as input or have difficulty handling long-duration videos. Our key idea is to build a temporal generation framework dominated by DDPM and assisted by VAE and GAN. It reduces the computational cost of music-image diffusion by utilizing the latent space compactness of VAE and the image translation efficiency of GAN. To produce videos with both long duration and high quality, our framework first generates small-scale keyframes and then generates high-resolution videos. To strengthen the frame-wise consistency of the human body, a frame-aligned correspondence map is adopted as an intermediate supervision. Extensive experiments compared with the SOTA method have demonstrated the rationality and effectiveness of this signal generation framework.
引用
收藏
页码:506 / 510
页数:5
相关论文
共 50 条
  • [41] Toward a Human-Centric Approach to Cybersecurity
    Deibert, Ronald J.
    ETHICS & INTERNATIONAL AFFAIRS, 2018, 32 (04) : 411 - 424
  • [42] Collaboration for human-centric eGovernment workflows
    Gaaloul, Khaled
    Charoy, Francois
    Schaad, Andreas
    Lee, Hannah
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007 WORKSHOPS, 2007, 4832 : 201 - +
  • [43] Delegation Protocols in Human-Centric Workflows
    Gaaloul, Khaled
    Proper, H. A.
    Charoy, Francois
    13TH IEEE INTERNATIONAL CONFERENCE ON COMMERCE AND ENTERPRISE COMPUTING (CEC 2011), 2011, : 219 - 224
  • [44] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
    Kim, Gwanghyun
    Kim, Hayeon
    Seo, Hoigi
    Kang, Dong Un
    Chun, Se Young
    arXiv,
  • [45] Implementation of a Human-Centric GUI for Next-Generation Intensive Care Unit
    Lu, Tsung-Che
    Hsu, Shang-Hwa
    Tzeng, Sing-Jia
    Chang, Che-Ming
    Van, Lan-Da
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2014,
  • [46] A new Human-centric Factory Model
    May, Goekan
    Taisch, Marco
    Bettoni, Andrea
    Maghazei, Omid
    Matarazzo, Annarita
    Stahl, Bojan
    12TH GLOBAL CONFERENCE ON SUSTAINABLE MANUFACTURING - EMERGING POTENTIALS, 2015, 26 : 103 - 108
  • [47] Human-Centric Situational Awareness in the Bedroom
    Yen, Yu Chun
    Li, Jiun-Yi
    Lu, Ching Hu
    Yang, Tsung Han
    Fu, Li Chen
    TOWARD USEFUL SERVICES FOR ELDERLY AND PEOPLE WITH DISABILITIES, 2011, 6719 : 72 - 79
  • [48] A futuristic perspective on human-centric assembly
    Wang, Lihui
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 62 : 199 - 201
  • [49] ZOOM IN TO THE DETAILS OF HUMAN-CENTRIC VIDEOS
    Li, Guanghan
    Zhao, Yaping
    Ji, Mengqi
    Yuan, Xiaoyun
    Fang, Lu
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3089 - 3093
  • [50] Human-centric images and videos analysis
    Liu, Si
    Ni, Bingbing
    Lin, Liang
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1331 - 1332