Real-Time Emotion-Based Piano Music Generation Using Generative Adversarial Network (GAN)

被引:0
|
作者
Zheng, Lijun [1 ]
Li, Chenglong [2 ]
机构
[1] Ewha Womans Univ, Sch Mus, Seoul 03760, South Korea
[2] Qiannan Normal Coll Nationalities, Conservatory Mus & Dance, Duyun 558000, Guizhou, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Generative adversarial networks; Learning automata; Deep learning; Music; Instruments; Complexity theory; Computational modeling; Reinforcement learning; Real-time music generation; generative adversarial network; self-attention mechanism; reinforcement learning; learning automata; emotion-based music;
D O I
10.1109/ACCESS.2024.3414673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic creation of real-time, emotion-based piano music pieces remains a challenge for deep learning models. While Generative Adversarial Networks (GANs) have shown promise, existing methods can struggle with generating musically coherent pieces and often require complex manual configuration. This paper proposes a novel model called Learning Automata-based Self-Attention Generative Adversarial Network (LA-SAGAN) to address these limitations. The proposed model uses a Generative Adversarial Network (GAN), combined with Self-Attention (SA) mechanism to reach this goal. The benefits of using SA modules in GAN architecture is twofold: First, SA mechanism results in generating music pieces with homogenous structure, which means long-distance dependencies in generated outputs are considered. Second, the SA mechanism utilizes the emotional features of the input to produce output pieces. This results in generating music pieces with desired genre or theme. In order to control the complexity of the proposed model, and optimize its structure, a set of Learning Automata (LA) models have been used to determine the activity state of each SA module. To do this, an iterative algorithm based on cooperation of LAs is introduced which optimizes the model by deactivating unnecessary SA modules. The efficiency of the proposed model in generating piano music has been evaluated. Evaluations demonstrate LA-SAGAN's effectiveness: at least 14.47% improvement in entropy (diversity) and improvements in precision (at least 2.47%) and recall (at least 2.13%). Moreover, human evaluation confirms superior musical coherence and adherence to emotional cues.
引用
收藏
页码:87489 / 87500
页数:12
相关论文
共 50 条
  • [1] Multitrack Emotion-Based Music Generation Network Using Continuous Symbolic Features
    Zhang, Donghui
    Li, Xiaobing
    Lu, Di
    Tie, Yun
    Gao, Yan
    Qi, Lin
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [2] Real-time image carrier generation based on generative adversarial network and fast object detection
    Li, Chuanlong
    Sun, Xingming
    Zhou, Zhili
    Yang, Yimin
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (03) : 655 - 665
  • [3] Real-time image carrier generation based on generative adversarial network and fast object detection
    Chuanlong Li
    Xingming Sun
    Zhili Zhou
    Yimin Yang
    Journal of Real-Time Image Processing, 2020, 17 : 655 - 665
  • [4] Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network
    Yang, Xin
    Chen, Jingyu
    Wang, Zhiwei
    Zhang, Qiaozhe
    Liu, Wenyu
    Liao, Chunyuan
    Cheng, Kwang-Ting
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 896 - 904
  • [5] EXTENDING MUSIC BASED ON EMOTION AND TONALITY VIA GENERATIVE ADVERSARIAL NETWORK
    Tseng, Bo-Wei
    Shen, Yih-Liang
    Chi, Tai-Shih
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 86 - 90
  • [6] AUTOMATED REAL-TIME PHARMACOKINETIC (PK) PREDICTION WITH GENERATIVE ADVERSARIAL NETWORK (GAN) METHODOLOGY.
    Hsieh, H.
    Liu, G.
    Lu, D.
    Liu, K.
    Lu, J.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2021, 109 : S13 - S13
  • [7] Physics-based generative adversarial network for real-time acoustic holography
    Lu, Qingyi
    Zhong, Chengxi
    Su, Hu
    Liu, Song
    ULTRASONICS, 2025, 149
  • [8] A REAL-TIME MEDICAL ULTRASOUND SIMULATOR BASED ON A GENERATIVE ADVERSARIAL NETWORK MODEL
    Peng, Bo
    Huang, Xing
    Wang, Shiyuan
    Jiang, Jingfeng
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4629 - 4633
  • [9] Generation of Music With Dynamics Using Deep Convolutional Generative Adversarial Network
    Toh, Raymond Kwan How
    Sourin, Alexei
    2021 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2021), 2021, : 137 - 140
  • [10] Real-time Traffic Flow Parameters Estimation Model Based on Generative Adversarial Network
    Yao R.-H.
    Wang R.-Y.
    Zhang W.-S.
    Ye J.-S.
    Sun F.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2022, 22 (03): : 158 - 167