Using machine learning analysis to interpret the relationship between music emotion and lyric features

Cited by: 6
Authors
Xu, Liang [1]
Sun, Zaoyi [2]
Wen, Xin [1]
Huang, Zhengxi [1]
Chao, Chi-ju [3]
Xu, Liuchang [4, 5]
Affiliations
[1] Zhejiang Univ, Dept Psychol & Behav Sci, Hangzhou, Peoples R China
[2] Zhejiang Univ Technol, Coll Educ, Hangzhou, Peoples R China
[3] Tsinghua Univ, Dept Informat Art & Design, Beijing, Peoples R China
[4] Zhejiang A&F Univ, Zhejiang Prov Key Lab Forestry Intelligent Monito, Hangzhou, Peoples R China
[5] Zhejiang A&F Univ, Coll Math & Comp Sci, Hangzhou, Peoples R China
Keywords
Music emotion recognition; Lyric feature extraction; Audio signal processing; LIWC; Chinese pop song; INTEGRATION; MELODY; CLASSIFICATION; PERCEPTION; LANGUAGE; MEMORY; WORDS; SONGS; TEXT
DOI
10.7717/peerj-cs.785
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Melody and lyrics, which reflect two unique human cognitive abilities, are usually combined in music to convey emotions. Although psychologists and computer scientists have made considerable progress in revealing the association between musical structure and the perceived emotions of music, lyric features have received comparatively little attention. Using Linguistic Inquiry and Word Count (LIWC) to extract lyric features from 2,372 Chinese songs, this study investigated the effects of LIWC-based lyric features on the perceived arousal and valence of music. First, correlation analysis showed that, for example, the perceived arousal of music was positively correlated with the total number of lyric words and the mean number of words per sentence, and negatively correlated with the proportion of words related to the past and to insight. The perceived valence of music was negatively correlated with the proportion of negative emotion words. Second, we used audio and lyric features as inputs to construct music emotion recognition (MER) models. The performance of the random forest regressions revealed that, for the recognition models of perceived valence, adding lyric features significantly improved prediction performance over the model using audio features only, whereas for the recognition models of perceived arousal, lyric features contributed almost nothing. Finally, by calculating feature importance to interpret the MER models, we observed that audio features played a decisive role in the recognition models of both perceived arousal and perceived valence. In contrast to their negligible contribution to the arousal recognition model, several lyric features, such as the usage frequency of words related to sadness, positive emotions, and tentativeness, played important roles in the valence recognition model.
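The correlation step described in the abstract can be illustrated with a minimal sketch. It assumes Pearson correlation (the abstract does not say which coefficient was used), and the feature names `word_count` and `past_word_ratio` and all numeric values are hypothetical toy data, not figures from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def correlate_with(target, table):
    """Correlate every column of `table` except `target` with the target column."""
    return {
        name: pearson_r(values, table[target])
        for name, values in table.items()
        if name != target
    }


# Toy, made-up values for six hypothetical songs (NOT data from the study).
songs = {
    "word_count": [120, 180, 240, 300, 360, 420],
    "past_word_ratio": [0.09, 0.07, 0.08, 0.04, 0.03, 0.02],
    "arousal": [0.20, 0.35, 0.30, 0.55, 0.70, 0.80],
}

correlations = correlate_with("arousal", songs)
for name, r in correlations.items():
    print(f"{name}: r = {r:+.2f}")
```

In this toy table the word count rises with arousal and the past-word ratio falls, so the signs mirror the direction of the reported associations; the magnitudes carry no meaning.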
Pages: 1 / 23
Number of pages: 23