Multi-modal, Multi-task and Multi-label for Music Genre Classification and Emotion Regression

被引:3
|
作者
Pandeya, Yagya Raj [1 ]
You, Jie [1 ]
Bhattarai, Bhuwan [1 ]
Lee, Joonwhoan [1 ]
机构
[1] Jeonbuk Natl Univ, Div Comp Sci & Engn, Jeonju, South Korea
基金
新加坡国家研究基金会;
关键词
multi-modal; multi-task; multi-label; music genre classification; emotion regression;
D O I
10.1109/ICTC52510.2021.9620826
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A smart system is highly desirable with the capability to divide music into coarse and fine categories based on emotion and genre. In this paper, we classify the music based on genre and emotion into 44 class categories which further subdivided into 255 fine categories. The music and lyrics information is input into two separate networks and the global information is integrated for the final classification and regression task. We proposed a channel and filter convolution network by factorizing spatial and temporal interactions of the standard 2D/3D convolutions. Furthermore, the channel interaction of the general residual block is factorized to one. The proposed convolutional method yields significant gains in accuracy and lower computational cost. The network is trained and tested on a public dataset and evaluated for individual and joint representation of audio and lyrics networks.
引用
收藏
页码:1042 / 1045
页数:4
相关论文
共 50 条
  • [1] Multi-label emotion classification based on adversarial multi-task learning
    Lin, Nankai
    Fu, Sihui
    Lin, Xiaotian
    Wang, Lianxi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (06)
  • [2] Multi-modal Multi-label Emotion Detection with Modality and Label Dependence
    Dong Zhang
    Ju, Xincheng
    Li, Junhui
    Li, Shoushan
    Zhu, Qiaoming
    Zhou, Guodong
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3584 - 3593
  • [3] Multi-label classification of music by emotion
    Konstantinos Trohidis
    Grigorios Tsoumakas
    George Kalliris
    Ioannis Vlahavas
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [4] Multi-label classification of music by emotion
    Trohidis, Konstantinos
    Tsoumakas, Grigorios
    Kalliris, George
    Vlahavas, Ioannis
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2011, : 1 - 9
  • [5] Multi-modal Music Genre Classification Approach
    Zhen, Chao
    Xu, Jieping
    [J]. PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 8, 2010, : 398 - 402
  • [6] Tailor Versatile Multi-Modal Learning for Multi-Label Emotion Recognition
    Zhang, Yi
    Chen, Mingyuan
    Shen, Jundong
    Wang, Chongjun
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9100 - 9108
  • [7] Multi-task Joint Feature Selection for Multi-label Classification
    He Zhifen
    Yang Ming
    Liu Huidong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (02) : 281 - 287
  • [8] Multi-task Joint Feature Selection for Multi-label Classification
    HE Zhifen
    YANG Ming
    LIU Huidong
    [J]. Chinese Journal of Electronics, 2015, 24 (02) : 281 - 287
  • [9] Multi-modal microblog classification via multi-task learning
    Sicheng Zhao
    Hongxun Yao
    Sendong Zhao
    Xuesong Jiang
    Xiaolei Jiang
    [J]. Multimedia Tools and Applications, 2016, 75 : 8921 - 8938
  • [10] Multi-modal microblog classification via multi-task learning
    Zhao, Sicheng
    Yao, Hongxun
    Zhao, Sendong
    Jiang, Xuesong
    Jiang, Xiaolei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 8921 - 8938