Music Mood Classification Using Visual and Acoustic Features

被引:0
|
作者
Chagas Tavares, Juliano Cezar [1 ]
da Costa, Yandre Maldonado e Gomes [1 ]
机构
[1] Univ Estadual Maringa, Dept Informat, Maringa, Parana, Brazil
关键词
music mood classification; pattern recognition; spectrograms; acoustic features; visual features; machine learning; signal processing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This work aims to present a system for automatic music mood classification based on acoustic and visual features extracted from the music. The visual features are obtained from spectrograms and the acoustic features are extracted directly from the audio signal. The texture operators used are Local Phase Quantization (LPQ), Local Binary Pattern (LBP) and Robust Local Binary Pattern (RLBP). The acoustic features are described using Rhythm Patterns (RP), Rhythm Histogram (RH) and Statistical Spectrum Descriptor (SSD). The experiments performed were made on a subset of the Latin Music Mood Database, considering three mood classes: positive, negative, and neutral. In the classification step, SVM classifier was used and the final results were taken by using 5-fold cross validation. From several classifiers created performing a zoning strategy along the images, the best individual classifier is that created from the image zone which corresponds to the frequencies from 1,700 Hz to 3,400 Hz, and using the RLBP visual descriptor. In this case, the F-measure obtained is about 59.55%.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Combining visual and acoustic features for music genre classification
    Nanni, Loris
    Costa, Yandre M. G.
    Lumini, Alessandra
    Kim, Moo Young
    Baek, Seung Ryul
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2016, 45 : 108 - 117
  • [2] Optimal classification of music genres based on acoustic and visual features
    Kumaraswamy, Balachandra
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (23):
  • [3] Combining Acoustic and Multilevel Visual Features for Music Genre Classification
    Wu, Ming-Ju
    Jang, Jyh-Shing R.
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2015, 12 (01) : 1 - 17
  • [4] Ensemble of deep learning, visual and acoustic features for music genre classification
    Nanni, Loris
    Costa, Yandre M. G.
    Aguiar, Rafael L.
    Silla, Carlos N., Jr.
    Brahnam, Sheryl
    [J]. JOURNAL OF NEW MUSIC RESEARCH, 2018, 47 (04) : 383 - 397
  • [5] Speech/music classification using visual and spectral chromagram features
    Gajanan K. Birajdar
    Mukesh D. Patil
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 329 - 347
  • [6] Music genre classification using visual features with feature selection
    Zottesso, Rafael H. D.
    Costa, Yandre M. G.
    Bertolini, Diego
    [J]. PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [7] Speech/music classification using visual and spectral chromagram features
    Birajdar, Gajanan K.
    Patil, Mukesh D.
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (01) : 329 - 347
  • [8] Investigation of acoustic and visual features for acoustic scene classification
    Xie, Jie
    Zhu, Mingying
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 126 : 20 - 29
  • [9] Acoustic analysis and mood classification of pain-relieving music
    Knox, Don
    Beveridge, Scott
    Mitchell, Laura A.
    MacDonald, Raymond A. R.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (03): : 1673 - 1682
  • [10] Automatic Music Mood Classification Based on Timbre and Modulation Features
    Ren, Jia-Min
    Wu, Ming-Ju
    Jang, Jyh-Shing Roger
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2015, 6 (03) : 236 - 246