Music Track Recommendation Using Deep-CNN and Mel Spectrograms

Authors
Yin, Tingrong [1]
Affiliation
[1] Hunan First Normal Univ, Sch Mus & Dance, Hunan 410002, Peoples R China
Source
MOBILE NETWORKS & APPLICATIONS | 2023 / Vol. 28 / Issue 06
Keywords
Music track; Recommendation algorithm; Deep learning; Convolutional neural network; GRAPH;
DOI
10.1007/s11036-023-02170-2
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Recommender systems that use IoT and deep learning play a vital part in creating an engaging experience on online music streaming platforms. In the music domain, however, building a recommender system is challenging: some tracks are short, some are listened to several times, and tracks are generally consumed in sessions together with other tracks, so the recommendation of the next track is highly context dependent. Traditional recommendation algorithms cannot extract deep-level features from the audio signal or effectively mine the music a user prefers. This paper therefore proposes a deep learning-based music recommendation algorithm. The algorithm first preprocesses the original data, then generates a Mel spectrogram feature set through fast Fourier transform and Mel filter processing. After a logarithmic operation is applied, the spectrograms are fed to a convolutional neural network that categorizes the music tracks. The inference results are used to understand the user's preferences and to recommend music tracks the user is likely to favor. Experiments and comparisons on several data sets show that the algorithm achieves good recommendation performance.
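The feature-extraction steps named in the abstract (fast Fourier transform, Mel filtering, logarithm) can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the sample rate, frame length, hop size, number of Mel bands, Hann window, and the HTK-style Mel formula are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    # HTK-style Mel formula (an assumption; the abstract does not name a variant)
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    # Triangular filters whose edges are evenly spaced on the Mel scale
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, center, hi = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - lo) / (center - lo)
        falling = (hi - fft_freqs) / (hi - center)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr=22050, n_fft=1024, hop=512, n_mels=64):
    # All default parameter values are illustrative assumptions
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack(
        [signal[t * hop : t * hop + n_fft] * window for t in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2    # (n_frames, n_fft//2 + 1)
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T   # (n_frames, n_mels)
    return np.log(mel + 1e-10).T                        # (n_mels, n_frames)

# Usage: one second of a synthetic 440 Hz tone stands in for a music clip
sr = 22050
t = np.arange(sr) / sr
tone = np.sin(2 * np.pi * 440.0 * t)
spec = log_mel_spectrogram(tone, sr=sr)
print(spec.shape)
```

The resulting 2-D array (Mel bands by time frames) is the kind of image-like input a convolutional network can classify; in practice a library such as librosa is commonly used for this step instead of hand-written filters.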
Pages: 2130-2137
Number of pages: 8