PRE-TRAINED DEEP NEURAL NETWORK USING SPARSE AUTOENCODERS AND SCATTERING WAVELET TRANSFORM FOR MUSICAL GENRE RECOGNITION

被引:6
|
作者
Klec, Mariusz [1 ]
Korzinek, Danijel [1 ]
机构
[1] Polish Japanese Acad Informat Technol, Warsaw, Poland
来源
COMPUTER SCIENCE-AGH | 2015年 / 16卷 / 02期
关键词
Sparse Autoencoders; deep learning; genre recognition; Scattering Wavelet Transform;
D O I
10.7494/csci.2015.16.2.133
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Research described in this paper tries to combine the approach of Deep Neural Networks (DNN) with the novel audio features extracted using the Scattering Wavelet Transform (SWT) for classifying musical genres. The SWT uses a sequence of Wavelet Transforms to compute the modulation spectrum coefficients of multiple orders, which has already shown to be promising for this task. The DNN in this work uses pre-trained layers using Sparse Autoencoders (SAE). Data obtained from the Creative Commons website jamendo.com is used to boost the well-known GTZAN database, which is a standard bench-mark for this task. The final classifier is tested using a 10-fold cross validation to achieve results similar to other state-of-the-art approaches.
引用
收藏
页码:133 / 144
页数:12
相关论文
共 50 条
  • [1] Unsupervised Feature Pre-training of the Scattering Wavelet Transform for Musical Genre Recognition
    Klec, Mariusz
    Korzinek, Danijel
    INTERNATIONAL WORKSHOP ON INNOVATIONS IN INFORMATION AND COMMUNICATION SCIENCE AND TECHNOLOGY, IICST 2014, 2014, 18 : 133 - 139
  • [2] Modelling of Speech Parameters of Punjabi by Pre-trained Deep Neural Network Using Stacked Denoising Autoencoders
    Kaur, Navdeep
    Singh, Parminder
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [3] Fast Learning for Accurate Object Recognition Using a Pre-trained Deep Neural Network
    Lobato-Rios, Victor
    Tenorio-Gonzalez, Ana C.
    Morales, Eduardo F.
    ADVANCES IN SOFT COMPUTING, MICAI 2017, PT I, 2018, 10632 : 41 - 53
  • [4] Development of a deep learning network using a pre-trained convolutional neural network
    Rooney, M.
    Mitchell, J.
    McLaren, D. B.
    Nailon, W. H.
    RADIOTHERAPY AND ONCOLOGY, 2019, 133 : S1051 - S1052
  • [5] Image Hashing by Pre-Trained Deep Neural Network
    Li Pingyuan
    Zhang Dan
    Yuan Xiaoguang
    Jiang Suiping
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 468 - 471
  • [6] Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition
    Zhang, Hua
    Gou, Ruoyun
    Shang, Jili
    Shen, Fangyao
    Wu, Yifan
    Dai, Guojun
    FRONTIERS IN PHYSIOLOGY, 2021, 12
  • [7] Object Recognition using Template Matching and Pre-trained convolutional neural network
    Abbas, Qaisar
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (08): : 69 - 79
  • [8] Autism Spectrum Disorder Detection Based on Wavelet Transform of BOLD fMRI Signals Using Pre-trained Convolution Neural Network
    Al-Hiyali, Mohammed I.
    Yahya, Norashikin
    Faye, Ibrahima
    Khan, Zia
    INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2021, 13 (05): : 49 - 56
  • [9] Painting Classification Using a Pre-trained Convolutional Neural Network
    Banerji, Sugata
    Sinha, Atreyee
    COMPUTER VISION, GRAPHICS, AND IMAGE PROCESSING, ICVGIP 2016, 2017, 10481 : 168 - 179
  • [10] Skin Lesion Classification Using Pre-Trained DenseNet201 Deep Neural Network
    Jasil, S. P. Godlin
    Ulagamuthalvi, V.
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 393 - 396