Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification

Cited by: 8
Authors
Lu, Jianrui [1]
Ma, Ruofei [1]
Liu, Gongliang [1]
Qin, Zhiliang [2]
Affiliations
[1] Harbin Inst Technol, Dept Commun Engn, Weihai, Peoples R China
[2] Weihai Beiyang Elect Grp Co Ltd, Technol R&D Ctr, Weihai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
environmental sound classification; transfer learning; Xception; CNN; Log-Mel spectrogram; scalogram; MFCC; ESC-50;
DOI
10.1109/ICCCR49711.2021.9349393
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Environmental sound classification (ESC) is an important problem. However, because datasets are scarce, high-accuracy ESC has always been challenging. In this paper, we propose a new convolutional neural network (CNN) model that uses transfer learning for the ESC task. First, we represent each sound as an RGB image, where the red channel corresponds to the Log-Mel spectrogram, the green channel corresponds to the scalogram, and the blue channel corresponds to the Mel-frequency cepstral coefficients (MFCCs). Second, we train a CNN architecture based on the Xception model, which has shown better performance on the JFT dataset. Test results show that the proposed approach achieves better ESC accuracy.
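
The three-channel representation described in the abstract can be illustrated with a minimal NumPy sketch. It assumes the Log-Mel spectrogram, scalogram, and MFCC matrices have already been computed elsewhere. The function names (`to_uint8`, `resize_nearest`, `stack_as_rgb`), the 128 x 128 target size, the min-max scaling, and the nearest-neighbour resizing are all illustrative assumptions; the abstract does not state how the authors align or normalize the channels.

```python
import numpy as np


def to_uint8(feature):
    """Min-max scale a 2-D feature map to the 0..255 range (assumed normalization)."""
    feature = np.asarray(feature, dtype=np.float64)
    lo, hi = feature.min(), feature.max()
    if hi == lo:
        # A constant map carries no contrast; return an all-zero channel.
        return np.zeros(feature.shape, dtype=np.uint8)
    return np.round(255.0 * (feature - lo) / (hi - lo)).astype(np.uint8)


def resize_nearest(feature, shape):
    """Nearest-neighbour resize of a 2-D array to (rows, cols)."""
    feature = np.asarray(feature)
    rows = np.arange(shape[0]) * feature.shape[0] // shape[0]
    cols = np.arange(shape[1]) * feature.shape[1] // shape[1]
    return feature[np.ix_(rows, cols)]


def stack_as_rgb(log_mel, scalogram, mfcc, shape=(128, 128)):
    """Stack three feature maps as the R, G, and B channels of one image."""
    channels = [
        to_uint8(resize_nearest(f, shape))
        for f in (log_mel, scalogram, mfcc)  # red, green, blue, in that order
    ]
    return np.stack(channels, axis=-1)


# Placeholder inputs standing in for real features of one audio clip.
rng = np.random.default_rng(0)
image = stack_as_rgb(
    rng.normal(size=(64, 431)),   # stands in for a Log-Mel spectrogram
    rng.normal(size=(96, 431)),   # stands in for a scalogram
    rng.normal(size=(20, 431)),   # stands in for MFCCs
)
print(image.shape, image.dtype)
```

The resulting array has shape (128, 128, 3) and dtype `uint8`, so it can be passed to an image model in the same way as an ordinary RGB picture.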
Pages: 242-245
Number of pages: 4
Related papers
50 in total
  • [1] Deep Convolutional Neural Network with Mixup for Environmental Sound Classification
    Zhang, Zhichao
    Xu, Shugong
    Cao, Shan
    Zhang, Shunqing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 356 - 367
  • [2] Deep convolutional neural network for environmental sound classification via dilation
    Roy, Sanjiban Sekhar
    Mihalache, Sanda Florentina
    Pricop, Emil
    Rodrigues, Nishant
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1827 - 1833
  • [3] Deep Convolutional Neural Network Combined with Concatenated Spectrogram for Environmental Sound Classification
    Chi, Zhejian
    Li, Ying
    Chen, Cheng
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 251 - 254
  • [4] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [5] Transfer learning with deep convolutional neural network for constitution classification with face image
    Huan, Er-Yang
    Wen, Gui-Hua
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 11905 - 11919
  • [6] Classification of Breast Abnormalities Using a Deep Convolutional Neural Network and Transfer Learning
    Ruchai, A. N.
    Kober, V. I.
    Dorofeev, K. A.
    Karnaukhov, V. N.
    Mozerov, M. G.
    [J]. JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2021, 66 (06) : 778 - 783
  • [7] Deep convolutional recurrent neural network with transfer learning for hyperspectral image classification
    Liu, Bing
    Yu, Xuchu
    Yu, Anzhu
    Wan, Gang
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (02)
  • [8] Transfer learning with deep convolutional neural network for constitution classification with face image
    Er-Yang Huan
    Gui-Hua Wen
    [J]. Multimedia Tools and Applications, 2020, 79 : 11905 - 11919
  • [9] Crop pest classification based on deep convolutional neural network and transfer learning
    Thenmozhi, K.
    Reddy, U. Srinivasulu
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2019, 164
  • [10] Classification of Breast Abnormalities Using a Deep Convolutional Neural Network and Transfer Learning
    A. N. Ruchai
    V. I. Kober
    K. A. Dorofeev
    V. N. Karnaukhov
    M. G. Mozerov
    [J]. Journal of Communications Technology and Electronics, 2021, 66 : 778 - 783