Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification

被引:8
|
作者
Lu, Jianrui [1 ]
Ma, Ruofei [1 ]
Liu, Gongliang [1 ]
Qin, Zhiliang [2 ]
机构
[1] Harbin Inst Technol, Dept Commun Engn, Weihai, Peoples R China
[2] Weihai Beiyang Elect Grp Co Ltd, Technol R&D Ctr, Weihai, Peoples R China
基金
中国国家自然科学基金;
关键词
environmental sound classification; transfer learning; Xception; CNN; Log-Mel spectrogram; scalogram; MFCC; ESC-50;
D O I
10.1109/ICCCR49711.2021.9349393
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Environmental sound classification (ESC) is an important issue. However, due to the lack of datasets, high-accuracy ESC has always been challenging. In this paper, we propose a new convolutional neural network (CNN) model using transfer learning technology for ESC task. First, we represent sound as RGB image, where the red channel corresponds to the Log-Mel spectrogram, the green channel corresponds to the scalogram, and the blue channel corresponds to the Mel frequency cepstrum coefficient (MFCC). Second, we train a CNN architecture based on Xception model which has a better performance on the JFT dataset. Test results show that the proposed approach is with a better performance on the ESC accuracy.
引用
收藏
页码:242 / 245
页数:4
相关论文
共 50 条
  • [31] Transfer Learning with Deep Convolutional Neural Network for SAR Target Classification with Limited Labeled Data
    Huang, Zhongling
    Pan, Zongxu
    Lei, Bin
    REMOTE SENSING, 2017, 9 (09)
  • [32] Classification of Heart Sounds Using Chaogram Transform and Deep Convolutional Neural Network Transfer Learning
    Harimi, Ali
    Majd, Yahya
    Gharahbagh, Abdorreza Alavi
    Hajihashemi, Vahid
    Esmaileyan, Zeynab
    Machado, Jose J. M.
    Tavares, Joao Manuel R. S.
    SENSORS, 2022, 22 (24)
  • [33] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    Multimedia Tools and Applications, 2024, 83 : 54755 - 54772
  • [34] Fast environmental sound classification based on resource adaptive convolutional neural network
    Zheng Fang
    Bo Yin
    Zehua Du
    Xianqing Huang
    Scientific Reports, 12
  • [35] Fast environmental sound classification based on resource adaptive convolutional neural network
    Fang, Zheng
    Yin, Bo
    Du, Zehua
    Huang, Xianqing
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [36] Convolutional neural network based traffic sound classification robust to environmental noise
    Lee, Jaejun
    Kim, Wansoo
    Lee, Kyogu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2018, 37 (06): : 469 - 474
  • [37] Robust technique for environmental sound classification using convolutional recurrent neural network
    Bansal, Anam
    Garg, Naresh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54755 - 54772
  • [38] Lightweight deep convolutional neural network for background sound classification in speech signals
    Dayal, Aveen
    Yeduri, Sreenivasa Reddy
    Koduru, Balu Harshavardan
    Jaiswal, Rahul Kumar
    Soumya, J.
    Srinivas, M. B.
    Pandey, Om Jee
    Cenkeramaddi, Linga Reddy
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 151 (04): : 2773 - 2786
  • [39] A Deep Transfer Convolutional Neural Network Framework for EEG Signal Classification
    Xu, Gaowei
    Shen, Xiaoang
    Chen, Sirui
    Zong, Yongshuo
    Zhang, Canyang
    Yue, Hongyang
    Liu, Min
    Chen, Fei
    Che, Wenliang
    IEEE ACCESS, 2019, 7 : 112767 - 112776
  • [40] Tomato pest classification using deep convolutional neural network with transfer learning, fine tuning and scratch learning
    Pattnaik, Gayatri
    Shrivastava, Vimal K.
    Parvathi, K.
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (03): : 433 - 442