LEARNING ENVIRONMENTAL SOUNDS WITH END-TO-END CONVOLUTIONAL NEURAL NETWORK

被引:0
|
作者
Tokozume, Yuji [1 ]
Harada, Tatsuya [1 ]
机构
[1] Univ Tokyo, Tokyo, Japan
关键词
Environmental sound classification; convolutional neural network; end-to-end system; feature learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Environmental sound classification (ESC) is usually conducted based on handcrafted features such as the log-mel feature. Meanwhile, end-to-end classification systems perform feature extraction jointly with classification and have achieved success particularly in image classification. In the same manner, if environmental sounds could be directly learned from the raw waveforms, we would be able to extract a new feature effective for classification that could not have been designed by humans, and thi s new feature could improve the classification performance. In this paper, we propose a novel end-to-end ESC system using a convolutional neural network (CNN). The classification accuracy of our system on ESC-50 is 5.1% higher than that achieved when using logmel-CNN with the static log-mel feature. Moreover, we achieve a 6.5% improvement in classification accuracy over the state-of-the-art logmel-CNN with the static and delta log-mel feature, simply by combining our system and logmel-CNN.
引用
收藏
页码:2721 / 2725
页数:5
相关论文
共 50 条
  • [31] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Junho Jo
    Hyung Il Koo
    Jae Woong Soh
    Nam Ik Cho
    Multimedia Tools and Applications, 2020, 79 : 32137 - 32150
  • [32] Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
    Jo, Junho
    Koo, Hyung Il
    Soh, Jae Woong
    Cho, Nam Ik
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (43-44) : 32137 - 32150
  • [33] Automatic Driving of End-to-end Convolutional Neural Network Based on MobileNet-V2 Migration Learning
    Hu, Minghong
    Guo, Hui
    Ji, Xuyuan
    PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON VISUAL INFORMATION COMMUNICATION AND INTERACTION, VINCI 2019, 2019,
  • [34] End-to-End Driving Activities and Secondary Tasks Recognition Using Deep Convolutional Neural Network and Transfer Learning
    Xing, Yang
    Tang, Jianlin
    Liu, Hong
    Lv, Chen
    Cao, Dongpu
    Velenis, Efstathios
    Wang, Fei-Yue
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 1626 - 1631
  • [35] Jasper: An End-to-End Convolutional Neural Acoustic Model
    Li, Jason
    Lavrukhin, Vitaly
    Ginsburg, Boris
    Leary, Ryan
    Kuchaiev, Oleksii
    Cohen, Jonathan M.
    Nguyen, Huyen
    Gadde, Ravi Teja
    INTERSPEECH 2019, 2019, : 71 - 75
  • [36] End-to-End Text Recognition with Convolutional Neural Networks
    Wang, Tao
    Wu, David J.
    Coates, Adam
    Ng, Andrew Y.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3304 - 3308
  • [37] DeepChess :End-to-End Deep Neural Network for Automatic Learning in Chess
    David, Omid E.
    Netanyahu, Nathan S.
    Wolf, Lior
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 88 - 96
  • [38] An End-to-End Multiplex Graph Neural Network for Graph Representation Learning
    Liang, Yanyan
    Zhang, Yanfeng
    Gao, Dechao
    Xu, Qian
    IEEE ACCESS, 2021, 9 : 58861 - 58869
  • [39] End-to-End Convolutional Neural Network-based Voice Presentation Attack Detection
    Muckenhirn, Hannah
    Magimai-Doss, Mathew
    Marcel, Sebastien
    2017 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB), 2017, : 335 - 341
  • [40] Microaneurysm detection in fundus images based on a novel end-to-end convolutional neural network
    Liao, Yinhan
    Xia, Haiying
    Song, Shuxiang
    Li, Haisheng
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2021, 41 (02) : 589 - 604