LEARNING ENVIRONMENTAL SOUNDS WITH END-TO-END CONVOLUTIONAL NEURAL NETWORK

被引:0
|
作者
Tokozume, Yuji [1 ]
Harada, Tatsuya [1 ]
机构
[1] Univ Tokyo, Tokyo, Japan
关键词
Environmental sound classification; convolutional neural network; end-to-end system; feature learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Environmental sound classification (ESC) is usually conducted based on handcrafted features such as the log-mel feature. Meanwhile, end-to-end classification systems perform feature extraction jointly with classification and have achieved success particularly in image classification. In the same manner, if environmental sounds could be directly learned from the raw waveforms, we would be able to extract a new feature effective for classification that could not have been designed by humans, and thi s new feature could improve the classification performance. In this paper, we propose a novel end-to-end ESC system using a convolutional neural network (CNN). The classification accuracy of our system on ESC-50 is 5.1% higher than that achieved when using logmel-CNN with the static log-mel feature. Moreover, we achieve a 6.5% improvement in classification accuracy over the state-of-the-art logmel-CNN with the static and delta log-mel feature, simply by combining our system and logmel-CNN.
引用
收藏
页码:2721 / 2725
页数:5
相关论文
共 50 条
  • [1] End-to-end Learning Approach for Autonomous Driving: A Convolutional Neural Network Model
    Wang, Yaqin
    Liu, Dongfang
    Jeon, Hyewon
    Chu, Zhiwei
    Matson, Eric T.
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 833 - 839
  • [2] End-to-End Exposure Fusion Using Convolutional Neural Network
    Wang, Jinhua
    Wang, Weiqiang
    Xu, Guangmei
    Liu, Hongzhe
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (02): : 560 - 563
  • [3] End-to-End Hardware Accelerator for Deep Convolutional Neural Network
    Chang, Tian-Sheuan
    [J]. 2018 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2018,
  • [4] End-to-end environmental sound classification using a 1D convolutional neural network
    Abdoli, Sajjad
    Cardinal, Patrick
    Koerich, Alessandro Lameiras
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 136 : 252 - 263
  • [5] HACNet: End-to-end learning of interpretable table-to-image converter and convolutional neural network
    Matsuda, Takuya
    Uchida, Kento
    Saito, Shota
    Shirakawa, Shinichi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [6] End-to-End PSK Signals Demodulation Using Convolutional Neural Network
    Chen, Wen-Jie
    Wang, Jiao
    Li, Jian-Qing
    [J]. IEEE ACCESS, 2022, 10 : 58302 - 58310
  • [7] End-to-End Unsupervised Deformable Image Registration with a Convolutional Neural Network
    de Vos, Bob D.
    Berendsen, Floris F.
    Viergever, Max A.
    Staring, Marius
    Isgum, Ivana
    [J]. DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, 2017, 10553 : 204 - 212
  • [8] An End-to-End Dense Connected Heterogeneous Graph Convolutional Neural Network
    Yan, Ranhui
    Cai, Jia
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2023, PT I, 2024, 14447 : 462 - 475
  • [9] An End-To-End Hyperbolic Deep Graph Convolutional Neural Network Framework
    Zhou Y.
    Huo H.
    Hou Z.
    Bu L.
    Wang Y.
    Mao J.
    Lv X.
    Bu F.
    [J]. CMES - Computer Modeling in Engineering and Sciences, 2023, 139 (01): : 537 - 563
  • [10] Image reflection removal using end-to-end convolutional neural network
    Li, Jinjiang
    Li, Guihui
    Fan, Hui
    [J]. IET IMAGE PROCESSING, 2020, 14 (06) : 1047 - 1058