Environment Sound Event Classification With a Two-Stream Convolutional Neural Network

Cited by: 26
Authors
Dong, Xifeng [1]
Yin, Bo [1, 2]
Cong, Yanping [1]
Du, Zehua [1]
Huang, Xianqing [1]
Affiliations
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao 266100, Peoples R China
[2] Pilot Natl Lab Marine Sci & Technol, Qingdao 266237, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Environmental sound classification; sound recognition; convolutional neural networks; data processing; pre-emphasis; two stream model; RECOGNITION; REPRESENTATIONS;
DOI
10.1109/ACCESS.2020.3007906
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, with the construction of intelligent cities, research on environmental sound classification (ESC) has become increasingly important. However, because environmental sound is non-stationary and ambient noise interferes strongly, the recognition accuracy of ESC remains limited. Even with deep learning methods, a model with a single input struggles to extract features fully. To improve the recognition accuracy of ESC, this paper proposes a two-stream convolutional neural network (CNN) based on a raw audio CNN (RACNN) and a logmel CNN (LMCNN). In this method, a pre-emphasis module is first constructed to process the raw audio signal. The processed audio data and the logmel data are then fed into the RACNN and the LMCNN, respectively, to obtain both time and frequency features of the audio. In addition, a random-padding method is proposed to pad shorter data sequences, which greatly increases the data available for the experiments. Finally, the effectiveness of the methods is verified on the UrbanSound8K dataset.
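The abstract names two preprocessing steps, pre-emphasis and random padding, without giving their details. The sketch below is only an illustration under stated assumptions: it uses the classic first-order pre-emphasis filter y[n] = x[n] - alpha * x[n-1] with alpha = 0.97 (a common default, not a value taken from the paper), and it interprets random padding as placing a short clip at a random offset inside a zero-filled buffer of fixed length. The function names `pre_emphasis` and `random_pad` are hypothetical, and the paper's actual modules may differ.

```python
import random


def pre_emphasis(signal, alpha=0.97):
    """Classic first-order pre-emphasis (assumed form, not from the paper).

    y[0] = x[0]; y[n] = x[n] - alpha * x[n - 1] for n >= 1.
    This boosts high frequencies relative to low ones.
    """
    if not signal:
        return []
    out = [signal[0]]
    for n in range(1, len(signal)):
        out.append(signal[n] - alpha * signal[n - 1])
    return out


def random_pad(signal, target_len, rng=None):
    """Hypothetical random padding: zero-pad a short clip at a random offset.

    Clips that are already long enough are truncated to target_len.
    """
    rng = rng or random.Random()
    if len(signal) >= target_len:
        return list(signal[:target_len])
    free = target_len - len(signal)
    offset = rng.randint(0, free)  # number of zeros placed before the clip
    return [0.0] * offset + list(signal) + [0.0] * (free - offset)


if __name__ == "__main__":
    clip = [0.1, 0.4, 0.3, -0.2]
    emphasized = pre_emphasis(clip)
    padded = random_pad(emphasized, target_len=8, rng=random.Random(0))
    print(len(padded))
```

Calling `random_pad` several times on the same short clip yields different placements, which is one plausible way a padding scheme could increase the number of usable training samples.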
Pages: 125714-125721
Number of pages: 8