Hierarchical classification for acoustic scenes using deep learning

被引:1
|
作者
Ding, Biyun [1 ]
Zhang, Tao [1 ]
Liu, Ganjun [1 ]
Wang, Chao [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Acoustic scene classification; Convolutional neural network; Data augmentation; Hierarchical classification; Late fusion;
D O I
10.1016/j.apacoust.2023.109594
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic Scene Classification (ASC) aims to obtain the sound environment by analyzing audio signals. Due to the low complexity and acquisition cost of audio signals, ASC has enormous potential in various applications, such as audio-based surveillance, smart cities/homes, and robotics. Recently, various methods have been proposed for ASC and achieved good performance. However, when they are used to address complex ASC problems, most of them suffer from the low-performance problem. In this paper, we propose to use hierarchical classification methods to replace the conventional flat approach in ASC applications, which utilizes the class hierarchy to optimize classification performance. In particular, we investigate the ASC problem under the framework of hierarchical classification. Firstly, to improve classification performance, three hierarchical classification methods introducing the class hierarchy of acoustic scenes are proposed for ASC. Moreover, to fully utilize the class hierarchy, a hybrid hierarchical classification method, and an optimal late fusion-based hierarchical method are proposed, which are based on the flexibility and simplification of hierarchical classification. The experiments demonstrate the efficacy of hierarchical ASC systems for performance improvement, and the best system achieves an accuracy of 78.86% on the DCASE 2020 Task1A dataset, resulting in accuracy gains of 24.76% and 8.52% absolute over the DCASE 2020 Task 1A baseline and the conventional non-hierarchical method, respectively.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Acoustic Classification using Deep Learning
    Aslam, Muhammad Ahsan
    Sarwar, Muhammad Umer
    Hanif, Muhammad Kashif
    Talib, Ramzan
    Khalid, Usama
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (08) : 153 - 159
  • [2] Classification of Complicated Urban Forest Acoustic Scenes with Deep Learning Models
    Zhang, Chengyun
    Zhan, Haisong
    Hao, Zezhou
    Gao, Xinghui
    [J]. FORESTS, 2023, 14 (02):
  • [3] Hierarchical Modulation Classification using Deep Learning
    Vanhoy, Garrett
    Thurston, Noah
    Burger, Andrew
    Breckenridge, Jacob
    Bose, Tamal
    [J]. 2018 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2018), 2018, : 20 - 25
  • [4] Acoustic Scene Classification using Deep Learning Architectures
    Spoorthy, V
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    [J]. 2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [5] Classification of Ecological Garden Scenes Using Deep Learning in Remote Sensing Images
    Wang, Xiaoyu
    Liu, Shaohui
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (04)
  • [6] COVID-19 Severity Classification Using a Hierarchical Classification Deep Learning Model
    Ortiz, Sergio
    Morales, Juan Carlos
    Rojas, Fernando
    Valenzuela, Olga
    Herrera, Luis Javier
    Rojas, Ignacio
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING, PT I, 2022, : 442 - 452
  • [7] EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning
    Lakomkin, Egor
    Zamani, Mohammad Ali
    Weber, Cornelius
    Magg, Sven
    Wermter, Stefan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 4445 - 4450
  • [8] Deep Learning Approach to Classification of Acoustic Signals Using Information Features
    Lysenko, P. V.
    Nasonov, I. A.
    Galyaev, A. A.
    Berlin, L. M.
    [J]. DOKLADY MATHEMATICS, 2023, 108 (SUPPL 2) : S196 - S204
  • [9] HDLTex: Hierarchical Deep Learning for Text Classification
    Kowsari, Kamran
    Brown, Donald E.
    Heidarysafa, Mojtaba
    Meimandi, Kiana Jafari
    Gerber, Matthew S.
    Barnes, Laura E.
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 364 - 371
  • [10] Using deep learning for acoustic event classification: The case of natural disasters
    Ekpezu, Akon O.
    Wiafe, Isaac
    Katsriku, Ferdinand
    Yaokumah, Winfred
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 149 (04): : 2926 - 2935