Impact of various continuous wavelet transforms for acoustic scene classification with DCASE dataset

被引:0
|
作者
Singh, Vikash Kumar [1 ]
Sharma, Kalpana [1 ]
Sur, Samarendra Nath [2 ]
机构
[1] Sikkim Manipal Univ, Sikkim Manipal Inst Technol, Dept Comp Sci & Engn, Gangtok, Sikkim, India
[2] Sikkim Manipal Univ, Sikkim Manipal Inst Technol, Dept Elect & Commun Engn, Gangtok, Sikkim, India
关键词
Morlet; Shannon; Mexican Hat; Gaussian derivative; Dynamic time warping; Alignment cost;
D O I
10.1007/s11760-025-04031-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Acoustic scenes can occur in various environments, such as metro stations, buses, parks, airports etc. The chaos can arise during these acoustic scenes. Therefore, to manage the chaos during an acoustic scene, it is essential to identify each specific sound class in the environment. The present research paper introduces a model for classifying acoustic scenes using Dynamic Time Warping and various Continuous Wavelet Transform (CWT) variations. The model's development follows general steps, including sourcing the audio dataset, generating reference audio, extracting wavelet features, computing normalized alignment cost and comparing the classes of audio from the training and testing datasets based on alignment cost. The audio dataset, comprising 23,035 audio files used for model development, is sourced from the Detection and Classification of Acoustic Scenes and Events (DCASE). The proposed model uses Morlet, Shannon, Mexican Hat, Gaussian Derivative, and Complex Morlet wavelets for the Acoustic Scenes Classification (ASC) task. With more neighbors, Morlet, Shannon, Mexican Hat, and Complex Morlet show higher accuracy. The Morlet is the best CWT, achieving 70.1056% accuracy for ASC task. The proposed model surpasses the DCASE 2021 Challenge Task 1-Subtask A baseline model, which has an accuracy of 47.7%, as well as other models from DCASE in the same year.
引用
收藏
页数:15
相关论文
共 16 条
  • [1] ACOUSTIC SCENE CLASSIFICATION: AN OVERVIEW OF DCASE 2017 CHALLENGE ENTRIES
    Mesaros, Annamaria
    Heittola, Toni
    Virtanen, Tuomas
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 411 - 415
  • [2] A classification of continuous wavelet transforms in dimension three
    Currey, Bradley
    Fuehr, Hartmut
    Oussa, Vignon
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2019, 46 (03) : 500 - 543
  • [3] ASSESSMENT OF HUMAN AND MACHINE PERFORMANCE IN ACOUSTIC SCENE CLASSIFICATION: DCASE 2016 CASE STUDY
    Mesaros, Annamaria
    Heittola, Toni
    Virtanen, Tuomas
    2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 319 - 323
  • [4] Detection and classification of acoustic transients using wavelet transforms
    Dixon, TL
    Sibul, LH
    WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING IV, PTS 1 AND 2, 1996, 2825 : 342 - 351
  • [5] Acoustic Scene Classification using Dynamic Time Warping Technique based on Short Time Fourier Transform and Discrete Wavelet Transforms
    Singh, Vikash Kumar
    Sharma, Kalpana
    Sur, Samarendra Nath
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, : 1887 - 1913
  • [6] Wavelet Transform based Mel-scaled Features for Acoustic Scene Classification
    Waldekar, Shefali
    Saha, Goutam
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3323 - 3327
  • [7] Binaural Acoustic Scene Classification Using Wavelet Scattering, Parallel Ensemble Classifiers and Nonlinear Fusion
    Hajihashemi, Vahid
    Gharahbagh, Abdorreza Alavi
    Cruz, Pedro Miguel
    Ferreira, Marta Campos
    Machado, Jose J. M.
    Tavares, Joao Manuel R. S.
    SENSORS, 2022, 22 (04)
  • [8] Multi-Timescale Wavelet Scattering With Genetic Algorithm Feature Selection for Acoustic Scene Classification
    Kek, Xing Yong
    Chin, Cheng Siong
    Li, Ye
    IEEE ACCESS, 2022, 10 : 25987 - 26001
  • [9] Underwater acoustic classification using wavelet scattering transform and convolutional neural network with limited dataset
    Liu, Yongxiang
    Zhang, Biqi
    Kong, Fantong
    Wang, Biao
    Luo, Chengming
    Ma, Lin
    APPLIED ACOUSTICS, 2025, 232
  • [10] Acoustic Scene Classification Using Various Features and DNN Model: A Monolithic and Hierarchical Approach
    Paseddula, Chandrasekhar
    Gangashetty, Suryakanth V.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2025, 44 (01) : 239 - 280