Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review

被引:29
|
作者
Abayomi-Alli, Olusola O. [1 ]
Damasevicius, Robertas [1 ]
Qazi, Atika [2 ]
Adedoyin-Olowe, Mariam [3 ]
Misra, Sanjay [4 ]
机构
[1] Kaunas Univ Technol, Dept Software Engn, LT-44249 Kaunas, Lithuania
[2] Univ Brunei Darussalam, Ctr Lifelong Learning, BE-1410 Gadong, Brunei
[3] Birmingham City Univ, Sch Comp & Digital Technol, Birmingham B4 7XG, W Midlands, England
[4] Ostfold Univ Coll, Dept Comp Sci & Commun, N-1757 Halden, Norway
关键词
sound data; audio data; data augmentation; feature extraction; deep learning; ARTIFICIAL-INTELLIGENCE; EVENT CLASSIFICATION; FAULT-DIAGNOSIS; NEURAL-NETWORKS; RECOGNITION; SPEECH; FEATURES; AUDIO; BREATH;
D O I
10.3390/electronics11223795
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The aim of this systematic literature review (SLR) is to identify and critically evaluate current research advancements with respect to small data and the use of data augmentation methods to increase the amount of data available for deep learning classifiers for sound (including voice, speech, and related audio signals) classification. Methodology: This SLR was carried out based on the standard SLR guidelines based on PRISMA, and three bibliographic databases were examined, namely, Web of Science, SCOPUS, and IEEE Xplore. Findings. The initial search findings using the variety of keyword combinations in the last five years (2017-2021) resulted in a total of 131 papers. To select relevant articles that are within the scope of this study, we adopted some screening exclusion criteria and snowballing (forward and backward snowballing) which resulted in 56 selected articles. Originality: Shortcomings of previous research studies include the lack of sufficient data, weakly labelled data, unbalanced datasets, noisy datasets, poor representations of sound features, and the lack of effective augmentation approach affecting the overall performance of classifiers, which we discuss in this article. Following the analysis of identified articles, we overview the sound datasets, feature extraction methods, data augmentation techniques, and its applications in different areas in the sound classification research problem. Finally, we conclude with the summary of SLR, answers to research questions, and recommendations for the sound classification task.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Fractional-Order Calculus-Based Data Augmentation Methods for Environmental Sound Classification with Deep Learning
    Yazgac, Bilgi Gorkem
    Kirci, Murvet
    [J]. FRACTAL AND FRACTIONAL, 2022, 6 (10)
  • [2] Deep Learning Methods for Heart Sounds Classification: A Systematic Review
    Chen, Wei
    Sun, Qiang
    Chen, Xiaomin
    Xie, Gangcai
    Wu, Huiqun
    Xu, Chen
    [J]. ENTROPY, 2021, 23 (06)
  • [3] Explanations of Augmentation Methods for Deep Learning ECG Classification
    Balasubramanian, Nikil Sharan Prabahar
    Dakshit, Sagnik
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PT II, AIME 2024, 2024, 14845 : 277 - 287
  • [4] A comprehensive systematic review of deep learning methods for hyperspectral images classification
    Ranjan, Pallavi
    Girdhar, Ashish
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) : 6221 - 6306
  • [5] Effect of Data Augmentation in the Classification and Validation of Tomato Plant Disease with Deep Learning Methods
    Wagle, Shivali Amit
    Harikrishnan, R.
    Sampe, Jahariah
    Mohammad, Faseehuddin
    Ali, Sawal Hamid Md
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (06) : 1657 - 1670
  • [6] METRIC LEARNING BASED DATA AUGMENTATION FOR ENVIRONMENTAL SOUND CLASSIFICATION
    Lu, Rui
    Duan, Zhiyao
    Zhang, Changshui
    [J]. 2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 1 - 5
  • [7] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
  • [8] Machine Learning and Deep Learning Methods for Skin Lesion Classification and Diagnosis: A Systematic Review
    Kassem, Mohamed A.
    Hosny, Khalid M.
    Damasevicius, Robertas
    Eltoukhy, Mohamed Meselhy
    [J]. DIAGNOSTICS, 2021, 11 (08)
  • [9] Data Augmentation Techniques to Detect Cervical Cancer Using Deep Learning: A Systematic Review
    Wubineh, Betelhem Zewdu
    Rusiecki, Andrzej
    Halawa, Krzysztof
    [J]. SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 325 - 336
  • [10] Opportunities and challenges of deep learning methods for electrocardiogram data: A systematic review
    Hong, Shenda
    Zhou, Yuxi
    Shang, Junyuan
    Xiao, Cao
    Sun, Jimeng
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 122