Data augmentation guided knowledge distillation for environmental sound classification

被引:11
|
作者
Tripathi, Achyut Mani [1 ]
Paul, Konark [2 ]
机构
[1] Indian Institue Technol Guwahati, Dept Comp Sci & Engn, Gauhati 781039, Assam, India
[2] HSR Sect 5, Bengaluru 560034, Karnataka, India
关键词
Data augmentation; Deep model compression; Environmental sound classification; Knowledge distillation; Spectrogram; NETWORKS;
D O I
10.1016/j.neucom.2022.03.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Environmental sound classification (ESC) is an increasingly relevant field of research in recent years but has high computational overhead in its classification of environmental sounds. Knowledge distillation (KD) is a prominent technique to develop a lightweight deep model by distilling knowledge from a heavyweight model into a less computationally complex model. Generally, conventional KD techniques require manual setting of a temperature parameter to explore the similarity among the classes. Herein, we propose a novel data augmentation technique that creates an augmented data instance by blending hidden features of a data sample from one class with a style information (mean and standard values) of a data sample from another class. We have designed a new loss function to accomplish Knowledge Distillation that minimizes Kullback-Leibler (KL) divergence loss between class probabilities obtained from the teacher and student networks while classifying the augmented data sample. Furthermore, the proposed KD technique rids the process of a temperature parameter that needs to be set manually by the traditional vanilla KD technique. Our experiments on two benchmark ESC datasets i.e., the ESC-10 and DCASE 2019 Task-1(A) dataset demonstrate comparable performance of the student network to state-of-the-art techniques. Moreover, the student model is explainable and clearly explains why a signal is classified into a specific class. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:59 / 77
页数:19
相关论文
共 50 条
  • [1] Environmental Sound Classification Based on Knowledge Distillation
    Cui, Qianjin
    Zhao, Kun
    Wang, Li
    Gao, Kai
    Cao, Fang
    Wang, Xiaoman
    [J]. 2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 245 - 249
  • [2] Divide and Distill: New Outlooks on Knowledge Distillation for Environmental Sound Classification
    Tripathi, Achyut Mani
    Pandey, Om Jee
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1100 - 1113
  • [3] METRIC LEARNING BASED DATA AUGMENTATION FOR ENVIRONMENTAL SOUND CLASSIFICATION
    Lu, Rui
    Duan, Zhiyao
    Zhang, Changshui
    [J]. 2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 1 - 5
  • [4] Revamped Knowledge Distillation for Sound Classification
    Tripathi, Achyut Mani
    Mishra, Aakansha
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [5] Isotonic Data Augmentation for Knowledge Distillation
    Cui, Wanyun
    Yan, Sen
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2314 - 2320
  • [6] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
  • [7] A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation
    Zeng, Jinfang
    Li, Youming
    Zhang, Yu
    Chen, Da
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2021, 20 (03)
  • [8] Data Augmentation Using Generative Adversarial Network for Environmental Sound Classification
    Madhu, Aswathy
    Kumaraswamy, Suresh
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [9] MixKd: Mix Data Augmentation Guided Knowledge Distillation for Plant Leaf Disease Recognition
    Zhang, Haotian
    Wang, Meili
    [J]. GREEN, PERVASIVE, AND CLOUD COMPUTING, GPC 2022, 2023, 13744 : 169 - 177
  • [10] Environmental Sound Classification using Deep Convolutional Neural Networks and Data Augmentation
    Davis, Nithya
    Suresh, K.
    [J]. 2018 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2018, : 41 - 45