Spectrogram based multi-task audio classification

被引:0
|
作者
Yuni Zeng
Hua Mao
Dezhong Peng
Zhang Yi
机构
[1] Sichuan University,Machine Intelligence Laboratory, College of Computer Science
来源
关键词
Multi-task learning; Convolutional neural networks; Deep residual networks; Audio classification;
D O I
暂无
中图分类号
学科分类号
摘要
Audio classification is regarded as a great challenge in pattern recognition. Although audio classification tasks are always treated as independent tasks, tasks are essentially related to each other such as speakers’ accent and speakers’ identification. In this paper, we propose a Deep Neural Network (DNN)-based multi-task model that exploits such relationships and deals with multiple audio classification tasks simultaneously. We term our model as the gated Residual Networks (GResNets) model since it integrates Deep Residual Networks (ResNets) with a gate mechanism, which extract better representations between tasks compared with Convolutional Neural Networks (CNNs). Specifically, two multiplied convolutional layers are used to replace two feed-forward convolution layers in the ResNets. We tested our model on multiple audio classification tasks and found that our multi-task model achieves higher accuracy than task-specific models which train the models separately.
引用
收藏
页码:3705 / 3722
页数:17
相关论文
共 50 条
  • [31] A Multi-Task Classification Method for Application Traffic Classification Using Task Relationships
    Baek, Ui-Jun
    Kim, Boseon
    Park, Jee-Tae
    Choi, Jeong-Woo
    Kim, Myung-Sup
    ELECTRONICS, 2023, 12 (17)
  • [32] Multi-Task Learning-Based Immunofluorescence Classification of Kidney Disease
    Pan, Sai
    Fu, Yibing
    Chen, Pu
    Liu, Jiaona
    Liu, Weicen
    Wang, Xiaofei
    Cai, Guangyan
    Yin, Zhong
    Wu, Jie
    Tang, Li
    Wang, Yong
    Duan, Shuwei
    Dai, Ning
    Jiang, Lai
    Xu, Mai
    Chen, Xiangmei
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (20)
  • [33] A Multi-task Text Classification Model Based on Label Embedding Learning
    Xu, Yuemei
    Fan, Zuwei
    Cao, Han
    CYBER SECURITY, CNCERT 2021, 2022, 1506 : 211 - 225
  • [34] WEIGHTED AND MULTI-TASK LOSS FOR RARE AUDIO EVENT DETECTION
    Huy Phan
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    Mertins, Alfred
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 336 - 340
  • [35] Multi-Task Learning Based Joint Pulse Detection and Modulation Classification
    Akyon, Fatih Cagatay
    Nuhoglu, Mustafa Atahan
    Alp, Yasar Kemal
    Arikan, Orhan
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [36] Multi-task Modular Backpropagation for Feature-Based Pattern Classification
    Chandra, Rohitash
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT VI, 2017, 10639 : 558 - 566
  • [37] Breast cancer pathological image classification based on multi-task model
    Yu L.
    Xia Y.
    Wang P.
    Yan Y.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (08): : 53 - 57
  • [38] STATNet: Spectral and Temporal features based Multi-Task Network for Audio Spoofing Detection
    Ranjan, Rishabh
    Vatsa, Mayank
    Singh, Richa
    2022 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB), 2022,
  • [39] Multi-task sentiment classification model based on DistilBert and multi-scale CNN
    Xiong, Guanghao
    Yan, Ke
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 700 - 707
  • [40] Dataset for modulation classification and signal type classification for multi-task and single task learning
    Jagannath, Anu
    Jagannath, Jithin
    COMPUTER NETWORKS, 2021, 199