Spectrogram based multi-task audio classification

Authors
Yuni Zeng
Hua Mao
Dezhong Peng
Zhang Yi
Affiliation
[1] Sichuan University, Machine Intelligence Laboratory, College of Computer Science
Keywords
Multi-task learning; Convolutional neural networks; Deep residual networks; Audio classification
DOI
Not available
Abstract
Audio classification is regarded as a great challenge in pattern recognition. Although audio classification tasks are usually treated as independent, they are essentially related to each other; accent recognition and speaker identification are one example. In this paper, we propose a Deep Neural Network (DNN)-based multi-task model that exploits such relationships and handles multiple audio classification tasks simultaneously. We term our model the gated Residual Networks (GResNets) model because it integrates Deep Residual Networks (ResNets) with a gate mechanism, which extracts better representations between tasks than Convolutional Neural Networks (CNNs) do. Specifically, two multiplied convolutional layers replace two feed-forward convolutional layers in the ResNets. We tested our model on multiple audio classification tasks and found that our multi-task model achieves higher accuracy than task-specific models trained separately.
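The gated residual idea can be illustrated with a minimal sketch. This is one plausible reading of "two multiplied convolutional layers", not the authors' implementation: it assumes the second convolution passes through a sigmoid and acts as an element-wise gate on the first, and that the product is added to the skip connection. For brevity it uses 1-D convolutions over a plain list instead of 2-D convolutions over a spectrogram. All names (`conv1d_same`, `gated_residual_unit`, `multi_task_forward`) and the task labels are hypothetical.

```python
import math


def conv1d_same(x, kernel):
    """1-D cross-correlation with zero padding; output length equals len(x).

    Assumes an odd-length kernel so the padding is symmetric.
    """
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(x) + [0.0] * pad
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(x))
    ]


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def gated_residual_unit(x, w_feat, w_gate):
    """Residual unit whose branch is the product of two convolutions.

    Assumption: output = x + conv(x, w_feat) * sigmoid(conv(x, w_gate)).
    """
    feat = conv1d_same(x, w_feat)
    gate = [sigmoid(g) for g in conv1d_same(x, w_gate)]
    return [xi + f * g for xi, f, g in zip(x, feat, gate)]


def linear_head(features, weights):
    """Task-specific output layer: one weight row per class, returns logits."""
    return [sum(w * f for w, f in zip(row, features)) for row in weights]


def multi_task_forward(x, trunk_params, heads):
    """Shared gated residual trunk followed by one linear head per task."""
    h = list(x)
    for w_feat, w_gate in trunk_params:
        h = gated_residual_unit(h, w_feat, w_gate)
    return {task: linear_head(h, w) for task, w in heads.items()}


# Toy usage: one shared unit, two hypothetical tasks.
x = [1.0, 2.0, 3.0]
trunk = [([0.0, 1.0, 0.0], [0.0, 0.0, 0.0])]  # identity features, gate fixed at 0.5
heads = {
    "accent": [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    "speaker": [[0.0, 0.0, 1.0]],
}
logits = multi_task_forward(x, trunk, heads)
print(logits)
```

In a multi-task setup of this shape, the trunk parameters receive gradients from every task while each head is trained only on its own labels. That sharing is what lets related tasks inform one another.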
Pages: 3705-3722
Page count: 17