Spectrogram based multi-task audio classification

被引:0
|
作者
Yuni Zeng
Hua Mao
Dezhong Peng
Zhang Yi
机构
[1] Sichuan University,Machine Intelligence Laboratory, College of Computer Science
来源
关键词
Multi-task learning; Convolutional neural networks; Deep residual networks; Audio classification;
D O I
暂无
中图分类号
学科分类号
摘要
Audio classification is regarded as a great challenge in pattern recognition. Although audio classification tasks are always treated as independent tasks, tasks are essentially related to each other such as speakers’ accent and speakers’ identification. In this paper, we propose a Deep Neural Network (DNN)-based multi-task model that exploits such relationships and deals with multiple audio classification tasks simultaneously. We term our model as the gated Residual Networks (GResNets) model since it integrates Deep Residual Networks (ResNets) with a gate mechanism, which extract better representations between tasks compared with Convolutional Neural Networks (CNNs). Specifically, two multiplied convolutional layers are used to replace two feed-forward convolution layers in the ResNets. We tested our model on multiple audio classification tasks and found that our multi-task model achieves higher accuracy than task-specific models which train the models separately.
引用
收藏
页码:3705 / 3722
页数:17
相关论文
共 50 条
  • [41] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [42] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [43] Learning Temporal Resolution in Spectrogram for Audio Classification
    Liu, Haohe
    Liu, Xubo
    Kong, Qiuqiang
    Wang, Wenwu
    Plumbley, Mark D.
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13873 - 13881
  • [44] Multi-Task Convolutional Networks for Motor Imagery Classification Based on EEG and fNIRS
    Feng, Lufeng
    He, Qun
    Xu, Xiangyuan
    Jiang, Guoqian
    Xie, Ping
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2021, 168 : S199 - S199
  • [45] Supervised Machine Learning Based Multi-Task Artificial Intelligence Classification of Retinopathies
    Alam, Minhaj
    Le, David
    Lim, Jennifer, I
    Chan, Robison V. P.
    Yao, Xincheng
    JOURNAL OF CLINICAL MEDICINE, 2019, 8 (06)
  • [46] Brain Networks Classification Based on an Adaptive Multi-Task Convolutional Neural Networks
    Xing X.
    Ji J.
    Yao Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (07): : 1449 - 1459
  • [47] Autoencoder-based multi-task learning for imputation and classification of incomplete data
    Lai, Xiaochen
    Wu, Xia
    Zhang, Liyong
    APPLIED SOFT COMPUTING, 2021, 98
  • [48] Autoencoder-based multi-task learning for imputation and classification of incomplete data
    Lai, Xiaochen
    Wu, Xia
    Zhang, Liyong
    Applied Soft Computing, 2021, 98
  • [49] Visual-audio emotion recognition based on multi-task and ensemble learning with multiple features
    Hao M.
    Cao W.-H.
    Liu Z.-T.
    Wu M.
    Xiao P.
    Cao, Wei-Hua (weihuacao@cug.edu.cn), 1600, Elsevier B.V., Netherlands (391): : 42 - 51
  • [50] Concentric RadViz: Visual Exploration of Multi-Task Classification
    Piazentin Ono, Jorge Henrique
    Sikansi, Fabio
    Correa, Debora Cristina
    Paulovich, Fernando Vieira
    Paiva, Afonso
    Nonato, Luis Gustavo
    2015 28TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES, 2015, : 165 - 172