MULTI-TASK LEARNING IN DEEP NEURAL NETWORKS FOR IMPROVED PHONEME RECOGNITION

被引:0
|
作者
Seltzer, Michael L. [1 ]
Droppo, Jasha [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
关键词
Acoustic model; speech recognition; multi-task learning; deep neural network; TIMIT;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we demonstrate how to improve the performance of deep neural network (DNN) acoustic models using multi-task learning. In multi-task learning, the network is trained to perform both the primary classification task and one or more secondary tasks using a shared representation. The additional model parameters associated with the secondary tasks represent a very small increase in the number of trained parameters, and can be discarded at runtime. In this paper, we explore three natural choices for the secondary task: the phone label, the phone context, and the state context. We demonstrate that, even on a strong baseline, multi-task learning can provide a significant decrease in error rate. Using phone context, the phonetic error rate (PER) on TIMIT is reduced from 21.63% to 20.25% on the core test set, and surpassing the best performance in the literature for a DNN that uses a standard feed-forward network architecture.
引用
收藏
页码:6965 / 6969
页数:5
相关论文
共 50 条
  • [41] Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection
    Li, Jiakang
    Sun, Meng
    Zhang, Xiongwei
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1517 - 1522
  • [42] Multi-Task Deep Neural Networks for Multi-Document Reading Comprehension
    Liu, Chang
    Liu, Zhuang
    Lin, Wayne
    Zhao, Jun
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [43] Multi-Task Learning for Improved Recognition of Multiple Types of Acoustic Information
    Kim, Jae-Won
    Park, Hochong
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (10): : 1762 - 1765
  • [44] Multi-task deep learning approach for sound event recognition and tracking
    Chen, Tzung-Shi
    Chen, Ming-Ju
    Chen, Tzung-Cheng
    [J]. INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2024, 46 (02)
  • [45] Adaptive Feature Aggregation in Deep Multi-Task Convolutional Neural Networks
    Cui, Chaoran
    Shen, Zhen
    Huang, Jin
    Chen, Meng
    Xu, Mingliang
    Wang, Meng
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2133 - 2144
  • [46] Predicting human protein function with multi-task deep neural networks
    Fa, Rui
    Cozzetto, Domenico
    Wan, Cen
    Jones, David T.
    [J]. PLOS ONE, 2018, 13 (06):
  • [47] Deep Adaptive Feature Aggregation in Multi-task Convolutional Neural Networks
    Shen, Zhen
    Cui, Chaoran
    Huang, Jin
    Zong, Jian
    Chen, Meng
    Yin, Yilong
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2213 - 2216
  • [48] Deep Heterogeneous Multi-Task Metric Learning for Visual Recognition and Retrieval
    Gan, Shikang
    Luo, Yong
    Wen, Yonggang
    Liu, Tongliang
    Hu, Han
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1837 - 1845
  • [49] Multi-Task Deep Neural Networks for Multimodal Personality Trait Prediction
    Mujtaba, Dena F.
    Mahapatra, Nihar R.
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 85 - 91
  • [50] Pareto Multi-task Deep Learning
    Riccio, Salvatore D.
    Dyankov, Deyan
    Jansen, Giorgio
    Di Fatta, Giuseppe
    Nicosia, Giuseppe
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 132 - 141