Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition

被引:10
|
作者
Lu, Liang [1 ]
Renals, Steve [1 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
speech recognition; highway network; small footprint deep learning;
D O I
10.21437/Interspeech.2016-39
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
For speech recognition, deep neural networks (DNNs) have significantly improved the recognition accuracy in most of benchmark datasets and application domains. However, compared to the conventional Gaussian mixture models, DNN-based acoustic models usually have much larger number of model parameters, making it challenging for their applications in resource constrained platforms, e.g., mobile devices. In this paper, we study the application of the recently proposed highway network to train small-footprint DNNs, which are thinner and deeper, and have significantly smaller number of model parameters compared to conventional DNNs. We investigated this approach on the AMI meeting speech transcription corpus which has around 80 hours of audio data. The highway neural networks constantly outperformed their plain DNN counterparts, and the number of model parameters can be reduced significantly without sacrificing the recognition accuracy.
引用
收藏
页码:12 / 16
页数:5
相关论文
共 50 条
  • [1] Small-Footprint Highway Deep Neural Networks for Speech Recognition
    Lu, Liang
    Renals, Steve
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1502 - 1511
  • [2] Using Highway Connections to Enable Deep Small-footprint LSTM-RNNs for Speech Recognition
    Cheng Gaofeng
    Li Xin
    Yan Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (01) : 107 - 112
  • [3] Using Highway Connections to Enable Deep Small-footprint LSTM-RNNs for Speech Recognition
    CHENG Gaofeng
    LI Xin
    YAN Yonghong
    [J]. Chinese Journal of Electronics, 2019, 28 (01) : 107 - 112
  • [4] KNOWLEDGE DISTILLATION FOR SMALL-FOOTPRINT HIGHWAY NETWORKS
    Lu, Liang
    Guo, Michelle
    Renals, Steve
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4820 - 4824
  • [5] SMALL-FOOTPRINT KEYWORD SPOTTING USING DEEP NEURAL NETWORKS
    Chen, Guoguo
    Parada, Carolina
    Heigold, Georg
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] PocketSUMMIT: Small-Footprint Continuous Speech Recognition
    Hetherington, I. Lee
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2173 - 2176
  • [7] Structure Growth for Small-Footprint Speech Recognition
    Wu, Jiayao
    Tang, Zhiyuan
    Wang, Dong
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 461 - 465
  • [8] Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition
    Hsu, Wei-Ning
    Zhang, Yu
    Lee, Ann
    Glass, James
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 395 - 399
  • [9] Convolutional Neural Networks for Small-footprint Keyword Spotting
    Sainath, Tara N.
    Parada, Carolina
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1478 - 1482
  • [10] Reduced Model Size Deep Convolutional Neural Networks for Small-Footprint Keyword Spotting
    Tsai, Tsung Han
    Lin, Xin Hui
    [J]. 2021 28TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS (IEEE ICECS 2021), 2021,