Densely Connected Networks for Conversational Speech Recognition

被引:5
|
作者
Han, Kyu J. [1 ]
Chandrashekaran, Akshay [1 ]
Kim, Jungsuk [1 ]
Lane, Ian [1 ]
机构
[1] Capio Inc, Belmont, CA 94002 USA
关键词
Densely connected LSTM; Switchboard; conversational speech recognition;
D O I
10.21437/Interspeech.2018-1486
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we show how we have achieved the state-of-the-art performance on the industry-standard NIST 2000 Hub5 English evaluation set. We propose densely connected LSTMs (namely, dense LSTMs), inspired by the densely connected convolutional neural networks recently introduced for image classification tasks. It is shown that the proposed dense LSTMs would provide more reliable performance as compared to the conventional, residual LSTMs as more LSTM layers are stacked in neural networks. With RNN-LM rescoring and lattice combination on the 5 systems (including 2 dense LSTM based systems) trained across three different phone sets, Capio's conversational speech recognition system has obtained 5.0% and 9.1% on Switchboard and CallHome, respectively.
引用
收藏
页码:796 / 800
页数:5
相关论文
共 50 条
  • [1] Densely connected convolutional networks for speech recognition
    Li, Chia Yu
    Vu, Ngoc Thang
    [J]. Speech Communication - 13th ITG-Fachtagung Sprachkommunikation, 2020, : 321 - 325
  • [2] Face Recognition Based on Densely Connected Convolutional Networks
    Zhang, Tong
    Wang, Rong
    Ding, Jianwei
    Li, Xin
    Li, Bo
    [J]. 2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [3] Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition
    Tang, Jian
    Song, Yan
    Dai, LiRong
    McLoughlin, Ian
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1783 - 1787
  • [4] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [5] Speech densely connected convolutional networks for small-footprint keyword spotting
    Tsai, Tsung-Han
    Lin, Xin-Hui
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 39119 - 39137
  • [6] Speech densely connected convolutional networks for small-footprint keyword spotting
    Tsung-Han Tsai
    Xin-Hui Lin
    [J]. Multimedia Tools and Applications, 2023, 82 : 39119 - 39137
  • [7] Conversational telephone speech recognition
    Gauvain, JL
    Lamel, L
    Schwenk, H
    Adda, G
    Chen, L
    Lefèvre, F
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 212 - 215
  • [8] Asymmetric convolution with densely connected networks
    Wang, Liejun
    Wen, Huanglu
    Qin, Jiwei
    Cheng, Shuli
    [J]. INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS, 2020, 12 (03) : 274 - 284
  • [9] Speech adaptation using neural networks for connected digit recognition
    Cheng, XL
    Wang, H
    Li, ZG
    [J]. ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 2401 - 2404
  • [10] Recognition of Interest in Human Conversational Speech
    Schuller, Bjoern
    Koehler, Niels
    Mueller, Ronald
    Rigoll, Gerhard
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 793 - 796