ASR ERROR DETECTION AND RECOGNITION RATE ESTIMATION USING DEEP BIDIRECTIONAL RECURRENT NEURAL NETWORKS

被引:0
|
作者
Ogawa, Atsunori [1 ]
Hori, Takaaki [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Tokyo, Japan
关键词
Automatic speech recognition; error detection; recognition rate estimation; deep bidirectional recurrent neural networks; generalization ability; LSTM;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recurrent neural networks (RNNs) have recently been applied as the classifiers for sequential labeling problems. In this paper, deep bidirectional RNNs (DBRNNs) are applied for the first time to error detection in automatic speech recognition (ASR), which is a sequential labeling problem. We investigate three types of ASR error detection tasks, i.e. confidence estimation, out-of-vocabulary word detection and error type classification. We also estimate recognition rates from the error type classification results. Experimental results show that the DBRNNs greatly outperform conditional random fields (CRFs), especially for the detection of infrequent error labels. The DBRNNs also slightly outperform the CRFs in recognition rate estimation. In addition, experiments using a reduced size of training data suggest that the DBRNNs have a better generalization ability than the CRFs owing to their word vector representation in a low-dimensional continuous space. As a result, the DBRNNs trained using only 20% of the training data show higher error detection performance than the CRFs trained using the full training data.
引用
收藏
页码:4370 / 4374
页数:5
相关论文
共 50 条
  • [1] Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks
    Ogawa, Atsunori
    Hori, Takaaki
    SPEECH COMMUNICATION, 2017, 89 : 70 - 83
  • [2] Speaker-Adapted Confidence Measures for ASR Using Deep Bidirectional Recurrent Neural Networks
    Angel Del-Agua, Miguel
    Gimenez, Adria
    Sanchis, Albert
    Civera, Jorge
    Juan, Alfons
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (07) : 1194 - 1202
  • [3] ASR ERROR DETECTION USING RECURRENT NEURAL NETWORK LANGUAGE MODEL AND COMPLEMENTARY ASR
    Tam, Yik-Cheung
    Lei, Yun
    Zheng, Jing
    Wang, Wen
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] Localisation in Wireless Networks using Deep Bidirectional Recurrent Neural Networks
    Lynch, David
    Ho, Lester
    MacDonald, Michael
    O'Neill, Michael
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] Metaphor Detection using Ensembles of Bidirectional Recurrent Neural Networks
    Brooks, Jennifer
    Youssef, Abdou
    FIGURATIVE LANGUAGE PROCESSING, 2020, : 244 - 249
  • [6] CONFIDENCE ESTIMATION AND DELETION PREDICTION USING BIDIRECTIONAL RECURRENT NEURAL NETWORKS
    Ragni, A.
    Li, Q.
    Gales, M. J. F.
    Wang, Y.
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 204 - 211
  • [7] System-independent ASR error detection and classification using Recurrent Neural Network
    Errattahia, Rahhal
    Hannani, Asmaa E. L.
    Hain, Thomas
    Ouahmane, Hassan
    COMPUTER SPEECH AND LANGUAGE, 2019, 55 : 187 - 199
  • [8] Electricity Theft Detection Using Deep Bidirectional Recurrent Neural Network
    Chen, Zhongtao
    Meng, De
    Zhang, Yufan
    Xin, Tinglin
    Xiao, Ding
    2020 22ND INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): DIGITAL SECURITY GLOBAL AGENDA FOR SAFE SOCIETY!, 2020, : 401 - 406
  • [9] English Grammar Error Detection Using Recurrent Neural Networks
    He, Zhenhui
    Scientific Programming, 2021, 2021
  • [10] English Grammar Error Detection Using Recurrent Neural Networks
    He, Zhenhui
    SCIENTIFIC PROGRAMMING, 2021, 2021