Deep Neural Network Calibration for E2E Speech Recognition System

Cited by: 1
Authors
Lee, Mun-Hak [1]
Chang, Joon-Hyuk [1]
Affiliations
[1] Hanyang Univ, Dept Elect Engn, Seoul, South Korea
Source
Keywords
E2E speech recognition; deep neural network calibration;
DOI
10.21437/Interspeech.2021-176
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification codes
100104; 100213;
Abstract
Cross-entropy loss, which is commonly used to train deep-neural-network-based (DNN) classification models, drives a model to assign a high probability to a single class. Networks trained this way tend to be overconfident, which causes a problem in the decoding process of a speech recognition system, because decoding uses the combined probability distribution of multiple independently trained networks. Overconfidence in a neural network can be quantified as a calibration error: the difference between the model's output probability and the likelihood that its answer is actually correct. We show that the deep-learning-based components of an end-to-end (E2E) speech recognition system with high classification accuracy contain calibration errors, and we quantify them using various calibration measures. In addition, we show experimentally that a calibration function trained to minimize calibration errors effectively mitigates those of the speech recognition system and, as a result, can improve the performance of beam search during decoding.
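To make the notion of calibration error concrete, the following is a minimal sketch and not the authors' implementation. The abstract does not name the calibration measures or the calibration function used, so this example assumes expected calibration error (ECE) as the measure and temperature scaling as the calibration function, both common choices. The function names, the bin count, and the toy logits are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over one logit vector; a temperature above 1 flattens the output."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def expected_calibration_error(confidences, correct, n_bins=10):
    """ECE: bin-size-weighted mean of |accuracy - mean confidence| per bin."""
    bins = [[] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        index = min(int(conf * n_bins), n_bins - 1)  # conf == 1.0 goes in the last bin
        bins[index].append((conf, hit))
    n = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, h in members if h) / len(members)
        ece += (len(members) / n) * abs(accuracy - mean_conf)
    return ece


def evaluate(logit_rows, labels, temperature=1.0):
    """Return (accuracy, ECE) of top-1 predictions at a given temperature."""
    confidences, correct = [], []
    for logits, label in zip(logit_rows, labels):
        probs = softmax(logits, temperature)
        top = max(range(len(probs)), key=probs.__getitem__)
        confidences.append(probs[top])
        correct.append(top == label)
    return sum(correct) / len(correct), expected_calibration_error(confidences, correct)


# Toy, made-up logits: every row is very confident, but only 3 of 4 are right.
toy_logits = [[6.0, 0.0, 0.0], [0.0, 6.0, 0.0], [0.0, 0.0, 6.0], [6.0, 0.0, 0.0]]
toy_labels = [0, 1, 2, 1]

acc_raw, ece_raw = evaluate(toy_logits, toy_labels, temperature=1.0)
acc_cal, ece_cal = evaluate(toy_logits, toy_labels, temperature=3.0)
print(f"accuracy={acc_raw:.2f}  ECE(T=1)={ece_raw:.3f}  ECE(T=3)={ece_cal:.3f}")
```

Dividing the logits by a temperature does not change which class has the highest score, so accuracy is unchanged, while the reported confidence moves closer to the observed accuracy on this toy data. In practice the temperature would be fitted on held-out data rather than picked by hand as it is here.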
Pages: 4064-4068
Page count: 5