Personalization for BERT-based Discriminative Speech Recognition Rescoring

Authors
Kolehmainen, Jari [1]
Gu, Yile [1]
Gourav, Aditya [1]
Shivakumar, Prashanth Gurunath [1]
Gandhe, Ankur [1]
Rastrow, Ariya [1]
Bulyko, Ivan [1]
Affiliation
[1] Amazon, Bellevue, WA 98004 USA
Source
Keywords
speech recognition; rescoring; personalization; prompting; gazetteers
DOI
10.21437/Interspeech.2023-990
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Recognition of personalized content remains a challenge in end-to-end speech recognition. We explore three novel approaches that use personalized content in a neural rescoring step to improve recognition: gazetteers, prompting, and a cross-attention based encoder-decoder model. We use internal de-identified en-US data from interactions with a virtual voice assistant supplemented with personalized named entities to compare these approaches. On a test set with personalized named entities, we show that each of these approaches improves word error rate by over 10%, against a neural rescoring baseline. We also show that on this test set, natural language prompts can improve word error rate by 7% without any training and with a marginal loss in generalization. Overall, gazetteers were found to perform the best with a 10% improvement in word error rate (WER), while also improving WER on a general test set by 1%.
Pages: 366-370
Page count: 5