Personalization for BERT-based Discriminative Speech Recognition Rescoring

被引:0
|
作者
Kolehmainen, Jari [1 ]
Gu, Yile [1 ]
Gourav, Aditya [1 ]
Shivakumar, Prashanth Gurunath [1 ]
Gandhe, Ankur [1 ]
Rastrow, Ariya [1 ]
Bulyko, Ivan [1 ]
机构
[1] Amazon, Bellevue, WA 98004 USA
来源
关键词
speech recognition; rescoring; personalization; prompting; gazetteers;
D O I
10.21437/Interspeech.2023-990
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recognition of personalized content remains a challenge in end-to-end speech recognition. We explore three novel approaches that use personalized content in a neural rescoring step to improve recognition: gazetteers, prompting, and a cross-attention based encoder-decoder model. We use internal de-identified en-US data from interactions with a virtual voice assistant supplemented with personalized named entities to compare these approaches. On a test set with personalized named entities, we show that each of these approaches improves word error rate by over 10%, against a neural rescoring baseline. We also show that on this test set, natural language prompts can improve word error rate by 7% without any training and with a marginal loss in generalization. Overall, gazetteers were found to perform the best with a 10% improvement in word error rate (WER), while also improving WER on a general test set by 1%.
引用
收藏
页码:366 / 370
页数:5
相关论文
共 50 条
  • [31] BERT-Based Approaches to Identifying Malicious URLs
    Su, Ming-Yang
    Su, Kuan-Lin
    [J]. SENSORS, 2023, 23 (20)
  • [32] BERT-Based Scientific Paper Quality Prediction
    Sasaki, Taiki
    Ito, Yasuaki
    Nakano, Koji
    Kasagi, Akihiko
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 212 - 223
  • [33] BERT-Based Stock Market Sentiment Analysis
    Lee, Chien-Cheng
    Gao, Zhongjian
    Tsai, Chun-Li
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN), 2020,
  • [34] BERT-based Conformal Predictor for Sentiment Analysis
    Maltoudoglou, Lysimachos
    Paisios, Andreas
    Papadopoulos, Harris
    [J]. CONFORMAL AND PROBABILISTIC PREDICTION AND APPLICATIONS, VOL 128, 2020, 128 : 269 - 284
  • [35] BBVD: A BERT-based Method for Vulnerability Detection
    Huang, Weichang
    Lin, Shuyuan
    Li, Chen
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 890 - 898
  • [36] BERT-Based GitHub Issue Report Classification
    Siddiq, Mohammed Latif
    Santos, Joanna C. S.
    [J]. 2022 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON NATURAL LANGUAGE-BASED SOFTWARE ENGINEERING (NLBSE 2022), 2022, : 33 - 36
  • [37] A UNIVERSAL BERT-BASED FRONT-END MODEL FOR MANDARIN TEXT-TO-SPEECH SYNTHESIS
    Bai, Zilong
    Hu, Beibei
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6074 - 6078
  • [38] Auxiliary Loss for BERT-Based Paragraph Segmentation
    Zhuo, Binggang
    Murata, Masaki
    Ma, Qing
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (01) : 58 - 67
  • [39] SIAMBERT: Siamese Bert-based Code Search
    Pena, Francisco J.
    Gonzalez, Angel Luis
    Pashami, Sepideh
    Al-Shishtawy, Ahmad
    Payberah, Amir H.
    [J]. 2022 34TH WORKSHOP OF THE SWEDISH ARTIFICIAL INTELLIGENCE SOCIETY (SAIS 2022), 2022, : 64 - 70
  • [40] BERTMap: A BERT-Based Ontology Alignment System
    He, Yuan
    Chen, Jiaoyan
    Antonyrajah, Denvar
    Horrocks, Ian
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 5684 - 5691