DYNAMIC ADJUSTMENT OF LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION USING WORD SIMILARITY

被引:0
|
作者
Currey, Anna [1 ]
Illina, Irina
Fohr, Dominique
机构
[1] Univ Lorraine, LORIA, UMR 7503, F-54506 Vandoeuvre Les Nancy, France
关键词
ASR; language modeling; OOV; word embeddings; lexicon extension; VOCABULARY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Out-of-vocabulary (OOV) words can pose a particular problem for automatic speech recognition (ASR) of broadcast news. The language models (LMs) of ASR systems are typically trained on static corpora, whereas new words (particularly new proper nouns) are continually introduced in the media. Additionally, such OOVs are often content-rich proper nouns that are vital to understanding the topic. In this work, we explore methods for dynamically adding OOVs to language models by adapting the n-gram language model used in our ASR system. We propose two strategies: the first relies on finding in-vocabulary (IV) words similar to the OOVs, where word embeddings are used to define similarity. Our second strategy leverages a small contemporary corpus to estimate OOV probabilities. The models we propose yield improvements in perplexity over the baseline; in addition, the corpus-based approach leads to a significant decrease in proper noun error rate over the baseline in recognition experiments.
引用
收藏
页码:426 / 432
页数:7
相关论文
共 50 条
  • [1] Revisiting Dynamic Adjustment of Language Model Scaling Factor for Automatic Speech Recognition
    Sato, Hiroshi
    Moriya, Takafumi
    Shinohara, Yusuke
    Masumura, Ryo
    Fukutomi, Takaaki
    Matsui, Kiyoaki
    Ashihara, Takanori
    Yamaguchi, Yoshikazu
    Aono, Yushi
    [J]. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 186 - 191
  • [2] SEMANTIC WORD EMBEDDING NEURAL NETWORK LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Audhkhasi, Kartik
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5995 - 5999
  • [3] Word segments in category-based language models for automatic speech recognition
    Justo, Raquel
    Torres, M. Ines
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 1, PROCEEDINGS, 2007, 4477 : 249 - +
  • [4] EFFECT OF WORD-LENGTH AND SIMILARITY ON AUTOMATIC SPEECH RECOGNITION
    DEHAAN, HJ
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1984, 22 (04) : 272 - 272
  • [5] GEOGRAPHIC LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Xiao, Xiaoqiang
    Chen, Hong
    Zylak, Mark
    Sosa, Daniela
    Desu, Suma
    Krishnamoorthy, Mahesh
    Liu, Daben
    Paulik, Matthias
    Zhang, Yuchen
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6124 - 6128
  • [6] JOINT LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING
    Bayer, Ali Orkan
    Riccardi, Giuseppe
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 199 - 203
  • [7] AUTOMATIC SPEECH RECOGNITION USING PSYCHOACOUSTIC MODELS
    ZWICKER, E
    TERHARDT, E
    PAULUS, E
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (02): : 487 - 498
  • [8] Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
    Sak, Hasim
    Saraclar, Murat
    Gungor, Tunga
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2341 - 2351
  • [9] Automatic Speech Recognition for Irish: testing lexicons and language models
    Qian, Mengjie
    Berthelsen, Harald
    Lonergan, Liam
    Murphy, Andy
    O'Neill, Claire
    Chiarain, Neasa Ni
    Gobl, Christer
    Chasaide, Ailbhe Ni
    [J]. 2022 33RD IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2022,
  • [10] Neural Error Corrective Language Models for Automatic Speech Recognition
    Tanaka, Tomohiro
    Masumura, Ryo
    Masataki, Hirokazu
    Aono, Yushi
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 401 - 405