A category based approach for recognition of out-of-vocabulary words

被引:0
|
作者
Gallwitz, F
Noth, E
Niemann, H
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In almost all applications of automatic speech recognition, especially in spontaneous speech tasks, the recognizer vocabulary cannot cover all occurring words. There is always a significant amount of out-of-vocabulary words even when the vocabulary size is very large. In this paper we present a new approach for the integration of out-of-vocabulary words into statistical language models. We use category information for all words in the training corpus to define a function that gives an approximation of the out-of-vocabulary word emission probability for each word category. This information is integrated into the language models. Although we use a simple acoustic model for out-of-vocabulary words, we achieve a 6% reduction of word error rate on spontaneous speech data with about 5% out-of-vocabulary rate.
引用
收藏
页码:228 / 231
页数:4
相关论文
共 50 条
  • [21] Improving Abstractive Summarization by Training Masked Out-of-Vocabulary Words
    Lee, Tae-Seok
    Lee, Hyun-Young
    Kang, Seung-Shik
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2022, 18 (03): : 344 - 358
  • [22] Few-Shot Representation Learning for Out-Of-Vocabulary Words
    Hu, Ziniu
    Chen, Ting
    Chang, Kai-Wei
    Sun, Yizhou
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4102 - 4112
  • [23] Rejection of out-of-vocabulary words using phoneme confidence likelihood
    Jitsuhiro, T
    Takahashi, S
    Aikawa, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 217 - 220
  • [24] Exploiting Out-of-Vocabulary Words for Out-of-Domain Detection in Dialog Systems
    Ryu, Seonghan
    Lee, Donghyeon
    Lee, Gary Geunbae
    Kim, Kyungduk
    Noh, Hyungjong
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 165 - +
  • [25] Replacing Out-of-Vocabulary Words with an Appropriate Synonym Based on Word2VnCR
    Kim, Jeongin
    Hong, Taekeun
    Kim, Pankoo
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [26] Replacing Out-of-Vocabulary Words with an Appropriate Synonym Based on Word2VnCR
    Kim, Jeongin
    Hong, Taekeun
    Kim, Pankoo
    Mobile Information Systems, 2021, 2021
  • [27] USING SYNTHETIC AUDIO TO IMPROVE THE RECOGNITION OF OUT-OF-VOCABULARY WORDS IN END-TO-END ASR SYSTEMS
    Zheng, Xianrui
    Liu, Yulan
    Gunceler, Deniz
    Willett, Daniel
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5674 - 5678
  • [28] Out-of-vocabulary word rejection algorithm in Korean variable vocabulary word recognition
    Moon, KS
    Kim, YJ
    Kim, HR
    Chung, JH
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 53 - 56
  • [29] A Large Corpus of Product Reviews in Portuguese: Tackling Out-Of-Vocabulary Words
    Hartmann, Nathan S.
    Avanco, Lucas V.
    Balage, Pedro P.
    Duran, Magali S.
    Nunes, Maria G. V.
    Pardo, Thiago A. S.
    Aluisio, Sandra M.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3865 - 3871
  • [30] Online PLSA: Batch Updating Techniques Including Out-of-Vocabulary Words
    Bassiou, Nikoletta K.
    Kotropoulos, Constantine L.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (11) : 1953 - 1966