Lexicon Adaptation for Subword Speech Recognition

Times cited: 0
Authors
Mertens, Timo [1]
Schneider, Daniel [2]
Naess, Arild Brandrud [1]
Svendsen, Torbjorn [1]
Affiliations
[1] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, Trondheim, Norway
[2] Fraunhofer IAIS, St Augustin, Germany
DOI
10.1109/ASRU.2009.5373296
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present two approaches to adapt a syllable-based recognition lexicon in an Automatic Speech Recognition (ASR) setting. The motivation is to evaluate whether adaptation techniques commonly used on a word level can also be employed on a subword level. The first method predicts syllable variations, taking into account sub-syllabic phone cluster variations, and subsequently adapts the syllable lexicon. The second approach adds syllable bigrams to the lexicon to cope with acoustic confusability of subword units and syllable-inherent phone attachment ambiguities. We evaluate the methods on two German data sets, one consisting of planned and the other of spontaneous speech. Although the first method did not yield any improvement in the syllable error rate (SER), we could observe that the predicted confusions correlate with those observed in the test data. Bigram adaptation improved the SER by 1.3% and 0.8% absolute on the planned and spontaneous data sets, respectively.
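The abstract reports results as syllable error rate (SER) but does not spell out the scoring. The sketch below assumes SER is defined like the familiar word error rate, that is, the Levenshtein distance between reference and hypothesis syllable sequences divided by the reference length. The function names and the example syllables are illustrative assumptions, not taken from the paper.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two syllable sequences."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference syllable
                curr[j - 1] + 1,     # insertion of a hypothesis syllable
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def syllable_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """Assumed SER: (substitutions + deletions + insertions) / len(ref)."""
    if not ref:
        raise ValueError("reference must contain at least one syllable")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical example: one substituted syllable out of four.
reference = ["gu", "ten", "mor", "gen"]
hypothesis = ["gu", "tem", "mor", "gen"]
print(syllable_error_rate(reference, hypothesis))
```

Under this definition, an absolute improvement of 1.3% means the ratio drops by 0.013, for example from 0.300 to 0.287.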
Pages: 562 / +
Page count: 2