A comparison of lexicon-building methods for subword-based speech recognisers

被引:0
|
作者
Holter, T
Svendsen, T
机构
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A comparison of different algorithms for training of pronunciation dictionaries for use with subword-based speech recognises is given. An extension to existing sub-optimal solutions is presented, and is shown to give results close to the maximum likelihood solution. The DARPA Resource Management (RM) database was used for evaluating the lexicon-building algorithms. When compared to the initial lexicon derived from the DARPA RM-distribution, improvements of recognition rates have been obtained for all lexicons icons trained with the different criteria. The maximum likelihood solution resulted in an 11.5% reduction in word error rate, compared to the 10.5% reduction offered by the proposed sub-optimal method.
引用
收藏
页码:102 / 106
页数:5
相关论文
共 30 条
  • [1] SUBWORD-BASED LARGE-VOCABULARY SPEECH RECOGNITION
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    [J]. AT&T TECHNICAL JOURNAL, 1993, 72 (05): : 25 - 36
  • [2] Improving the Usage of Subword-Based Units for Turkish Speech Recognition
    Cetinkaya, Gozde
    Arisoy, Ebru
    Saraclar, Murat
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [3] Advances in subword-based HMM-DNN speech recognition across languages
    Smit, Peter
    Virpioja, Sami
    Kurimo, Mikko
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [4] Combining key-phrase detection and subword-based verification for flexible speech understanding
    Kawahara, T
    Lee, CH
    Juang, BH
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1159 - 1162
  • [5] Subword-based Position Specific Posterior Lattices (S-PSPL) for Indexing Speech Information
    Pan, Yi-cheng
    Chang, Hung-lin
    Chen, Berlin
    Lee, Lin-shan
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1597 - +
  • [6] A comparison of neural-based visual recognisers for speech activity detection
    Raza S.
    Cuayáhuitl H.
    [J]. International Journal of Speech Technology, 2023, 26 (3) : 599 - 608
  • [7] Comparison of methods for determining speech voicing based on tests performed on paired consonants and continuous speech
    Malucha, Jan
    Sigmund, Milan
    [J]. JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2022, 73 (05): : 359 - 362
  • [8] Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination
    Duzenli, Timur
    Ozkurt, Nalan
    [J]. ISTANBUL UNIVERSITY-JOURNAL OF ELECTRICAL AND ELECTRONICS ENGINEERING, 2011, 11 (01): : 1355 - 1362
  • [9] Comparison of intensity-based methods for automatic speech rate computation
    Elvira-Garcia, Wendy
    Farrus, Mireia
    Alminana, Juan Maria Garrido
    [J]. LOQUENS, 2022, 9 (1-2):
  • [10] A comparison of spectral smoothing methods for segment concatenation based speech synthesis
    Chappell, DT
    Hansen, JHL
    [J]. SPEECH COMMUNICATION, 2002, 36 (3-4) : 343 - 374