A comparison of lexicon-building methods for subword-based speech recognisers

被引：0

作者：

Holter, T

Svendsen, T

机构：

来源：

1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2 | 1996年

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

A comparison of different algorithms for training of pronunciation dictionaries for use with subword-based speech recognises is given. An extension to existing sub-optimal solutions is presented, and is shown to give results close to the maximum likelihood solution. The DARPA Resource Management (RM) database was used for evaluating the lexicon-building algorithms. When compared to the initial lexicon derived from the DARPA RM-distribution, improvements of recognition rates have been obtained for all lexicons icons trained with the different criteria. The maximum likelihood solution resulted in an 11.5% reduction in word error rate, compared to the 10.5% reduction offered by the proposed sub-optimal method.

引用

页码：102 / 106

页数：5

共 30 条

[1] SUBWORD-BASED LARGE-VOCABULARY SPEECH RECOGNITION
LEE, CH
GAUVAIN, JL
PIERACCINI, R
RABINER, LR
[J]. AT&T TECHNICAL JOURNAL, 1993, 72 (05): : 25 - 36
[2] Improving the Usage of Subword-Based Units for Turkish Speech Recognition
Cetinkaya, Gozde
Arisoy, Ebru
Saraclar, Murat
[J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[3] Advances in subword-based HMM-DNN speech recognition across languages
Smit, Peter
Virpioja, Sami
Kurimo, Mikko
[J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
[4] Combining key-phrase detection and subword-based verification for flexible speech understanding
Kawahara, T
Lee, CH
Juang, BH
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1159 - 1162
[5] Subword-based Position Specific Posterior Lattices (S-PSPL) for Indexing Speech Information
Pan, Yi-cheng
Chang, Hung-lin
Chen, Berlin
Lee, Lin-shan
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1597 - +
[6] A comparison of neural-based visual recognisers for speech activity detection
Raza S.
Cuayáhuitl H.
[J]. International Journal of Speech Technology, 2023, 26 (3) : 599 - 608
[7] Comparison of methods for determining speech voicing based on tests performed on paired consonants and continuous speech
Malucha, Jan
Sigmund, Milan
[J]. JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2022, 73 (05): : 359 - 362
[8] Comparison OF Wavelet Based Feature Extraction Methods for Speech/Music Discrimination
Duzenli, Timur
Ozkurt, Nalan
[J]. ISTANBUL UNIVERSITY-JOURNAL OF ELECTRICAL AND ELECTRONICS ENGINEERING, 2011, 11 (01): : 1355 - 1362
[9] Comparison of intensity-based methods for automatic speech rate computation
Elvira-Garcia, Wendy
Farrus, Mireia
Alminana, Juan Maria Garrido
[J]. LOQUENS, 2022, 9 (1-2):
[10] A comparison of spectral smoothing methods for segment concatenation based speech synthesis
Chappell, DT
Hansen, JHL
[J]. SPEECH COMMUNICATION, 2002, 36 (3-4) : 343 - 374

← 1 2 3 →