Improving the Usage of Subword-Based Units for Turkish Speech Recognition

Cited by: 0

Authors:

Cetinkaya, Gozde [1]

Arisoy, Ebru [2]

Saraclar, Murat [1]

Affiliations:

[1] Bogazici Univ, Elekt Elekt Muhendisligi, Istanbul, Turkey

[2] MEF Univ, Elekt Elekt Muhendisligi, Istanbul, Turkey

Source:

2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2020

Keywords:

speech recognition; language modelling; acoustic modelling

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification code:

0808; 0809

Abstract:

Subword units are often used to improve speech recognition performance in agglutinative languages, where the number of distinct observed word forms is very high. This study explores the proper use of subword units in recognition by reconsidering details such as silence modeling and position-dependent phones. A modified lexicon is implemented with finite-state transducers to represent the subword units correctly. We also experiment with different types of word boundary markers and achieve the best performance by adding a marker to both the left and right sides of a subword unit. In our experiments on a Turkish broadcast news dataset, the subword models outperform both word-based models and naive subword implementations. Results show that using proper subword units yields a relative word error rate (WER) reduction of 2.4% compared with the word-level automatic speech recognition (ASR) system for Turkish.
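The boundary-marker idea mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the marker symbol `+`, the function names `mark_subwords` and `join_subwords`, and the example segmentation are all assumptions made for illustration, since the abstract names none of them. The sketch shows three marking styles (left, right, both sides) and how a recognizer's subword output could be glued back into words.

```python
from typing import List

MARK = "+"  # hypothetical marker symbol; the abstract does not specify one


def mark_subwords(pieces: List[str], style: str = "both") -> List[str]:
    """Attach boundary markers to the subword pieces of ONE word.

    style="left":  marker on the left of every non-initial piece
    style="right": marker on the right of every non-final piece
    style="both":  both of the above
    """
    if style not in {"left", "right", "both"}:
        raise ValueError(f"unknown style: {style}")
    last = len(pieces) - 1
    marked = []
    for i, piece in enumerate(pieces):
        left = MARK if style in {"left", "both"} and i > 0 else ""
        right = MARK if style in {"right", "both"} and i < last else ""
        marked.append(left + piece + right)
    return marked


def join_subwords(tokens: List[str]) -> List[str]:
    """Rebuild words from a marked token sequence (works for all three styles)."""
    words: List[str] = []
    glue_next = False  # True when the previous token ended with a marker
    for tok in tokens:
        starts = tok.startswith(MARK)
        ends = tok.endswith(MARK)
        begin = len(MARK) if starts else 0
        stop = len(tok) - len(MARK) if ends else len(tok)
        core = tok[begin:stop]
        if words and (starts or glue_next):
            words[-1] += core  # continuation of the current word
        else:
            words.append(core)  # start of a new word
        glue_next = ends
    return words


# Hypothetical segmentation of the Turkish word "evlerimizde".
both = mark_subwords(["ev", "ler", "imiz", "de"], style="both")
restored = join_subwords(both + ["haber"])
```

With markers on both sides, every word-internal boundary is signalled twice, so a continuation piece is always distinguishable from a word-initial or word-final one; whether this redundancy is what drives the reported gain is not stated in the abstract.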


Pages: 4
