Improving the Usage of Subword-Based Units for Turkish Speech Recognition

被引:0
|
作者
Cetinkaya, Gozde [1 ]
Arisoy, Ebru [2 ]
Saraclar, Murat [1 ]
机构
[1] Bogazici Univ, Elekt Elekt Muhendisligi, Istanbul, Turkey
[2] MEF Univ, Elekt Elekt Muhendisligi, Istanbul, Turkey
关键词
speech recognition; language modelling; acoustic modelling;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Subword units are often utilized to achieve better performance in speech recognition because of the high number of observed words in agglutinative languages. In this study, the proper use of subword units is explored in recognition by a reconsideration of details such as silence modeling and position-dependent phones. A modified lexicon by finite-state transducers is implemented to represent the subword units correctly. Also, we experiment with different types of word boundary markers and achieve the best performance by adding a marker both to the left and right side of a subword unit. In our experiments on a Turkish broadcast news dataset, the subword models do outperform word-based models and naive subword implementations. Results show that using proper subword units leads to a relative word error rate (WER) reductions, which is 2.4%, compared with the word level automatic speech recognition (ASR) system for Turkish.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] SUBWORD-BASED LARGE-VOCABULARY SPEECH RECOGNITION
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    [J]. AT&T TECHNICAL JOURNAL, 1993, 72 (05): : 25 - 36
  • [2] Advances in subword-based HMM-DNN speech recognition across languages
    Smit, Peter
    Virpioja, Sami
    Kurimo, Mikko
    [J]. COMPUTER SPEECH AND LANGUAGE, 2021, 66
  • [3] A comparison of lexicon-building methods for subword-based speech recognisers
    Holter, T
    Svendsen, T
    [J]. 1996 IEEE TENCON - DIGITAL SIGNAL PROCESSING APPLICATIONS PROCEEDINGS, VOLS 1 AND 2, 1996, : 102 - 106
  • [4] Automatic generation of subword units for speech recognition systems
    Singh, R
    Raj, B
    Stern, RM
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (02): : 89 - 99
  • [5] SUBWORD UNITS FOR AUTOMATIC SPEECH RECOGNITION OF ANY VOCABULARY
    HOLMES, WJ
    PEARCE, DJB
    [J]. GEC JOURNAL OF RESEARCH, 1993, 11 (01): : 49 - 59
  • [6] LARGE VOCABULARY SPEECH RECOGNITION USING SUBWORD UNITS
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    [J]. SPEECH COMMUNICATION, 1993, 13 (3-4) : 263 - 279
  • [7] Reduced sets of subword units for continuous speech recognition of Portuguese
    dos Santos, SCB
    Alcaim, A
    [J]. ELECTRONICS LETTERS, 2000, 36 (06) : 586 - 588
  • [8] Combining key-phrase detection and subword-based verification for flexible speech understanding
    Kawahara, T
    Lee, CH
    Juang, BH
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1159 - 1162
  • [9] Subword-based Compact Reconstruction of Word Embeddings
    Sasaki, Shota
    Suzuki, Jun
    Inui, Kentaro
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3498 - 3508
  • [10] Combined optimisation of baseforms and model parameters in speech recognition based on acoustic subword units
    Holter, T
    Svendsen, T
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 199 - 206