Stemmer and phonotactic rules to improve n-gram tagger-based Indonesian phonemicization

Cited by: 0
Authors
Suyanto, Suyanto [1]
Sunyoto, Andi [2]
Ismail, Rezza Nafi [1]
Rachmawati, Ema [1]
Maharani, Warih [1]
Affiliations
[1] Telkom Univ, Sch Comp, Bandung, Indonesia
[2] Univ Amikom Yogyakarta, Fac Comp Sci, Yogyakarta, Indonesia
Keywords
grapheme-to-phoneme conversion; Indonesian language; n-gram; Phonotactic rules; Stemmer; MODEL;
DOI
10.1016/j.jksuci.2021.01.006
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Phonemicization, or grapheme-to-phoneme conversion (G2P), is the process of converting a word into its pronunciation. It is an essential component of speech synthesis, speech recognition, and natural language processing. State-of-the-art G2P models based on deep learning (DL) generally give a low phoneme error rate (PER) and word error rate (WER) for high-resource languages, such as English and other European languages, but not for low-resource languages. Therefore, conventional machine learning (ML)-based G2P models that incorporate specific linguistic knowledge are preferable for low-resource languages. However, these models still perform poorly for several low-resource languages because of various issues. For instance, an Indonesian G2P model works well for roots but gives a high PER for derivatives. Most errors come from ambiguities in some roots and in derivative words containing one of four prefixes: <ber>, <meng>, <peng>, and <ter>. In this research, an Indonesian G2P model based on an n-gram tagger combined with a stemmer and phonotactic rules (NGTSP) is proposed to solve those problems. An evaluation based on 5-fold cross-validation, using 50k Indonesian words, shows that the proposed NGTSP gives a much lower PER (0.78%) than the state-of-the-art Transformer-based G2P model (1.14%). It also provides a much faster processing time. © 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
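The abstract reports results as PER and WER without defining them. As a minimal sketch, assuming the common definitions (PER as the Levenshtein distance between reference and predicted phoneme sequences divided by the number of reference phonemes, WER as the fraction of words with any phoneme error), the two metrics could be computed as follows. The function names, the toy word pairs, and the phoneme symbols are illustrative assumptions, not taken from the paper.

```python
from typing import List, Sequence, Tuple


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))  # distances for the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference symbols to reach an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def per_and_wer(pairs: List[Tuple[Sequence[str], Sequence[str]]]) -> Tuple[float, float]:
    """Return (PER, WER) over (reference, hypothesis) phoneme-sequence pairs."""
    total_edits = sum(edit_distance(ref, hyp) for ref, hyp in pairs)
    total_phonemes = sum(len(ref) for ref, _ in pairs)
    wrong_words = sum(1 for ref, hyp in pairs if list(ref) != list(hyp))
    return total_edits / total_phonemes, wrong_words / len(pairs)


# Toy data (hypothetical): one correct word, one word with a single substitution.
pairs = [
    (["m", "a", "k", "a", "n"], ["m", "a", "k", "a", "n"]),
    (["b", "ə", "r", "a", "t"], ["b", "e", "r", "a", "t"]),
]
per, wer = per_and_wer(pairs)
print(f"PER = {per:.2%}, WER = {wer:.2%}")
```

On this toy data, one phoneme out of ten is wrong and one word out of two contains an error. In a 5-fold cross-validation setting such as the one the abstract mentions, these counts would typically be accumulated per held-out fold and then averaged, though the record does not state the authors' exact procedure.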
Pages: 3807-3814
Number of pages: 8
Related papers
50 in total
  • [1] Augmented-syllabification of n-gram tagger for Indonesian words and named-entities
    Suyanto, Suyanto
    Sunyoto, Andi
    Ismail, Rezza Nafi
    Romadhony, Ade
    Sthevanie, Febryanti
    [J]. HELIYON, 2022, 8 (11)
  • [2] Improved N-gram Phonotactic Models For Language Recognition
    BenZeghiba, Mohamed Faouzi
    Gauvain, Jean-Luc
    Lamel, Lori
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2718 - 2721
  • [3] Regularized Subspace n-Gram Model for Phonotactic iVector Extraction
    Soufifar, Mehdi
    Burget, Lukas
    Plchot, Oldrich
    Cumani, Sandro
    Cernocky, Jan
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 74 - 78
  • [4] Searching Polyphonic Indonesian Folksongs Based on N-gram Indexing Technique
    Marsye, Aurora
    Adriani, Mirna
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 387 - 396
  • [5] Tokenization and N-gram for Indexing Indonesian Translation of the Quran
    Putra, Syopiansyah Jaya
    Gunawan, Muhamad Nur
    Suryatno, Agung
    [J]. 2018 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (ICOICT), 2018, : 158 - 161
  • [6] Comparison of different POS tagging techniques (n-gram, HMM and Brill's tagger) for Bangla
    Hasan, Fahim Muhammad
    UzZaman, Naushad
    Khan, Murnit
    [J]. ADVANCES AND INNOVATIONS IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2007, : 121 - 126
  • [7] Protein Classification Using N-gram Technique and Association Rules
    Kabli, Fatima
    Hamou, Reda Mohamed
    Amine, Abdelmalek
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2018, 6 (02) : 77 - 89
  • [8] Character n-Gram Embeddings to Improve RNN Language Models
    Takase, Sho
    Suzuki, Jun
    Nagata, Masaaki
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5074 - 5082
  • [9] Chinese Personal Name Recognition Using N-gram Model and Rules
    Chen Lin
    Zhang Hui
    Li Zhen'an
    [J]. 2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 450 - 453
  • [10] An ensemble text classification model combining strong rules and N-Gram
    Liu, Jinhong
    Lu, Yuliang
    [J]. ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2007, : 535 - +