Syllable-based Malay Word Stemmer

被引:0
|
作者
Lee, JunChoi [1 ]
Othman, Rosita Mohamad [1 ]
Mohamad, Nurul Zawiyah [1 ]
机构
[1] Univ Malaysia Sarawak, Fac Comp Sci & Informat Technol, Kota Samarahan 94300, Sarawak, Malaysia
关键词
Stemmer; Malay Text; Syllabification; rule-based;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word stemmer is one of the basic and crucial text processing tools in any languages. Word stemmer is not only useful in morphological study but also play an important role in word level context analysis. Due to the existence of prefix, suffix, infix and a combination of affixes in Malay word, it raises the complexity of performing stemming to Malay word. An approach to stem Malay word using syllabification algorithm is introduced. This approach performs stemming through comparing syllable in the word thus reduces the parsing processes. The approach shows high practicality as it produces a very high accuracy in the evaluation.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Research on Syllable-Based Language Model in Malay Speech Recognition
    Wei, Xiangfeng
    Zhang, Quan
    Yuan, Yi
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 150 - 155
  • [2] The syllable-based word length effect and stimulus set specificity
    Bireta, Tamra J.
    Neath, Ian
    Surprenant, Aimee M.
    [J]. PSYCHONOMIC BULLETIN & REVIEW, 2006, 13 (03) : 434 - 438
  • [3] The syllable-based word length effect and stimulus set specificity
    Tamra J. Bireta
    Ian Neath
    Aimée M. Surprenant
    [J]. Psychonomic Bulletin & Review, 2006, 13 : 434 - 438
  • [4] Computer-based Malay articulation training for Malay plosives at isolated, syllable and word level
    Ting, HN
    Yunus, J
    Vandort, S
    Wong, LC
    [J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1423 - 1426
  • [5] Malay Word Stemmer to Stem Standard and Slang Word Patterns on Social Media
    Kassim, Mohamad Nizam
    Maarof, Mohd Aizaini
    Zainal, Anazida
    Wahab, Amirudin Abdul
    [J]. DATA MINING AND BIG DATA, DMBD 2016, 2016, 9714 : 391 - 400
  • [6] Syllable-based reading improvement: Effects on word reading and reading comprehension in Grade 2
    Mueller, Bettina
    Richter, Tobias
    Karageorgos, Panagiotis
    [J]. LEARNING AND INSTRUCTION, 2020, 66
  • [7] Syllable-based Compression for XML Documents
    Chernik, Katsiaryna
    Lansky, Jan
    Galambos, Leo
    [J]. DATESO 2006 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 6TH ANNUAL INTERNATIONAL WORKSHOP, 2006, 176 : 21 - 31
  • [8] Syllable clustering and spectral discontinuity in syllable-based TTS systems
    Chen, FX
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 688 - 691
  • [9] Syllable-Based Burrows-Wheeler Transform
    Lansky, Jan
    Chernik, Katsiaryna
    Vlckova, Zuzana
    [J]. DATESO 2007 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 7TH ANNUAL INTERNATIONAL WORKSHOP, 2007, 235 : 1 - 10
  • [10] Training of Word Recognition with Willy Wordbear: A Syllable-Based Reading Promotion Program for Elementary School
    Mueller, Bettina
    Karageorgos, Panagiotis
    Richter, Tobias
    [J]. PRAXIS DER KINDERPSYCHOLOGIE UND KINDERPSYCHIATRIE, 2021, 70 (04) : 356 - 371