Syllable-based Malay Word Stemmer

被引:0
|
作者
Lee, JunChoi [1 ]
Othman, Rosita Mohamad [1 ]
Mohamad, Nurul Zawiyah [1 ]
机构
[1] Univ Malaysia Sarawak, Fac Comp Sci & Informat Technol, Kota Samarahan 94300, Sarawak, Malaysia
关键词
Stemmer; Malay Text; Syllabification; rule-based;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word stemmer is one of the basic and crucial text processing tools in any languages. Word stemmer is not only useful in morphological study but also play an important role in word level context analysis. Due to the existence of prefix, suffix, infix and a combination of affixes in Malay word, it raises the complexity of performing stemming to Malay word. An approach to stem Malay word using syllabification algorithm is introduced. This approach performs stemming through comparing syllable in the word thus reduces the parsing processes. The approach shows high practicality as it produces a very high accuracy in the evaluation.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing
    Ichikawa, Ken
    Tsuge, Satoru
    Kitaoka, Norihide
    Takeda, Kazuya
    Kita, Kenji
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [32] Performance of LVCSR with morpheme-based and syllable-based recognition units
    Kwon, OW
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1567 - 1570
  • [33] SYLLABLE-BASED SPEECH RECOGNITION USING ELECTROMYOGRAPHY AND DECISION SET CLASSIFIER
    Topalovic, Marko
    Damnjanovic, Dorde
    Peulic, Aleksandar
    Blagojevic, Milan
    Filipovic, Nenad
    [J]. BIOMEDICAL ENGINEERING-APPLICATIONS BASIS COMMUNICATIONS, 2015, 27 (02):
  • [34] On the Utility of Syllable-Based Acoustic Models for Pronunciation Variation Modelling
    Annika Hämäläinen
    Lou Boves
    Johan de Veth
    Louis ten Bosch
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [35] EMERGENT STOPS IN ENGLISH AND IN POLISH: AGAINST SYLLABLE-BASED ACCOUNTS
    Czaplicki, Bartlomiej
    [J]. POZNAN STUDIES IN CONTEMPORARY LINGUISTICS, 2010, 46 (02): : 177 - 191
  • [36] Development of syllable-based text to speech synthesis system in Bengali
    Narendra, N.
    Rao, K.
    Ghosh, Krishnendu
    Vempada, Ramu
    Maity, Sudhamay
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (03) : 167 - 181
  • [37] Syllable-Based Acoustic Modeling with CTC-SMBR-LSTM
    Qu, Zhongdi
    Haghani, Parisa
    Weinstein, Eugene
    Moreno, Pedro
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 173 - 177
  • [38] Automatic syllable-based phoneme recognition using ESTER corpus
    Le Blouch, Olivier
    Collen, Patrice
    [J]. PROCEEDINGS OF THE 7TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION (ISCGAV'-07), 2007, : 77 - +
  • [39] Detecting Laughter and Filled Pauses Using Syllable-based Features
    An, Gouzhen
    Brizan, David Guy
    Rosenberg, Andrew
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 178 - 181
  • [40] A syllable-based pseudo-articulatory approach to speech recognition
    Zhang, L
    [J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 78 - 83