A Spell Checker for a Low-resourced and Morphologically Rich Language

Cited by: 0
Authors
Octaviano, Manolito, Jr. [1 ]
Borra, Allan [1 ]
Affiliation
[1] De La Salle Univ, Coll Comp Studies, Manila, Philippines
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
Spell checking plays an important role in improving document quality by identifying misspelled words. Considerable effort has gone into spell checkers for other languages, such as English, whose spell checking systems (e.g. Microsoft Word) are close to perfect. However, few efforts have been made to develop an efficient Filipino spell checker. One major challenge for existing Filipino spell checkers, which are dictionary-based, is the lack of a complete dictionary that captures all inflected forms (e.g. isinasama 'including', isasama 'will be included', and isinama 'included', with the base form sama 'include'), borrowings (e.g. magtex 'to text' and nagtex 'texted'), and code-switched forms (e.g. magtext 'to text' and nag-text 'texted', with the base form 'text') of a word. In addition, existing systems cannot handle code-switching, so valid words are marked as erroneous. In this research, a spell checker is designed for Filipino, a low-resourced and morphologically rich language. It detects and corrects typographical errors in the language and introduces a modified version of the Metaphone algorithm for ranking candidate suggestions. The system achieves 81% recall, 53.64% precision, 64.53% F-measure, and 87.78% suggestion adequacy on 100 sentences taken from exercise documents of Filipino students.
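The reported F-measure can be related to the reported recall and precision with a short calculation. The sketch below assumes the F-measure is the balanced F1 score (the harmonic mean of precision and recall), which the abstract does not state explicitly; the function name `f_measure` is illustrative and not taken from the paper.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F1: harmonic mean of precision and recall (assumed definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract, as percentages.
precision = 53.64
recall = 81.0

f1 = f_measure(precision, recall)
print(f"F-measure: {f1:.2f}%")
```

Under this assumption the result is about 64.54%, which is close to the reported 64.53%; the small difference is consistent with rounding in the published recall and precision.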
Pages: 1853-1856
Page count: 4