Transliterating Latin to Amharic scripts using user-defined rules and character mappings

被引:0
|
作者
Zeleke Abebaw
Andreas Rauber
Solomon Atnafu
机构
[1] Addis Ababa University,IT Doctoral Program
[2] Technical University of Vienna,Institute of Information Systems Engineering
[3] Addis Ababa University,Department of Computer Science
关键词
Latin to Amharic script transliteration; Transliteration; Amharic language; Natural language processing; Rule-based transliteration; Social media text transliteration; Character mapping;
D O I
暂无
中图分类号
学科分类号
摘要
As social media platforms become increasingly accessible, individuals’ usage of new forms of textual communication (posts, comments, chats, etc.) on social media using local language scripts such as Amharic has increased tremendously. However, many users prefer to post comments in Latin scripts instead of local ones due to the availability of more convenient forms of character input using Latin keyboards. In existing Latin to Amharic transliteration systems, missing consideration of double consonants and double vowels has caused transliteration errors. Further, as there are multiple ways of character mapping conventions in existing systems, social media texts are susceptible to a wide variety of user adoptions during script production. The current systems have failed to address these gaps and adoptions. In this work, we present the RBLatAm (Rule-Based Latin to Amharic) transliteration system, a generic rule-based system that converts Amharic words which have been written using Latin script back into their native Amharic script. The system is based on mapping rules engineered from three existing transliteration systems (Microsoft, Google, SERA) and additional rules for double consonants, and conventions adopted on social media by speakers of Amharic. When tested on transliterated Amharic words of non-named entities, and named entities of persons, the system achieves an accuracy of 75.8% and 84.6%, respectively. The system also correctly transliterates words reported as errors in previous studies. This system drastically improves the basis for performing research on text mining for Amharic language texts by being able to process such texts even if they have originally been produced in Latin scripts.
引用
收藏
页码:63 / 75
页数:12
相关论文
共 50 条
  • [1] Transliterating Latin to Amharic scripts using user-defined rules and character mappings
    Abebaw, Zeleke
    Rauber, Andreas
    Atnafu, Solomon
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2023, 24 (01) : 63 - 75
  • [2] A Directive Generation Approach Using User-defined Rules
    Komatsu, Kazuhiko
    Egawa, Ryusuke
    Takizawa, Hiroyuki
    Kobayashi, Hiroaki
    [J]. 2016 FOURTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2016, : 515 - 521
  • [3] Inference of user-defined type qualifiers and qualifier rules
    Chin, B
    Markstrum, S
    Millstein, T
    Palsberg, J
    [J]. PROGRAMMING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2006, 3924 : 264 - 278
  • [4] HIERARCHICAL ANALYSIS OF IC ARTWORK WITH USER-DEFINED RULES
    SCHEFFER, LK
    SOETARMAN, R
    [J]. IEEE DESIGN & TEST OF COMPUTERS, 1986, 3 (01): : 66 - 74
  • [5] An approach for proactive mobile recommendations based on user-defined rules
    Ilarri, Sergio
    Trillo-Lado, Raquel
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 242
  • [6] An approach for proactive mobile recommendations based on user-defined rules
    Ilarri, Sergio
    Trillo-Lado, Raquel
    [J]. Expert Systems with Applications, 2024, 242
  • [7] Parallelizing User-Defined Aggregations using Symbolic Execution
    Raychev, Veselin
    Musuvathi, Madanlal
    Mytkowicz, Todd
    [J]. SOSP'15: PROCEEDINGS OF THE TWENTY-FIFTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2015, : 153 - 167
  • [8] Data redistribution using MPI user-defined types
    Yang, CS
    Bai, SW
    [J]. FIRST INTERNATIONAL SYMPOSIUM ON CYBER WORLDS, PROCEEDINGS, 2002, : 47 - 53
  • [9] Implementing an Inference Engine for RDFS/OWL constructs and user-defined rules in oracle
    Wu, Zhe
    Eadon, George
    Das, Souripriya
    Chong, Eugene Inseok
    Kolovski, Vladimir
    Annamalai, Melliyal
    Srinivasan, Jagannathan
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1239 - +
  • [10] Implementing an Inference Engine for RDFS/OWL Constructs and User-Defined Rules in HBase
    Liu, Zhengbo
    Yao, Wenbin
    Wang, Dongbin
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG 2017), 2017, : 159 - 164