Towards the automatic generation of Arabic Lexical Recognition Tests using orthographic and phonological similarity maps

Cited by: 0
Authors
Salah, Saeed [1]
Nassar, Mohammad [1]
Zaghal, Raid [1]
Hamed, Osama [2]
Affiliations
[1] Al Quds Univ, Dept Comp Sci, IL-20002 Jerusalem, Israel
[2] Palestine Tech Univ, Comp Syst Engn Dept, Tulkarm, Palestine, Israel
Keywords
NLP; LRT; N-gram; Dialects; MSA; Orthographic; Phonological; ENGLISH; CORPUS;
DOI
10.1016/j.jksuci.2021.02.006
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Lexical Recognition Tests (LRTs) are among the main methods widely used to measure language proficiency in common languages such as English, German and Spanish. However, similar research for Arabic is still under development, and existing proposals mainly rely on human-crafted methods. This paper proposes a new methodology, based on a newly developed algorithm, for automatically constructing high-quality nonwords for a quick measurement of Arabic proficiency levels (an Arabic LRT). The suggested algorithm automatically generates nonwords based on characteristics specific to Arabic: orthography (spelling), phonology (pronunciation), n-grams, and the word frequency map, which is an important factor in creating a multi-level test. The proposed algorithm was evaluated experimentally on a large dataset of Arabic vocabulary. For this purpose, a Web-based application, following the suggested methodology, was designed and implemented to facilitate the process of collecting and analyzing learners' responses. The experimental results show that the LRT questions automatically generated by the proposed system confused the learners: the confusion matrix shows that one third (1/3) of the generated nonwords were able to distract the learners (with an accuracy of 65%). Consequently, recall and precision have smaller values, 0.52 and 0.48, respectively. © 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
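To make the n-gram idea in the abstract concrete, the following is a minimal sketch of one possible nonword filter. It is not the authors' algorithm, which the abstract does not specify. It assumes a nonword can be formed by substituting a single letter in a real word and keeping only variants whose character bigrams all occur somewhere in the lexicon. The function names, the boundary marks `^` and `$`, and the six-word toy lexicon are hypothetical, and the phonological similarity map and word frequency map mentioned in the abstract are left out.

```python
from itertools import product


def char_bigrams(word):
    """Return the set of character bigrams of `word`, with ^ and $ as boundary marks."""
    padded = f"^{word}$"
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def generate_nonwords(word, lexicon):
    """Return single-letter substitutions of `word` that are absent from the
    lexicon but built only from bigrams attested in the lexicon.

    Illustrative assumption: each Unicode code point is treated as one letter,
    so diacritics and letter-shape variants are not normalized.
    """
    attested = set().union(*(char_bigrams(w) for w in lexicon))
    alphabet = sorted({ch for w in lexicon for ch in w})
    candidates = []
    for i, ch in product(range(len(word)), alphabet):
        if ch == word[i]:
            continue  # not a substitution
        variant = word[:i] + ch + word[i + 1:]
        if variant in lexicon:
            continue  # a real word, so not a nonword
        if char_bigrams(variant) <= attested:
            candidates.append(variant)  # looks word-like at the bigram level
    return candidates


# Hypothetical toy lexicon; a real test would draw on a large vocabulary list.
toy_lexicon = {"كتب", "كتاب", "كاتب", "مكتب", "درس", "مدرس"}
nonwords = generate_nonwords("كتب", toy_lexicon)
print(nonwords)
```

Requiring every bigram to be attested keeps the candidates orthographically plausible, which is what lets a nonword act as a distractor. A fuller version would presumably also rank candidates by similarity to real words and by the frequency band of the source word.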
Pages: 8429-8439
Number of pages: 11
Related papers
50 in total
  • [31] Automatic recognition of handwritten Arabic using maximally stable extremal region features
    Saeed, Usman
    Tahir, Muhammad
    AlGhamdi, Ahmed S.
    Alkatheiri, Mohammed S.
    OPTICAL ENGINEERING, 2020, 59 (05)
  • [32] Towards Automatic Persona Generation Using Social Media
    An, Jisun
    Cho, Hoyoun
    Kwak, Haewoon
    Jansen, Bernard J.
    Hassen, Mohammed Ziyaad
    2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 206 - 211
  • [33] Paraphrase identification and semantic text similarity analysis in Arabic news tweets using lexical, syntactic, and semantic features
    Al-Smadi, Mohammad
    Jaradat, Zain
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    INFORMATION PROCESSING & MANAGEMENT, 2017, 53 (03) : 640 - 652
  • [34] Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques
    Qin, Ying
    Lee, Tan
    Kong, Anthony Pak Hin
    Law, Sam Po
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [35] Towards an Automatic Gait Recognition System using Activity Recognition (Wearable Based)
    Bajrami, Gazmend
    Derawi, Mohammad Omar
    Bours, Patrick
    PROCEEDINGS OF THE 2011 3RD INTERNATIONAL WORKSHOP ON SECURITY AND COMMUNICATION NETWORKS (IWSCN 2011), 2011, : 23 - 30
  • [36] UTOPIA: Automatic Generation of Fuzz Driver using Unit Tests
    Jeong, Bokdeuk
    Jang, Joonun
    Yi, Hayoon
    Moon, Jiin
    Kim, Junsik
    Jeon, Intae
    Kim, Taesoo
    Shim, WooChul
    Hwang, Yong Ho
    2023 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, SP, 2023, : 2676 - 2692
  • [37] PRAXIS: Towards automatic cognitive assessment using gesture recognition
    Negin, Farhood
    Rodriguez, Pau
    Koperski, Michal
    Kerboua, Adlen
    Gonzalez, Jordi
    Bourgeois, Jeremy
    Chapoulie, Emmanuelle
    Robert, Philippe
    Bremond, Francois
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 106 : 21 - 35
  • [38] Towards automatic recognition of mining targets using an autonomous robot
    Quintana, J. Y.
    Garcia, R.
    Neumann, L.
    Campos, R.
    Weiss, T.
    Koeser, K.
    Mohrmann, J.
    Greinert, J.
    OCEANS 2018 MTS/IEEE CHARLESTON, 2018,
  • [39] Improved Arabic speech recognition system through the automatic generation of fine-grained phonetic transcriptions
    Alsharhan, Eiman
    Ramsay, Allan
    INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (02) : 343 - 353
  • [40] DYNAMIC ADJUSTMENT OF LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION USING WORD SIMILARITY
    Currey, Anna
    Illina, Irina
    Fohr, Dominique
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 426 - 432