Towards the automatic generation of Arabic Lexical Recognition Tests using orthographic and phonological similarity maps

Times cited: 0
Authors
Salah, Saeed [1]
Nassar, Mohammad [1]
Zaghal, Raid [1]
Hamed, Osama [2]
Affiliations
[1] Al Quds Univ, Dept Comp Sci, IL-20002 Jerusalem, Israel
[2] Palestine Tech Univ, Comp Syst Engn Dept, Tulkarm, Palestine, Israel
Keywords
NLP; LRT; N-gram; Dialects; MSA; Orthographic; Phonological; ENGLISH; CORPUS;
DOI
10.1016/j.jksuci.2021.02.006
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Lexical Recognition Test (LRT) themes are among the main methods widely used to measure language proficiency in common languages such as English, German and Spanish. However, similar research for Arabic is still at the development stage, and existing proposals mainly use human-crafted methods. In this paper, a new methodology, based on a newly developed algorithm, is proposed with the aim of automatically constructing high-quality nonwords associated with a quick measurement of Arabic proficiency levels (Arabic LRT). The suggested algorithm automatically generates nonwords based on special characteristics of Arabic, namely orthography (spelling), phonology (pronunciation), n-grams and the word frequency map, which is an important factor in creating a multi-level test. The proposed algorithm was evaluated with the help of a large dataset of Arabic vocabulary. For this purpose, a Web-based application following the suggested methodology was designed and implemented to facilitate the process of collecting and analyzing learners' responses. The experimental results show that the LRT questions automatically generated by the proposed system confused the learners. This is clear from the confusion matrix, which showed that one third (1/3) of the generated nonwords were able to distract the learners (with accuracy 65%). Consequently, recall and precision have smaller values, 0.52 and 0.48, respectively. (c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
Pages: 8429 - 8439
Number of pages: 11
Related papers
50 in total
  • [21] AUTOMATIC EVALUATION OF TESTS USING PATTERN RECOGNITION TECHNIQUES
    Lacrama, Dan L.
    Gherhes, Vasile
    Karnyanszky, Tiberiu M.
    Crista, Ovidiu
    QUALITY MANAGEMENT IN HIGHER EDUCATION, VOL 2, 2010, : 99 - 102
  • [22] Automatic Generation of Semantic Features and Lexical Relations Using OWL Ontologies
    Al-Yahya, Maha
    Al-Khalifa, Hend
    Bahanshal, Alia
    Al-Oudah, Iman
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2011, 6716 : 15 - 26
  • [23] AUTOMATIC GENERATION OF EFFICIENT LEXICAL PROCESSORS USING FINITE STATE TECHNIQUES
    JOHNSON, WL
    PORTER, JH
    ACKLEY, SI
    ROSS, DT
    COMMUNICATIONS OF THE ACM, 1968, 11 (12) : 805 - &
  • [24] Image Recommendation for Automatic Report Generation using Semantic Similarity
    Hyun, Changhun
    Park, Hyeyoung
    2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 259 - 262
  • [25] AUTOMATIC RECOGNITION OF ARABIC CHARACTER USING LOGIC STATEMENTS .2. DEVELOPMENT OF RECOGNITION ALGORITHM
    NURULULA, A
    NOUH, A
    JOURNAL OF ENGINEERING SCIENCES, 1988, 14 (02) : 355 - 367
  • [26] Towards automatic recognition of fonts using genetic approach
    Sarfraz, M.
    Raza, S.A.
    Recent Advances in Computers, Computing and Communications, 2002, : 290 - 295
  • [27] Automatic generation of difficulty maps for datasets using neural network
    Sanches, Silvio Ricardo Rodrigues
    Custodio Jr, Elton
    Correa, Cleber Gimenez
    Oliveira, Claiton
    Freire, Valdinei
    Saito, Priscila Tiemi Maeda
    Bugatti, Pedro Henrique
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (25) : 66499 - 66516
  • [28] Towards Unsupervised Learning for Arabic Handwritten Recognition Using Deep Architectures
    Elleuch, Mohamed
    Tagougui, Najiba
    Kherallah, Monji
    NEURAL INFORMATION PROCESSING, PT I, 2015, 9489 : 363 - 372
  • [29] INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING
    Rasipuram, Ramya
    Razavi, Marzieh
    Magimai-Doss, Mathew
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5176 - 5180
  • [30] ARABIC SPEECH PRONUNCIATION RECOGNITION AND CORRECTION USING AUTOMATIC SPEECH RECOGNIZER (ASR)
    Dahan, H. B.
    Mannan, A.
    INTED2012: INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE, 2012, : 4009 - 4016