Assessing English language sentences readability using machine learning models

被引:0
|
作者
Maqsood S. [1 ]
Shahid A. [1 ]
Afzal M.T. [2 ]
Roman M. [1 ]
Khan Z. [3 ]
Nawaz Z. [4 ]
Aziz M.H. [5 ]
机构
[1] Institute of Computing, Kohat University of Science and Technology, KPK, Kohat
[2] NAMAL Institue of Mianwali, Punjab, Mianwali
[3] Robotics and Internet of Things Lab, Prince Sultan University, Riyadh
[4] Department of Data Science, Faculty of Computing and Information Technology, University of the Punjab, Punjab, Lahore
[5] Mechanical Engineering Department, University of Sargodha, Punjab, Sargodha
关键词
Flesch-kincaid; Language learning; Machine learning; Natural language processing; Sentence readability;
D O I
10.7717/PEERJ-CS.818
中图分类号
学科分类号
摘要
Readability is an active field of research in the late nineteenth century and vigorously persuaded to date. The recent boom in data-driven machine learning has created aviable path forward for readability classification and ranking. The evaluation oftext readability is a time-honoured issue with even more relevance in today’sinformation-rich world. This paper addresses the task of readability assessment forthe English language. Given the input sentences, the objective is to predict its level ofreadability, which corresponds to the level of literacy anticipated from the targetreaders. This readability aspect plays a crucial role in drafting and comprehendingprocesses of English language learning. Selecting and presenting a suitable collectionof sentences for English Language Learners may play a vital role in enhancingtheir learning curve. In this research, we have used 30,000 English sentences forexperimentation. Additionally, they have been annotated into seven differentreadability levels using Flesch Kincaid. Later, various experiments were conductedusing five Machine Learning algorithms, i.e., KNN, SVM, LR, NB, and ANN.The classification models render excellent and stable results. The ANN modelobtained an F-score of 0.95% on the test set. The developed model may be used ineducation setup for tasks such as language learning, assessing the reading and writingabilities of a learner © Copyright 2022 Maqsood et al
引用
收藏
相关论文
共 50 条
  • [1] Assessing English language sentences readability using machine learning models
    Maqsood, Shazia
    Shahid, Abdul
    Afzal, Muhammad Tanvir
    Roman, Muhammad
    Khan, Zahid
    Nawaz, Zubair
    Aziz, Muhammad Haris
    PEERJ COMPUTER SCIENCE, 2022, 7
  • [2] Assessing Readability of Learning Materials on Artificial Intelligence in English for Second Language Learners
    Ehara, Yo
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS AND DOCTORAL CONSORTIUM, PT II, 2022, 13356 : 475 - 478
  • [3] Automatic readability assessment for sentences: neural, hybrid and large language models
    Liu, Fengkai
    Jin, Tan
    Lee, John S. Y.
    LANGUAGE RESOURCES AND EVALUATION, 2025,
  • [4] A Machine Learning Approach for English Sentences Classifier
    Al-Neami, Ahmed
    Al-Saedy, Hasan
    Richard, Gilles
    INNOVATION AND SUSTAINABLE COMPETITIVE ADVANTAGE: FROM REGIONAL DEVELOPMENT TO WORLD ECONOMIES, VOLS 1-5, 2012, : 80 - 85
  • [5] Readability Evaluation of Books in Chinese as a Foreign Language Using the Machine Learning Algorithm
    Ji, Qiong
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [6] Performance of different KNN models in prediction english language readability
    Altay, Osman
    2022 2ND INTERNATIONAL CONFERENCE ON COMPUTING AND MACHINE INTELLIGENCE, ICMI 2022, 2022, : 62 - 66
  • [7] A Methodology for Machine Translation of Simple Sentences from Kannada to English Language
    Kodabagi, Mallikarjun M.
    Angadi, S. A.
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 237 - 241
  • [8] Joint pairwise learning and masked language models for neural machine translation of English
    Yang, Shuhan
    Yang, Qun
    ARTIFICIAL LIFE AND ROBOTICS, 2025,
  • [9] CLASSIFICATION OF LANGUAGE LEARNERS' SENTENCES INTO NATIVE-LIKE OR NON-NATIVE-LIKE SENTENCES USING LEARNER SENTENCES AND MACHINE TRANSLATION SENTENCES AS LEARNING DATA
    Kotani, Katsunori
    Yoshimi, Takehiko
    3RD INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI2010), 2010,
  • [10] A Simple Present and Past Sentences Machine Translation from Arabic Language (AL) to English language
    Hmeidi, Ismail
    Al-Aiad, Ahmad
    Al-Momani, Sama
    Ibnian, Mohammad
    2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2016,