Assessing English language sentences readability using machine learning models

被引:0
|
作者
Maqsood S. [1 ]
Shahid A. [1 ]
Afzal M.T. [2 ]
Roman M. [1 ]
Khan Z. [3 ]
Nawaz Z. [4 ]
Aziz M.H. [5 ]
机构
[1] Institute of Computing, Kohat University of Science and Technology, KPK, Kohat
[2] NAMAL Institue of Mianwali, Punjab, Mianwali
[3] Robotics and Internet of Things Lab, Prince Sultan University, Riyadh
[4] Department of Data Science, Faculty of Computing and Information Technology, University of the Punjab, Punjab, Lahore
[5] Mechanical Engineering Department, University of Sargodha, Punjab, Sargodha
关键词
Flesch-kincaid; Language learning; Machine learning; Natural language processing; Sentence readability;
D O I
10.7717/PEERJ-CS.818
中图分类号
学科分类号
摘要
Readability is an active field of research in the late nineteenth century and vigorously persuaded to date. The recent boom in data-driven machine learning has created aviable path forward for readability classification and ranking. The evaluation oftext readability is a time-honoured issue with even more relevance in today’sinformation-rich world. This paper addresses the task of readability assessment forthe English language. Given the input sentences, the objective is to predict its level ofreadability, which corresponds to the level of literacy anticipated from the targetreaders. This readability aspect plays a crucial role in drafting and comprehendingprocesses of English language learning. Selecting and presenting a suitable collectionof sentences for English Language Learners may play a vital role in enhancingtheir learning curve. In this research, we have used 30,000 English sentences forexperimentation. Additionally, they have been annotated into seven differentreadability levels using Flesch Kincaid. Later, various experiments were conductedusing five Machine Learning algorithms, i.e., KNN, SVM, LR, NB, and ANN.The classification models render excellent and stable results. The ANN modelobtained an F-score of 0.95% on the test set. The developed model may be used ineducation setup for tasks such as language learning, assessing the reading and writingabilities of a learner © Copyright 2022 Maqsood et al
引用
收藏
相关论文
共 50 条
  • [21] Assessing regional competitiveness in Peru: An approach using nonlinear machine learning models
    Garcia-Lopez, Yvan J.
    Castro, Luis A. del Carpio
    PLOS ONE, 2025, 20 (02):
  • [22] Assessing the effect of climate change on drought and runoff using a machine learning models
    Jahangiri, E.
    Motamedvaziri, B.
    Kiadaliri, H.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2025, 22 (04) : 2205 - 2228
  • [23] Predicting Prenatal Depression and Assessing Model Bias Using Machine Learning Models
    Huang, Yongchao
    Alvernaz, Suzanne
    Kim, Sage J.
    Maki, Pauline
    Dai, Yang
    Bernabe, Beatriz Penalver
    BIOLOGICAL PSYCHIATRY: GLOBAL OPEN SCIENCE, 2024, 4 (06):
  • [24] Using Large Language Models to Develop Readability Formulas for Educational Settings
    Crossley, Scott
    Choi, Joon Suh
    Scherber, Yanisa
    Lucka, Mathis
    ARTIFICIAL INTELLIGENCE IN EDUCATION. POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2023, 2023, 1831 : 422 - 427
  • [25] Machine translation of simple English sentences to Hindi
    Ahmed, Mansoor
    Bhattacharyya, S.K.
    Advances in Modelling and Analysis B: Signals, Information, Data, Patterns, 1995, 33 (1-3): : 13 - 26
  • [26] Enhancing the Readability of Preoperative Patient Instructions Using Large Language Models
    Hong, Hyo Jung
    Schmiesing, Clifford A.
    Goodell, Alex J.
    ANESTHESIOLOGY, 2024, 141 (03) : 608 - 610
  • [27] Democratizing Language Learning using Machine Learning
    Gangopadhyay, Ahana
    Bardhan, Indrajit
    Das, Anirban
    Soman, Nitish Subhash
    Das, Santanu
    2022 56TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2022, : 137 - 141
  • [28] Explaining machine learning models with interactive natural language conversations using TalkToModel
    Slack, Dylan
    Krishna, Satyapriya
    Lakkaraju, Himabindu
    Singh, Sameer
    NATURE MACHINE INTELLIGENCE, 2023, 5 (08) : 873 - +
  • [29] Detection of Arabic offensive language in social media using machine learning models
    Mousa, Aya
    Shahin, Ismail
    Nassif, Ali Bou
    Elnagar, Ashraf
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2024, 22
  • [30] Explaining machine learning models with interactive natural language conversations using TalkToModel
    Dylan Slack
    Satyapriya Krishna
    Himabindu Lakkaraju
    Sameer Singh
    Nature Machine Intelligence, 2023, 5 : 873 - 883