CHARACTERISTICS OF DATA MINING BY CLASSIFICATION EDUCATIONAL DATASET TO IMPROVE STUDENT'S EVALUATION

被引:0
|
作者
Jasim, Abdulrahman Ahmed [1 ]
Hazim, Layth Rafea [2 ]
Abdullah, Wisam Dawood [2 ]
机构
[1] Al Iraqia Univ, Coll Engn, Dept Network Engn, Baghdad, Iraq
[2] Tikrit Univ, Cisco Networking Acad, Tikrit, Iraq
来源
关键词
Classification methods; Educational data mining; Knowledge discovery; Machine learning; Performance prediction; PERFORMANCE;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The educational data mining (EDM) can be specified as one of the main fields related to high-quality research that involves mining datasets to address research questions related to education; such questions examine the ways in which people learn and teach. a large amount of data, including education data, are being collected, and much of them are unprocessed. The success of EDM was examined in this paper, and nine data mining techniques were explored including: bagging, multilayer perception (MLP), naive Bayes (NB), K-nearest neighbours (KNN), logistic regression (LR), support vector machine (SVM), XGBoost, decision tree (DT), and random forest (RF). Such techniques were used on an educational dataset obtained from certain learning management system which is referred to as Kalboard 360. This paper involves three major steps. Firstly, student performance model that contains exceptional feature's category, that are referred to as behavioural features, is introduced. Secondly, the dataset is pre-processed, and the pre-processing steps involve transforming the raw data into a usable format and verifying the connections between independent and dependent variables in sample dataset, which has been also referred to as the training dataset. Thirdly, the nine data mining approaches have been utilized on the acquired dataset to classify student performance into low, middle, and high levels. Afterwards, the performance measures were examined by using recall, precision, accuracy, as well as F1 score. RF (89%) obtained the best accuracy also other techniques were ordered in terms of accuracy: bagging (85%) > XGBoost 84% > NB (81%) > LR (81%) > MLP (77%) > DT (76%) > SVM (72%) > KNN (68%). Results were compared by using divided datasets (80:20 ratio) (80 for training: 20 for testing).
引用
收藏
页码:2825 / 2844
页数:20
相关论文
共 50 条
  • [31] Comparison of Data Mining Classification Algorithms on Educational Data under Different Conditions
    Koyuncu, Ilhan
    Gelbal, Selahattin
    JOURNAL OF MEASUREMENT AND EVALUATION IN EDUCATION AND PSYCHOLOGY-EPOD, 2020, 11 (04): : 325 - 345
  • [32] Evaluation of the impact of dataset characteristics for classification problems in biological applications
    Institute for Bioinformatics, University for Health Sciences, Medical Informatics and Technology , A-6060 Hall in Tirol, Austria
    不详
    不详
    World Acad. Sci. Eng. Technol., 2009, (966-970):
  • [33] Evaluation of the Impact of Dataset Characteristics for Classification Problems in Biological Applications
    Kusonmano, Kanthida
    Netzer, Michael
    Pfeifer, Bernhard
    Baumgartner, Christian
    Liedl, Klaus R.
    Graber, Armin
    World Academy of Science, Engineering and Technology, 2009, 58 : 966 - 970
  • [34] Educational Data Mining to Improve Learner Performance in Gauteng Primary Schools
    Ramaphosa, Khokhoni Innocentia Mpho
    Zuva, Tranos
    Kwuimi, Raoul
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN BIG DATA, COMPUTING AND DATA COMMUNICATION SYSTEMS (ICABCD), 2018,
  • [35] Prediction of student exam performance using data mining classification algorithms
    Khairy, Dalia
    Alharbi, Nouf
    Amasha, Mohamed A.
    Areed, Marwa F.
    Alkhalaf, Salem
    Abougalala, Rania A.
    EDUCATION AND INFORMATION TECHNOLOGIES, 2024, 29 (16) : 21621 - 21645
  • [36] DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE
    Bara, Adela
    Andreescu, Anca
    Botha, Iuliana
    Florea, Alexandra
    Velicanu, Manole
    QUALITY AND EFFICIENCY IN E-LEARNING, VOL 2, 2013, : 284 - 289
  • [37] Comments Data Mining for Evaluating Student's Performance
    Sorour, Shaymaa E.
    Mine, Tsunenori
    Goda, Kazumasa
    Hirokawa, Sachio
    2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014), 2014, : 25 - 30
  • [38] Mining association rules in student's assessment data
    Kumar, Varun
    Chadha, Anupama
    International Journal of Computer Science Issues, 2012, 9 (5 5-3): : 211 - 216
  • [39] Student Engagement Patterns in a Blended Learning Environment: an Educational Data Mining Approach
    Nkomo, Larian M.
    Nat, Muesser
    TECHTRENDS, 2021, 65 (05) : 808 - 817
  • [40] Student Engagement Patterns in a Blended Learning Environment: an Educational Data Mining Approach
    Larian M. Nkomo
    Muesser Nat
    TechTrends, 2021, 65 : 808 - 817