Multi-Class Phased Prediction of Academic Performance and Dropout in Higher Education

Cited by: 3
Authors
Martins, Monica V. [1 ]
Baptista, Luis [1 ]
Machado, Jorge [1 ]
Realinho, Valentim [1 ,2 ]
Affiliations
[1] Polytech Inst Portalegre, P-7300110 Portalegre, Portugal
[2] VALORIZA Res Ctr Endogenous Resource Valorizat, P-7300555 Portalegre, Portugal
Source
APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 08
Keywords
machine learning; learning analytics; student performance prediction; dropout prediction; student performance
DOI
10.3390/app13084702
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The application of intelligent systems in higher education is an active field of research, driven by the abundance of available data and by the urgent need for effective, data-driven strategies to reduce student dropout and improve academic performance. This work applies machine learning techniques to develop prediction models that can contribute to the early detection of students at risk of dropping out or of not finishing their degree in due time. It also evaluates the best moment during the student's enrollment year to make the prediction. The models are built on data from undergraduate students at a polytechnic university in Portugal, enrolled between 2009 and 2017, comprising academic, sociodemographic, and macroeconomic information at three different phases of the students' first academic year. Five machine learning algorithms are used to train prediction models at each phase, and the most relevant features for the top-performing models are identified. Results show that the best models use Random Forest, either incorporating strategies to deal with the imbalanced nature of the data or applying such strategies at the data level. The best results are obtained at the end of the first semester, when some information about academic performance after enrollment is already available. The overall results compare fairly with those of similar works that address the early prediction of student dropout or academic performance.
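The abstract distinguishes between handling class imbalance inside the learner and handling it at the data level. The sketch below illustrates one common technique from each family, inverse-frequency class weights and random oversampling, in plain Python. It is not the authors' implementation: the class names, the counts, and the helper functions `balanced_class_weights` and `random_oversample` are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter


def balanced_class_weights(labels):
    """Algorithm-level idea: weight each class by n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def random_oversample(rows, labels, seed=0):
    """Data-level idea: duplicate minority-class rows until every class matches the largest."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, cnt in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        for _ in range(target - cnt):
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Hypothetical toy data: three outcome classes with an uneven split (not from the paper).
labels = ["on_time"] * 6 + ["dropout"] * 3 + ["late"] * 1
rows = [[i] for i in range(len(labels))]  # placeholder feature vectors

weights = balanced_class_weights(labels)
res_rows, res_labels = random_oversample(rows, labels)

print(weights)
print(Counter(res_labels))
```

With weights of this kind, errors on rare classes count more during training; with oversampling, the training set itself is rebalanced before any learner sees it. Which variant works better is an empirical question for a given dataset.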
Pages: 15