Early Prediction of University Dropouts - A Random Forest Approach

被引:25
|
作者
Behr, Andreas [1 ]
Giese, Marco [1 ]
Teguim, Herve D. K. [1 ]
Theune, Katja [1 ]
机构
[1] Univ Duisburg Essen, Chair Stat, Essen, Germany
来源
JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK | 2020年 / 240卷 / 06期
关键词
student dropout; higher education; dropout prediction; educational data mining; random forest; HIGHER-EDUCATION; ACADEMIC-PERFORMANCE; PANEL ATTRITION; DETERMINANTS; DECISION; COLLEGE; PROBABILITY;
D O I
10.1515/jbnst-2019-0006
中图分类号
F [经济];
学科分类号
02 ;
摘要
We predict university dropout using random forests based on conditional inference trees and on a broad German data set covering a wide range of aspects of student life and study courses. We model the dropout decision as a binary classification (graduate or dropout) and focus on very early prediction of student dropout by stepwise modeling students' transition from school (pre-study) over the study-decision phase (decision phase) to the first semesters at university (early study phase). We evaluate how predictive performance changes over the three models, and observe a substantially increased performance when including variables from the first study experiences, resulting in an AUC (area under the curve) of 0.86. Important predictors are the final grade at secondary school, and also determinants associated with student satisfaction and their subjective academic self-concept and self-assessment. A direct outcome of this research is the provision of information to universitieswishing to implement early warning systems and more personalized counseling services to support students at risk of dropping out during an early stage of study.
引用
收藏
页码:743 / 789
页数:47
相关论文
共 50 条
  • [21] SPATIALLY AWARE LANDSLIDE SUSCEPTIBILITY PREDICTION USING A GEOGRAPHICAL RANDOM FOREST APPROACH
    Teke, A.
    Kavzoglu, T.
    8TH INTERNATIONAL CONFERENCE ON GEOINFORMATION ADVANCES, GEOADVANCES 2024, VOL. 48-4, 2024, : 363 - 370
  • [22] USING THE AUTOMATED RANDOM FOREST APPROACH FOR OBTAINING THE COMPRESSIVE STRENGTH PREDICTION OF RCA
    Wu, Yujie
    He, Xiaoming
    CIVIL ENGINEERING JOURNAL-STAVEBNI OBZOR, 2023, 32 (04): : 1 - 14
  • [23] Using the automated random forest approach for obtaining the compressive strength prediction of RCA
    Yujie Wu
    Xiaoming He
    Multiscale and Multidisciplinary Modeling, Experiments and Design, 2024, 7 : 855 - 867
  • [24] A Random Forest based approach to MHC class I epitope prediction and analysis
    Wilson, Eric A.
    Krishna, Sri
    Anderson, Karen S.
    JOURNAL OF IMMUNOLOGY, 2018, 200 (01):
  • [25] Educational Performance Prediction with Random Forest and Innovative Optimizers: A Data Mining Approach
    Chen, Yanli
    Jin, Ke
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 69 - 78
  • [26] Survival Analysis based Framework for Early Prediction of Student Dropouts
    Ameri, Sattar
    Fard, Mahtab J.
    Chinnam, Ratna B.
    Reddy, Chandan K.
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 903 - 912
  • [27] Using the automated random forest approach for obtaining the compressive strength prediction of RCA
    Wu, Yujie
    He, Xiaoming
    MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2024, 7 (02) : 855 - 867
  • [28] Crop yield prediction in cotton for regional level using random forest approach
    Prasad, N. R.
    Patel, N. R.
    Danodia, Abhishek
    SPATIAL INFORMATION RESEARCH, 2021, 29 (02) : 195 - 206
  • [29] DROPOUTS DISSATISFACTION WITH UNIVERSITY
    HAYES, SC
    AUSTRALIAN JOURNAL OF EDUCATION, 1977, 21 (02) : 141 - 149
  • [30] Random Forest for Breast Cancer Prediction
    Octaviani, T. L.
    Rustam, Z.
    PROCEEDINGS OF THE 4TH INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES (ISCPMS2018), 2019, 2168