Early Prediction of University Dropouts - A Random Forest Approach

Cited by: 25
Authors
Behr, Andreas [1]
Giese, Marco [1]
Teguim, Herve D. K. [1]
Theune, Katja [1]
Affiliation
[1] Univ Duisburg Essen, Chair Stat, Essen, Germany
Source
Keywords
student dropout; higher education; dropout prediction; educational data mining; random forest; HIGHER-EDUCATION; ACADEMIC-PERFORMANCE; PANEL ATTRITION; DETERMINANTS; DECISION; COLLEGE; PROBABILITY;
DOI
10.1515/jbnst-2019-0006
Chinese Library Classification
F [Economics];
Discipline classification code
02;
Abstract
We predict university dropout using random forests based on conditional inference trees and on a broad German data set covering a wide range of aspects of student life and study courses. We model the dropout decision as a binary classification (graduate or dropout) and focus on very early prediction of student dropout by stepwise modeling of students' transition from school (pre-study) through the study-decision phase (decision phase) to the first semesters at university (early study phase). We evaluate how predictive performance changes across the three models and observe substantially increased performance when variables from the first study experiences are included, resulting in an AUC (area under the curve) of 0.86. Important predictors are the final grade at secondary school as well as determinants associated with student satisfaction and students' subjective academic self-concept and self-assessment. A direct outcome of this research is the provision of information to universities wishing to implement early warning systems and more personalized counseling services to support students at risk of dropping out during an early stage of study.
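As a reading aid for the AUC figure quoted in the abstract, the following minimal Python sketch computes the area under the ROC curve as the probability that a randomly chosen dropout receives a higher predicted risk than a randomly chosen graduate. The function name `auc` and all scores are hypothetical and invented for illustration; they are not taken from the paper, and this is not the authors' evaluation code.

```python
from itertools import product


def auc(scores_dropout, scores_graduate):
    """Pairwise (Mann-Whitney) estimate of the area under the ROC curve.

    Returns the share of (dropout, graduate) pairs in which the dropout
    has the higher predicted risk; ties count as half a win.
    """
    if not scores_dropout or not scores_graduate:
        raise ValueError("both classes need at least one score")
    wins = 0.0
    for d, g in product(scores_dropout, scores_graduate):
        if d > g:
            wins += 1.0
        elif d == g:
            wins += 0.5
    return wins / (len(scores_dropout) * len(scores_graduate))


# Hypothetical predicted dropout probabilities (illustration only).
dropout_scores = [0.9, 0.8, 0.6, 0.4]
graduate_scores = [0.7, 0.3, 0.2, 0.1]

example_auc = auc(dropout_scores, graduate_scores)
print(example_auc)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of the two groups, so under this reading the reported 0.86 means a dropout is ranked above a graduate in roughly 86% of such pairs.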
Pages: 743-789
Number of pages: 47