Early Prediction of University Dropouts - A Random Forest Approach

被引:25
|
作者
Behr, Andreas [1 ]
Giese, Marco [1 ]
Teguim, Herve D. K. [1 ]
Theune, Katja [1 ]
机构
[1] Univ Duisburg Essen, Chair Stat, Essen, Germany
来源
JAHRBUCHER FUR NATIONALOKONOMIE UND STATISTIK | 2020年 / 240卷 / 06期
关键词
student dropout; higher education; dropout prediction; educational data mining; random forest; HIGHER-EDUCATION; ACADEMIC-PERFORMANCE; PANEL ATTRITION; DETERMINANTS; DECISION; COLLEGE; PROBABILITY;
D O I
10.1515/jbnst-2019-0006
中图分类号
F [经济];
学科分类号
02 ;
摘要
We predict university dropout using random forests based on conditional inference trees and on a broad German data set covering a wide range of aspects of student life and study courses. We model the dropout decision as a binary classification (graduate or dropout) and focus on very early prediction of student dropout by stepwise modeling students' transition from school (pre-study) over the study-decision phase (decision phase) to the first semesters at university (early study phase). We evaluate how predictive performance changes over the three models, and observe a substantially increased performance when including variables from the first study experiences, resulting in an AUC (area under the curve) of 0.86. Important predictors are the final grade at secondary school, and also determinants associated with student satisfaction and their subjective academic self-concept and self-assessment. A direct outcome of this research is the provision of information to universitieswishing to implement early warning systems and more personalized counseling services to support students at risk of dropping out during an early stage of study.
引用
收藏
页码:743 / 789
页数:47
相关论文
共 50 条
  • [41] Prediction of novel mouse TLR9 agonists using a random forest approach
    Khanna, Varun
    Li, Lei
    Fung, Johnson
    Ranganathan, Shoba
    Petrovsky, Nikolai
    BMC MOLECULAR AND CELL BIOLOGY, 2019, 20 (Suppl 2)
  • [42] Prediction of novel mouse TLR9 agonists using a random forest approach
    Varun Khanna
    Lei Li
    Johnson Fung
    Shoba Ranganathan
    Nikolai Petrovsky
    BMC Molecular and Cell Biology, 20
  • [43] A nonparametric test for random dropouts
    Listing, J
    Schlittgen, R
    BIOMETRICAL JOURNAL, 2003, 45 (01) : 113 - 127
  • [44] Tests if dropouts are missed at random
    Listing, J
    Schlittgen, R
    BIOMETRICAL JOURNAL, 1998, 40 (08) : 929 - 935
  • [45] Four Grade Levels-Based Models with Random Forest for Student Performance Prediction at a Multidisciplinary University
    Tran Thanh Dien
    Duy-Anh, Le
    Hong-Phat, Nguyen
    Van-Tuan, Nguyen
    Thanh-Chanh, Trinh
    Minh-Bang, Le
    Thanh-Hai, Nguyen
    Thai-Nghe, Nguyen
    COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, CISIS-2021, 2021, 278 : 1 - 12
  • [46] Generalized mixed-effects random forest: A flexible approach to predict university student dropout
    Pellagatti, Massimo
    Masci, Chiara
    Ieva, Francesca
    Paganoni, Anna M.
    STATISTICAL ANALYSIS AND DATA MINING, 2021, 14 (03) : 241 - 257
  • [47] DROPOUTS FROM AN AUSTRALIAN UNIVERSITY
    LAWRENCE, S
    AUSTRALIAN JOURNAL OF EDUCATION, 1971, 15 (03) : 305 - 313
  • [48] The Socioeconomic Gap in University Dropouts
    Vignoles, Anna F.
    Powdthavee, Nattavudh
    B E JOURNAL OF ECONOMIC ANALYSIS & POLICY, 2009, 9 (01):
  • [49] Prediction with Confidence Based on a Random Forest Classifier
    Devetyarov, Dmitry
    Nouretdinov, Ilia
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 37 - 44
  • [50] Prediction of β-Lactamase Proteins using Random Forest
    White, Clarence
    Dukka, K. C.
    FASEB JOURNAL, 2017, 31