The Machine Learning-Based Dropout Early Warning System for Improving the Performance of Dropout Prediction

被引:66
|
作者
Lee, Sunbok [1 ]
Chung, Jae Young [2 ]
机构
[1] Univ Houston, Dept Psychol, Houston, TX 77004 USA
[2] EwhaWomans Univ, Dept Educ, Seoul 03760, South Korea
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 15期
关键词
dropout; machine learning; big data; class-imbalance; oversampling; ensemble; CARD FRAUD DETECTION; IDENTIFICATION; STUDENTS;
D O I
10.3390/app9153093
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A dropout early warning system enables schools to preemptively identify students who are at risk of dropping out of school, to promptly react to them, and eventually to help potential dropout students to continue their learning for a better future. However, the inherent class imbalance between dropout and non-dropout students could pose difficulty in building accurate predictive modeling for a dropout early warning system. The present study aimed to improve the performance of a dropout early warning system: (a) by addressing the class imbalance issue using the synthetic minority oversampling techniques (SMOTE) and the ensemble methods in machine learning; and (b) by evaluating the trained classifiers with both receiver operating characteristic (ROC) and precision-recall (PR) curves. To that end, we trained random forest, boosted decision tree, random forest with SMOTE, and boosted decision tree with SMOTE using the big data samples of the 165,715 high school students from the National Education Information System (NEIS) in South Korea. According to our ROC and PR curve analysis, boosted decision tree showed the optimal performance.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Machine Learning-Based Cardiac Arrest Prediction for Early Warning System
    Chae, Minsu
    Gil, Hyo-Wook
    Cho, Nam-Jun
    Lee, Hwamin
    [J]. MATHEMATICS, 2022, 10 (12)
  • [2] Beyond Early Warning Indicators: High School Dropout and Machine Learning
    Sansone, Dario
    [J]. OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 2019, 81 (02) : 456 - 485
  • [3] Dropout early warning systems for high school students using machine learning
    Chung, Jae Young
    Lee, Sunbok
    [J]. CHILDREN AND YOUTH SERVICES REVIEW, 2019, 96 : 346 - 353
  • [4] Dropout Prediction by Interpretable Machine Learning Model Towards Preventing Student Dropout
    Katsuragi, Miki
    Tanaka, Kenji
    [J]. TRANSDISCIPLINARITY AND THE FUTURE OF ENGINEERING, 2022, 28 : 678 - 683
  • [5] Early dropout prediction via machine learning in professional online courses
    Urteaga, Ignacio
    Siri, Laura
    Garofalo, Guillermo
    [J]. RIED-REVISTA IBEROAMERICANA DE EDUCACION A DISTANCIA, 2020, 23 (02): : 147 - 167
  • [6] Machine Learning Application in MOOCs: Dropout Prediction
    Liang, Jiajun
    Li, Chao
    Zheng, Li
    [J]. 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE), 2016, : 52 - 57
  • [7] Application of the performance of machine learning techniques as support in the prediction of school dropout
    Auria Lucia Jiménez-Gutiérrez
    Cinthya Ivonne Mota-Hernández
    Efrén Mezura-Montes
    Rafael Alvarado-Corona
    [J]. Scientific Reports, 14
  • [8] Application of the performance of machine learning techniques as support in the prediction of school dropout
    Jimenez-Gutierrez, Auria Lucia
    Mota-Hernandez, Cinthya Ivonne
    Mezura-Montes, Efren
    Alvarado-Corona, Rafael
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [9] Dropout prediction in Moocs using deep learning and machine learning
    Ram B. Basnet
    Clayton Johnson
    Tenzin Doleck
    [J]. Education and Information Technologies, 2022, 27 : 11499 - 11513
  • [10] Dropout prediction in Moocs using deep learning and machine learning
    Basnet, Ram B.
    Johnson, Clayton
    Doleck, Tenzin
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2022, 27 (08) : 11499 - 11513