Student Dropout Prediction for University with High Precision and Recall

被引:9
|
作者
Kim, Sangyun [1 ]
Choi, Euteum [2 ]
Jun, Yong-Kee [3 ,4 ]
Lee, Seongjin [5 ]
机构
[1] Gyeongsang Natl Univ, Dept Informat, Jinju Daero 501, Jinjusi 52828, South Korea
[2] Gyeongsang Natl Univ, Res Ctr Aircraft Parts Technol, Jinju Daero 501, Jinjusi 52828, South Korea
[3] Gyeongsang Natl Univ, Div Aerosp & Software Engn, Jinju Daero 501, Jinjusi 52828, South Korea
[4] Gyeongsang Natl Univ, Dept Bio & Med Bigdata Program BK4, Jinju Daero 501, Jinjusi 52828, South Korea
[5] Gyeongsang Natl Univ, Dept AI Convergence Engn, Jinju Daero 501, Jinjusi 52828, South Korea
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 10期
基金
新加坡国家研究基金会;
关键词
dropout precision; dropout recall; machine learning; imbalanced data processing; hybrid method; big data; academic data; principle component analysis; K-means clustering; SMOTE;
D O I
10.3390/app13106275
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Application to student counseling and reducing the dropout rate in universities.Since a high dropout rate for university students is a significant risk to local communities and countries, a dropout prediction model using machine learning is an active research domain to prevent students from dropping out. However, it is challenging to fulfill the needs of consulting institutes and the office of academic affairs. To the consulting institute, the accuracy in the prediction is of the utmost importance; to the offices of academic affairs and other offices, the reason for dropping out is essential. This paper proposes a Student Dropout Prediction (SDP) system, a hybrid model to predict the students who are about to drop out of the university. The model tries to increase the dropout precision and the dropout recall rate in predicting the dropouts. We then analyzed the reason for dropping out by compressing the feature set with PCA and applying K-means clustering to the compressed feature set. The SDP system showed a precision value of 0.963, which is 0.093 higher than the highest-precision model of the existing works. The dropout recall and F1 scores, 0.766 and 0.808, respectively, were also better than those of gradient boosting by 0.117 and 0.011, making them the highest among the existing works; Then, we classified the reasons for dropping out into four categories: "Employed", "Did Not Register", "Personal Issue", and "Admitted to Other University." The dropout precision of "Admitted to Other University" was the highest, at 0.672. In post-verification, the SDP system increased counseling efficiency by accurately predicting dropouts with high dropout precision in the "High-Risk" group while including more dropouts in total dropouts. In addition, by predicting the reasons for dropouts and presenting guidelines to each department, the students could receive personalized counseling.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Improving Prediction of MOOCs Student Dropout Using a Feature Engineering Approach
    Ardchir, Soufiane
    Ouassit, Youssef
    Ounacer, Soumaya
    Jihal, Houda
    El Goumari, Mohamed Yassine
    Azouazi, Mohamed
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 146 - 156
  • [43] High-recall, high-precision prediction of protein binding sites from 3D structure
    Wei, Ying
    Murga, Leonel F.
    Ondrechen, Mary Jo
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 232 : 315 - 315
  • [44] Student Enrollment and Dropout: An Evaluation Study of DCSA Program at Bangladesh Open University
    Rashid, Mohammad Mamunur
    Jahan, Monira
    Islam, Anwarul
    Ratna, Meherin Munjarin
    INTERNATIONAL REVIEW OF RESEARCH IN OPEN AND DISTRIBUTED LEARNING, 2015, 16 (04): : 18 - 32
  • [45] STUDENT WASTAGE AT EDINBURGH UNIVERSITY .1. FACTORS RELATED TO FAILURE AND DROPOUT
    KAPUR, RL
    UNIVERSITIES QUARTERLY, 1972, 26 (03): : 353 - 377
  • [46] Mixture Structural Equation Models for Classifying University Student Dropout in Latin America
    Viloria, Amelec
    Pineda Lezama, Omar Bonerge
    10TH INT CONF ON EMERGING UBIQUITOUS SYST AND PERVAS NETWORKS (EUSPN-2019) / THE 9TH INT CONF ON CURRENT AND FUTURE TRENDS OF INFORMAT AND COMMUN TECHNOLOGIES IN HEALTHCARE (ICTH-2019) / AFFILIATED WORKOPS, 2019, 160 : 629 - 634
  • [47] A Study on Student's Dropout in Education Programs of the Catholic of the North University Foundation
    Rodriguez Nunez, Luz Helena
    Londono Londono, Francisco Javier
    REVISTA VIRTUAL UNIVERSIDAD CATOLICA DEL NORTE, 2011, 33 : 328 - 355
  • [48] An Analysis of Student Satisfaction and its Relationship with Academic and Social Factors in University Dropout
    Esteban, Maria
    Bernardo, Ana B.
    Blanco, Elena
    Oserin, Palmira
    INTERNATIONAL JOURNAL OF EDUCATIONAL PSYCHOLOGY, 2024, 13 (03): : 219 - 239
  • [49] A High Precision Low Dropout Regulator with Nested Feedback Loops
    Kuo, Ron-Chi
    Tsai, Tung-Han
    Hsieh, Yu-Jie
    Wang, Chua-Chin
    PROCEEDINGS OF THE 2010 IEEE ASIA PACIFIC CONFERENCE ON CIRCUIT AND SYSTEM (APCCAS), 2010, : 664 - 667
  • [50] A high precision low dropout regulator with nested feedback loops
    Wang, Chua-Chin
    Kuo, Ron-Chi
    Tsai, Tung-Han
    MICROELECTRONICS JOURNAL, 2011, 42 (07) : 966 - 971