A methodology to design, develop, and evaluate machine learning models for predicting dropout in school systems: the case of Chile

被引:6
|
作者
Rodriguez, Patricio [1 ,2 ]
Villanueva, Alexis [3 ]
Dombrovskaia, Lioubov [4 ]
Valenzuela, Juan Pablo [1 ,2 ]
机构
[1] Univ Chile, Inst Educ, Periodista Jose Carrasco Tapia 75,Reg Metropolitan, Santiago 8330014, Chile
[2] Univ Chile, Ctr Adv Res Educ, Periodista Jose Carrasco Tapia 75,Reg Metropolitan, Santiago 8330014, Chile
[3] Univ Chile, Ctr Adv Res Educ, Santiago, Chile
[4] Univ Tecn Feder Santa Maria, Dept Informat, Santiago, Chile
关键词
Data science applications in education; Machine learning; Learning analytics; Student retention; TREE;
D O I
10.1007/s10639-022-11515-5
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
School dropout is a structural problem which permanently penalizes students and society in areas such as low qualification jobs, higher poverty levels and lower life expectancy, lower pensions, and higher economic burden for governments. Given these high consequences and the surge of the problem due to COVID-19 pandemic, in this paper we propose a methodology to design, develop, and evaluate a machine learning model for predicting dropout in school systems. In this methodology, we introduce necessary steps to develop a robust model to estimate the individual risk of each student to drop out of school. As advancement from previous research, this proposal focuses on analyzing individual trajectories of students, incorporating the student situation at school, family, among other levels, changes, and accumulation of events to predict dropout. Following the methodology, we create a model for the Chilean case based on data available mostly through administrative data from the educational system, and according to known factors associated with school dropout. Our results are better than those from previous research with a relevant sample size, with a predictive capability 20% higher for the actual dropout cases. Also, in contrast to previous work, the including non-individual dimensions results in a substantive contribution to the prediction of leaving school. We also illustrate applications of the model for Chilean case to support public policy decision making such as profiling schools for qualitative studies of pedagogic practices, profiling students' dropout trajectories and simulating scenarios.
引用
收藏
页码:10103 / 10149
页数:47
相关论文
共 50 条
  • [1] A methodology to design, develop, and evaluate machine learning models for predicting dropout in school systems: the case of Chile
    Patricio Rodríguez
    Alexis Villanueva
    Lioubov Dombrovskaia
    Juan Pablo Valenzuela
    [J]. Education and Information Technologies, 2023, 28 : 10103 - 10149
  • [2] An Interface Design Methodology for Serving Machine Learning Models
    Ogbuju, Emeka
    Ihinkalu, Olalekan
    Oladipo, Francisca
    [J]. PROCEEDINGS OF THE 4TH AFRICAN CONFERENCE FOR HUMAN COMPUTER INTERACTION, AFRICHI 2023, 2023, : 12 - 14
  • [3] Dropout early warning systems for high school students using machine learning
    Chung, Jae Young
    Lee, Sunbok
    [J]. CHILDREN AND YOUTH SERVICES REVIEW, 2019, 96 : 346 - 353
  • [4] MACHINE LEARNING OF DESIGN RULES - METHODOLOGY AND CASE-STUDY
    ARCISZEWSKI, T
    BLOEDORN, E
    MICHALSKI, RS
    MUSTAFA, M
    WNEK, J
    [J]. JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 1994, 8 (03) : 286 - 308
  • [5] Student dropout prediction in High Education, using machine learning and deep learning models: case of Ecuadorian university
    Davila, Gonzalo
    Haro, Juan
    Gonzalez-Eras, Alexandra
    Ruiz Vivanco, Omar
    Gilman Coronet, Daniel
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 1677 - 1684
  • [6] Use of Machine Learning to Develop and Evaluate Models Using Preoperative and Intraoperative Data to Identify Risks of Postoperative Complications
    Xue, Bing
    Li, Dingwen
    Lu, Chenyang
    King, Christopher R.
    Wildes, Troy
    Avidan, Michael S.
    Kannampallil, Thomas
    Abraham, Joanna
    [J]. JAMA NETWORK OPEN, 2021, 4 (03)
  • [7] Machine learning models for predicting maximum displacement of triple pendulum isolation systems
    Nguyen, Nam, V
    Nguyen, Hoang D.
    Dao, Nhan D.
    [J]. STRUCTURES, 2022, 36 : 404 - 415
  • [8] Machine Learning Models for Predicting the Quality Factor of FSO Systems with Multiple Transceivers
    Algedir, Amal A.
    Elganimi, Taissir Y.
    [J]. 2020 IEEE 2ND GLOBAL POWER, ENERGY AND COMMUNICATION CONFERENCE (IEEE GPECOM2020), 2020, : 308 - 311
  • [9] In Situ Monitoring Systems of The SLM Process: On the Need to Develop Machine Learning Models for Data Processing
    Yadav, Pinku
    Rigo, Olivier
    Arvieu, Corinne
    Le Guen, Emilie
    Lacoste, Eric
    [J]. CRYSTALS, 2020, 10 (06) : 1 - 26
  • [10] Virtual Models of Dynamic Systems: Design, Methodology, Applications, and Experiences in Learning of Mechatronics
    Fedak, V.
    Balogh, T.
    Ismeal, Godem A.
    [J]. 10TH IEEE INTERNATIONAL CONFERENCE ON EMERGING ELEARNING TECHNOLOGIES AND APPLICATIONS (ICETA 2012), 2012, : 111 - 116