Predicting academic performance using tree-based machine learning models: A case study of bachelor students in an engineering department in China

被引:8
|
作者
Zhang, Wei [1 ]
Wang, Yu [1 ]
Wang, Suyu [1 ]
机构
[1] South China Agr Univ, Coll Water Conservancy & Civil Engn, Guangzhou 510642, Guangdong, Peoples R China
关键词
Educational data mining; Quality of teaching and learning; Classification; Decision tree; Engineering education; Department administration;
D O I
10.1007/s10639-022-11170-w
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Educational data mining (DEM) provides valuable educational information by applying data mining tools and techniques to analyze data at educational institutions. In this paper, tree-based machine learning algorithms are used to predict students' overall academic performance in their bachelor's program. The transcript data of the students in the same department in a Chinese university were collected. All the courses in the bachelor's program were then divided into six typical categories, and the mean GPAs of each category were taken as primary input features for prediction. Three tree-based machine learning models were established, i.e. decision tree (DT), Gradient boosting decision tree (GBDT) and random forest (RF). Results show that we can successfully identify more than 80% of the students at low-performance risk using the RF model at the end of the second semester, which is meaningful because the global quality of teaching and learning of the department can be improved by taking targeted measures in time according to the machine learning model. Feature importance and the structure of decision tree were also analyzed to extract knowledge that is valuable for both students and teachers. The results of this case study can be used as a reference for other engineering departments in China.
引用
收藏
页码:13051 / 13066
页数:16
相关论文
共 50 条
  • [41] Using machine learning to predict factors affecting academic performance: the case of college students on academic probation
    Al-Alawi, Lamees
    Al Shaqsi, Jamil
    Tarhini, Ali
    Al-Busaidi, Adil S.
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2023, 28 (10) : 12407 - 12432
  • [42] Molecular insights on ABL kinase activation using tree-based machine learning models and molecular docking
    Philipe Oliveira Fernandes
    Diego Magno Martins
    Aline de Souza Bozzi
    João Paulo A. Martins
    Adolfo Henrique de Moraes
    Vinícius Gonçalves Maltarollo
    [J]. Molecular Diversity, 2021, 25 : 1301 - 1314
  • [43] Using machine learning to predict factors affecting academic performance: the case of college students on academic probation
    Lamees Al-Alawi
    Jamil Al Shaqsi
    Ali Tarhini
    Adil S. Al-Busaidi
    [J]. Education and Information Technologies, 2023, 28 : 12407 - 12432
  • [44] Efficient neural network- and tree-based machine learning models for predicting shear capacity of RC slender walls
    Nguyen S.-M.
    Tran N.-L.
    Nguyen T.-H.
    Tran V.-B.
    Nguyen D.-D.
    [J]. Asian Journal of Civil Engineering, 2024, 25 (4) : 3595 - 3609
  • [45] Molecular insights on ABL kinase activation using tree-based machine learning models and molecular docking
    Fernandes, Philipe Oliveira
    Martins, Diego Magno
    de Souza Bozzi, Aline
    Martins, Joao Paulo A.
    de Moraes, Adolfo Henrique
    Maltarollo, Vinicius Goncalves
    [J]. MOLECULAR DIVERSITY, 2021, 25 (03) : 1301 - 1314
  • [46] Discussion on the tree-based machine learning model in the study of landslide susceptibility
    Liu, Qiang
    Tang, Aiping
    Huang, Ziyuan
    Sun, Lixin
    Han, Xiaosheng
    [J]. NATURAL HAZARDS, 2022, 113 (02) : 887 - 911
  • [47] Discussion on the tree-based machine learning model in the study of landslide susceptibility
    Qiang Liu
    Aiping Tang
    Ziyuan Huang
    Lixin Sun
    Xiaosheng Han
    [J]. Natural Hazards, 2022, 113 : 887 - 911
  • [48] Predicting academic performance of students from VLE big data using deep learning models
    Waheed, Hajra
    Hassan, Saeed-Ul
    Aljohani, Naif Radi
    Hardman, Julie
    Alelyani, Salem
    Nawaz, Raheel
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2020, 104
  • [49] Predicting Joining Behavior of Freshmen Students using Machine Learning - A Case Study
    Kumar, Pawan
    Kumar, Varun
    Sobti, Rajeev
    [J]. 2020 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2020), 2020, : 141 - 145
  • [50] Simulation-Based Machine Learning for Predicting Academic Performance Using Big Data
    Zhang, Cheng
    Yang, Jinming
    Li, Mingxuan
    Deng, Meng
    [J]. INTERNATIONAL JOURNAL OF GAMING AND COMPUTER-MEDIATED SIMULATIONS, 2024, 16 (01)