Ensemble models based on CNN and LSTM for dropout prediction in MOOC

被引:6
|
作者
Talebi, Kowsar [1 ]
Torabi, Zeinab [1 ]
Daneshpour, Negin [1 ]
机构
[1] Shahid Rajaee Teacher Training Univ, Fac Comp Engn, Tehran, Iran
关键词
Student dropout; Ensemble models; Convolutional neural network; Long -short term memory; Massive open online courses;
D O I
10.1016/j.eswa.2023.121187
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Massive Open Online Courses (MOOCs) have gained a lot of popularity recently. Despite the large number of students enrolled in these courses, a large percentage drop out. Due to this, predicting student dropout has taken on fundamental importance in this area. Predicting dropout early allows course organizers and educators to intervene and provide targeted support to at-risk students. They can offer additional resources, personalized assistance, or interventions tailored to address specific challenges faced by students, increasing their chances of successful course completion. This study first pre-processes the dataset to create a thirty-day correlation matrix for each learner, enabling early dropout prediction by the end of the first week. Then, six new models have been proposed using ensemble classification techniques with Convolutional Neural Network (CNN) and Long-Short Term Memory (LSTM). CNN is used for automatic feature extraction, while LSTM considers the time series aspect of the data to improve early prediction performance. As ensemble classifiers can reduce the variance of prediction errors, using ensemble classifiers in addition to neural networks can enhance accuracy and F1 score without overfitting. The application of these techniques results in more accurate week-by-week dropout prediction. The experimental results on the KDD Cup 2015 dataset (representing XuetangX, a MOOC platform in China with 39 courses, 79,186 students, and 120,542 registered students, with 8,157,277 records collected over 30 days) show that all Bagging models improve performance of their base models. In one of the proposed models (Bagging LSTM-LSTM), at the end of the fifth week, the accuracy reached 94%, and the average accuracy reached 91%. Also, precision and recall reached an average of 92%, and F1 score reached 98%, which shows a significant improvement compared to previous researches.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Sinter Quality Prediction Based on Multi-Features CNN + LSTM
    Zhiwei Zhao
    Weijian Feng
    Song Liu
    Zhijian Xiong
    Yadi Zhao
    Huiyan Zhang
    Weifang Wang
    [J]. Arabian Journal for Science and Engineering, 2024, 49 : 4271 - 4286
  • [42] Prediction of Passenger Flow Based on CNN-LSTM Hybrid Model
    Wang Yu
    Wang Zhifei
    Wang Hongye
    Zhnag Junfeng
    Feng Ruilong
    [J]. 2019 12TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2019), 2019, : 132 - 135
  • [43] Motion trajectory prediction based on a CNN-LSTM sequential model
    Guo Xie
    Anqi Shangguan
    Rong Fei
    Wenjiang Ji
    Weigang Ma
    Xinhong Hei
    [J]. Science China Information Sciences, 2020, 63
  • [44] Spatial Simulation and Prediction of Air Temperature Based on CNN-LSTM
    Hou, Jingwei
    Wang, Yanjuan
    Hou, Bo
    Zhou, Ji
    Tian, Qiong
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [45] Research on Traffic Crash Prediction Based on CNN-LSTM Model
    Wang, Shaohua
    Zhang, Sinan
    Lu, Lei
    Zhang, Keke
    Liu, Xia
    Chen, Ning
    [J]. CICTP 2023: INNOVATION-EMPOWERED TECHNOLOGY FOR SUSTAINABLE, INTELLIGENT, DECARBONIZED, AND CONNECTED TRANSPORTATION, 2023, : 1185 - 1193
  • [46] Motion trajectory prediction based on a CNN-LSTM sequential model
    Guo XIE
    Anqi SHANGGUAN
    Rong FEI
    Wenjiang JI
    Weigang MA
    Xinhong HEI
    [J]. Science China(Information Sciences), 2020, 63 (11) : 248 - 268
  • [47] Motion trajectory prediction based on a CNN-LSTM sequential model
    Xie, Guo
    Shangguan, Anqi
    Fei, Rong
    Ji, Wenjiang
    Ma, Weigang
    Hei, Xinhong
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (11)
  • [48] CNN-Bi-LSTM Based Household Energy Consumption Prediction
    Gaur, Kshitij
    Singh, Sandeep Kumar
    [J]. ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 233 - 237
  • [49] RESEARCH ON GREENHOUSE ENVIRONMENT PREDICTION BASED ON GCAKF-CNN-LSTM
    Liu, Tianhong
    Qiao, Xianzhu
    Liu, Sixing
    Qi, Shengli
    [J]. Applied Engineering in Agriculture, 2024, 40 (02) : 181 - 187
  • [50] Logging curve prediction method based on CNN-LSTM-attention
    Shi, Mingjiang
    Yang, Bohan
    Chen, Rui
    Ye, Dingsheng
    [J]. EARTH SCIENCE INFORMATICS, 2022, 15 (04) : 2119 - 2131