Dropout prediction model in MOOC based on clickstream data and student sample weight

Cited by: 17
Authors
Jin, Cong [1]
Affiliation
[1] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China
Keywords
MOOC; Dropout prediction; Initial weight calculation; Intelligent optimization; Clickstream data
DOI
10.1007/s00500-021-05795-1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The high dropout rate of massive open online courses (MOOCs) has seriously hindered their popularity and adoption, so predicting student dropout effectively enough to allow early intervention has become an active research topic. Students in MOOCs differ widely in learning behavior, learning habits, and study time. Because the performance of machine learning-based classifiers depends heavily on the quality of the training samples, different student samples affect the predictive performance of a machine learning-based dropout prediction model (DPM) differently. To address this problem, this paper proposes a new machine learning-based DPM. Since the traditional neighborhood concept is independent of the sample labels, a new neighborhood definition, the max neighborhood, is first given; it depends not only on the distance between samples but also on their labels. An algorithm for calculating the initial weight of each student sample is then developed from the definition of the max neighborhood, in contrast to the common practice of selecting initial values at random. Next, the initial weights of the student samples are further optimized using an intelligent optimization method. Finally, classifiers trained on the weighted training samples are used as the DPM. Experimental results on public data sets, assessed both by direct observation and by statistical testing, indicate that training sample weighting and intelligent optimization can significantly improve the predictive performance of the DPM.
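The abstract does not state the formal definition of the max neighborhood, so the following is only a hypothetical illustration of a label-aware neighborhood, not the paper's algorithm. It assumes the neighborhood of a sample extends to the nearest sample with a different label, and that the initial weight is proportional to the number of same-label samples inside it. The function name `label_aware_weights` and the toy data are made up for this sketch.

```python
import math

def label_aware_weights(X, y):
    """Hypothetical label-aware initial weights (an assumed reading,
    not the definition used in the paper)."""
    n = len(X)
    counts = []
    for i in range(n):
        # Assumed radius: distance to the nearest sample with a different label.
        radius = min(
            (math.dist(X[i], X[j]) for j in range(n) if y[j] != y[i]),
            default=math.inf,
        )
        # Count same-label samples strictly inside that radius
        # (the sample itself is included whenever the radius is positive).
        count = sum(
            1 for j in range(n)
            if y[j] == y[i] and math.dist(X[i], X[j]) < radius
        )
        counts.append(count)
    total = sum(counts)
    if total == 0:
        # Degenerate case: fall back to uniform weights.
        return [1.0 / n] * n
    # Normalize so the weights sum to 1.
    return [c / total for c in counts]

# Toy one-dimensional features: 0 = retained, 1 = dropped out (made-up data).
X = [(0.0,), (1.0,), (2.0,), (10.0,), (11.0,), (3.0,)]
y = [0, 0, 0, 1, 1, 1]
weights = label_aware_weights(X, y)
```

Under this assumed rule, samples that lie close to the opposite class receive smaller initial weights than samples deep inside their own class. Weights of this kind could then be refined by an optimizer and passed to any classifier that accepts per-sample weights during training.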
Pages: 8971-8988
Number of pages: 18