Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization

Cited by: 8
Authors
Feng, Guiyun [1]
Fan, Muwei [1]
Affiliation
[1] Guizhou University, School of Management, Guiyang 550025, People's Republic of China
Keywords
Educational data mining; Learning behavior patterns; Evaluation methodologies; Classification algorithms; Student performance; Model; LogitBoost
DOI
10.1016/j.eswa.2023.121555
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The rapid growth of educational data creates a need to mine useful information from learning behavior patterns, and advances in data mining technology make educational data mining feasible. This paper uses a public educational data set to study learning behavior patterns from the perspective of educational data mining, with the aim of promoting innovation in educational management. First, to reduce the dimensionality of the analysis and improve efficiency, principal component analysis is carried out to reduce the number of attributes in the data set. The significant attributes in the rotated principal component matrix, rather than the principal components, which are not closely related to learning behavior patterns, are extracted as the research variables. Then, a pseudo statistic is proposed to determine the number of clusters, and the preprocessed data set is clustered on the extracted attributes. The clustering results are used to add class labels to the data, which facilitates the subsequent training. Finally, six classification algorithms (J48, K-Nearest Neighbor, Bayes Net, Random Forest, Support Vector Machine, and Logit Boost) are trained on the labeled data to build prediction models. The performance and applicable conditions of the six classifiers are discussed and compared in terms of accuracy, efficiency, error, and other measures. The integrated (ensemble) algorithms are found to perform better than the single classifiers, and among the integrated algorithms Logit Boost has a shorter running time than Random Forest.
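The cluster-then-classify idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the principal component analysis step is omitted, the number of clusters is fixed at 2 instead of being chosen by the paper's pseudo statistic (which the abstract does not define), k-means with a farthest-first initialization stands in for the unspecified clustering method, and K-Nearest Neighbor is used as one of the six named classifiers. All function names (`farthest_first`, `kmeans`, `knn_predict`, `make_toy_data`) are hypothetical.

```python
import math
import random

def farthest_first(points, k):
    """Deterministic seeding: each new center is the point farthest from those chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers

def kmeans(points, k, iters=50):
    """Plain k-means (Lloyd's algorithm); returns one cluster index per point."""
    centers = farthest_first(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by Euclidean distance.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centers.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep the center of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    return labels

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k nearest training points."""
    nearest = sorted(zip(train_x, train_y), key=lambda xy: math.dist(xy[0], query))[:k]
    votes = [y for _, y in nearest]
    return max(set(votes), key=votes.count)

def make_toy_data(seed=1):
    """Two well-separated synthetic groups of 2-D points (made-up data, not the paper's)."""
    rng = random.Random(seed)
    low = [(rng.gauss(0, 0.5), rng.gauss(0, 0.5)) for _ in range(30)]
    high = [(rng.gauss(10, 0.5), rng.gauss(10, 0.5)) for _ in range(30)]
    data = low + high
    rng.shuffle(data)
    return data

data = make_toy_data()
cluster_labels = kmeans(data, k=2)   # clustering supplies the class labels
split = int(0.7 * len(data))         # hold out 30% of the points for evaluation
train_x, test_x = data[:split], data[split:]
train_y, test_y = cluster_labels[:split], cluster_labels[split:]
predictions = [knn_predict(train_x, train_y, q) for q in test_x]
accuracy = sum(p == t for p, t in zip(predictions, test_y)) / len(test_y)
print(f"hold-out accuracy: {accuracy:.2f}")
```

Because the labels come from the clustering itself, the hold-out accuracy here measures how well the classifier reproduces the cluster assignment, not how well it predicts an independent ground truth. Comparing several classifiers on the same labeled split, as the abstract describes, would mean swapping `knn_predict` for other models.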
Pages: 11