Research on learning behavior patterns from the perspective of educational data mining: Evaluation, prediction and visualization

被引:8
|
作者
Feng, Guiyun [1 ]
Fan, Muwei [1 ]
机构
[1] Guizhou Univ, Sch Management, Guiyang 550025, Peoples R China
关键词
Educational data mining; Learning behavior patterns; Evaluation methodologies; Classification algorithms; STUDENT PERFORMANCE; MODEL; LOGITBOOST;
D O I
10.1016/j.eswa.2023.121555
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid growth of educational data creates the requirement to mine useful information from learning behavior patterns. The development of data mining technology makes educational data mining possible. The paper intends to use a public educational data set to study learning behavior patterns from the perspective of educational data mining, so as to promote the innovation of educational management. Firstly, in order to reduce the dimension of data analysis that facilitates the improvement in efficiency, principal component analysis is carried out to reduce the number of attributes in the data set. The significant attributes in the rotating principal component matrix rather than principal components which are not closely related to learning behavior patterns are extracted as the research variables. Then, a pseudo statistic is proposed to determine the number of clusters and the preprocessed data set is clustered according to the extracted attributes. The clustering results are applied to add class labels to the data, which is convenient for the later data training. Finally, six classification algorithms J48, K-Nearest Neighbor, Bayes Net, Random Forest, Support Vector Machine and Logit Boost are used to train the data with labels and build prediction models. At the same time, the performance and applicable conditions of six classifiers in terms of accuracy, efficiency, error, and so on are discussed and compared. It is found that the performance of the integrated algorithm is better than that of a single classifier. In the integrated algorithm, compared with Random Forest, the running time of Logit Boost is shorter.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] iCreate: Mining Creative Thinking Patterns from Contextualized Educational Data
    Shabani, Nasrin
    Beheshti, Amin
    Farhood, Helia
    Bower, Matt
    Garrett, Michael
    Rokny, Hamid Alinejad
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS AND DOCTORAL CONSORTIUM, PT II, 2022, 13356 : 352 - 356
  • [22] Visualization analysis of educational data statistics based on big data mining
    Yuan, Yaodong
    Xu, Hongyan
    Krishnamurthy, M.
    Vijayakumar, P.
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (03) : 1785 - 1793
  • [23] Educational Data Mining & Students' Performance Prediction
    Abu Saa, Amjad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (05) : 212 - 220
  • [24] Research on Data Mining Algorithms for Automotive Customers' Behavior Prediction Problem
    Huang Lan
    Zhou Chun-guang
    Zhou Yu-qin
    Wang Zhe
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 677 - 681
  • [25] Applications of educational data mining and learning analytics on data from cybersecurity training
    Svabensky, Valdemar
    Vykopal, Jan
    Celeda, Pavel
    Kraus, Lydia
    EDUCATION AND INFORMATION TECHNOLOGIES, 2022, 27 (09) : 12179 - 12212
  • [26] Applications of educational data mining and learning analytics on data from cybersecurity training
    Valdemar Švábenský
    Jan Vykopal
    Pavel Čeleda
    Lydia Kraus
    Education and Information Technologies, 2022, 27 : 12179 - 12212
  • [27] From metabolism and behavior to respiratory physiology: and educational an research perspective
    Schlenker, Evelyn Heymann
    ADVANCES IN PHYSIOLOGY EDUCATION, 2020, 44 (04) : 540 - 544
  • [28] A Review on Visualization of Educational Data in Online Learning
    Dewan, M. Ali Akber
    Pachon, Walter Moreno
    Lin, Fuhua
    LEARNING TECHNOLOGIES AND SYSTEMS, ICWL 2020, SETE 2020, 2021, 12511 : 15 - 24
  • [29] Research on mining collaborative behaviour patterns of dynamic supply chain network from the perspective of big data
    Leng, Kaijun
    Jing, Linbo
    Lin, I-Ching
    Chang, Sheng-Hung
    Lam, Anthony
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (Suppl 1): : 113 - 121
  • [30] On Prediction of Research Excellence using Data Mining and Deep Learning Techniques
    Urooj, Amber
    Khan, Hikmat Ullah
    Iqbal, Saqib
    Althebyan, Qutaibah
    2021 EIGHTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORK ANALYSIS, MANAGEMENT AND SECURITY (SNAMS), 2021, : 50 - 55