Predictive Model to Analyze Real and Synthetic Data for Learners' Performance Prediction Using Regression Techniques

被引:0
|
作者
Shabnam, Aras S. J. [1 ]
Ramachandriah, Tanuja [1 ]
Haladappa, Manjula S. [1 ]
机构
[1] Bangalore Univ, UVCE, Bangalore, Karnataka, India
来源
ONLINE LEARNING | 2025年 / 29卷 / 01期
关键词
Learners'performance prediction; educational data analytics; predictive models; privacy preservation; synthetic data generation; regression analysis;
D O I
10.24059/olj.v29i1.4390
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
redicting learner performance with precision is critical within educational systems, offering a basis for tailored interventions and instruction. The advent of big data analytics presents an opportunity to employ Machine Learning (ML) techniques to this end. Real-world dataavailability is often hampered by privacy concerns, prompting a shift towards synthetic data generation. This study presents an empirical comparison of real, synthetic, and hybrid (real + synthetic) datasets in forecasting learner performance, deploying an array of regression-based ML algorithms, including Random Forest, Gradient Boosting, Support Vector Regression, XGBoost, and K-nearest Neighbor. Our methodology encompasses the generation of synthetic data via generative model, followed by the application of these algorithms to each dataset. The models are evaluated using precision metrics to assess their predictive accuracy. The study reveals that synthetic data can match real data in terms of predictive performance, with hybrid datasets achieving an accuracy of up to 87.76%, highlighting the effectiveness of combining both data types. These findings highlight the potential of synthetic data as an effective alternative when access to actual data is limited, promoting progress in educational technology andML.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Predictive Big Data Analytics Using Multiple Linear Regression Model
    Khine, Kyi Lai Lai
    Nyunt, Thi Thi Soe
    BIG DATA ANALYSIS AND DEEP LEARNING APPLICATIONS, 2019, 744 : 9 - 19
  • [22] USING PREDICTIVE MODELS TO ANALYZE LUNG CANCER DATA
    Tang, G.
    VALUE IN HEALTH, 2009, 12 (03) : A36 - A37
  • [23] A surrogate model based on feature selection techniques and regression learners to improve soybean yield prediction in southern France
    Corrales, David Camilo
    Schoving, Celine
    Raynal, Helene
    Debaeke, Philippe
    Journet, Etienne-Pascal
    Constantin, Julie
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 192
  • [24] Classification model To Predict the Learners' Academic Performance using Big Data
    Rajeswari, S.
    Lawrance, R.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [25] Prediction model to analyze the performance of VMD desalination process
    Yang, Chaohuan
    Peng, Xin
    Zhao, Yajing
    Wang, Xin
    Fu, Jingxia
    Liu, Kai
    Li, Yingdong
    Li, Pingli
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 132
  • [26] Using Pattern Recognition Techniques to Analyze Educational Data
    Vieira, Camilo
    Magana, Alejandra J.
    Boutin, Mireille
    2017 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2017,
  • [27] A novel range prediction model using gradient descent optimization and regression techniques
    Kumar V.
    Krishna P.R.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (10) : 14277 - 14289
  • [28] Prediction model to analyze the performance of VMD desalination process
    Yang, Chaohuan
    Peng, Xin
    Zhao, Yajing
    Wang, Xin
    Fu, Jingxia
    Liu, Kai
    Li, Yingdong
    Li, Pingli
    Computers and Chemical Engineering, 2020, 132
  • [29] BuildingLAB: A tool to analyze performance of model predictive controllers for buildings
    Cigler, Jiri
    Tomasko, Pavel
    Siroky, Jan
    ENERGY AND BUILDINGS, 2013, 57 : 34 - 41
  • [30] Machine Learning Approaches for Prediction of Facial Rejuvenation Using Real and Synthetic Data
    Shah, Syed Afaq Ali
    Bennamoun, Mohammed
    Molton, Michael K.
    IEEE ACCESS, 2019, 7 : 23779 - 23787