Predictive Model to Analyze Real and Synthetic Data for Learners' Performance Prediction Using Regression Techniques

Cited by: 0
Authors
Shabnam, Aras S. J. [1]
Ramachandriah, Tanuja [1]
Haladappa, Manjula S. [1]
Affiliation
[1] Bangalore Univ, UVCE, Bangalore, Karnataka, India
Source
ONLINE LEARNING | 2025 / Vol. 29 / Issue 01
Keywords
Learners' performance prediction; educational data analytics; predictive models; privacy preservation; synthetic data generation; regression analysis
DOI
10.24059/olj.v29i1.4390
Chinese Library Classification
G40 [Education];
Subject classification codes
040101 ; 120403 ;
Abstract
Predicting learner performance with precision is critical within educational systems, offering a basis for tailored interventions and instruction. The advent of big data analytics presents an opportunity to employ Machine Learning (ML) techniques to this end. Real-world data availability is often hampered by privacy concerns, prompting a shift towards synthetic data generation. This study presents an empirical comparison of real, synthetic, and hybrid (real + synthetic) datasets in forecasting learner performance, deploying an array of regression-based ML algorithms, including Random Forest, Gradient Boosting, Support Vector Regression, XGBoost, and K-nearest Neighbor. Our methodology encompasses the generation of synthetic data via a generative model, followed by the application of these algorithms to each dataset. The models are evaluated using precision metrics to assess their predictive accuracy. The study reveals that synthetic data can match real data in terms of predictive performance, with hybrid datasets achieving an accuracy of up to 87.76%, underscoring the effectiveness of combining both data types. These findings highlight the potential of synthetic data as an effective alternative when access to actual data is limited, promoting progress in educational technology and ML.
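The comparison described in the abstract (train on real, synthetic, and hybrid data, then score each on held-out real data) can be sketched roughly as follows. This is a minimal illustration only, not the authors' pipeline: the toy dataset, the bivariate Gaussian used as the generative model, the hand-written k-nearest-neighbor regressor, the mean absolute error metric, and every function name are assumptions made for this example.

```python
import math
import random
import statistics


def make_real_data(n, rng):
    """Toy stand-in for real learner records: (study_hours, score) pairs.
    The linear relationship and noise level are assumptions for illustration."""
    rows = []
    for _ in range(n):
        hours = rng.uniform(0.0, 10.0)
        score = 40.0 + 5.0 * hours + rng.gauss(0.0, 5.0)
        rows.append((hours, score))
    return rows


def fit_gaussian(rows):
    """Fit a bivariate Gaussian (means, std devs, correlation) to the rows."""
    xs = [x for x, _ in rows]
    ys = [y for _, y in rows]
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sx, sy = statistics.pstdev(xs), statistics.pstdev(ys)
    cov = sum((x - mx) * (y - my) for x, y in rows) / len(rows)
    return mx, my, sx, sy, cov / (sx * sy)


def sample_synthetic(params, n, rng):
    """Draw synthetic rows: x from its marginal, then y from y given x."""
    mx, my, sx, sy, r = params
    rows = []
    for _ in range(n):
        x = rng.gauss(mx, sx)
        y = rng.gauss(my + r * sy / sx * (x - mx), sy * math.sqrt(1.0 - r * r))
        rows.append((x, y))
    return rows


def knn_predict(train, x, k=5):
    """Predict by averaging the targets of the k nearest training rows."""
    nearest = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    return statistics.fmean(y for _, y in nearest)


def mean_absolute_error(train, test, k=5):
    """Mean absolute error of the k-NN regressor on the test rows."""
    return statistics.fmean(abs(knn_predict(train, x, k) - y) for x, y in test)


def compare(seed=0):
    """Train on real, synthetic, and hybrid data; score on held-out real data."""
    rng = random.Random(seed)
    real = make_real_data(300, rng)
    train_real, test_real = real[:200], real[200:]
    synthetic = sample_synthetic(fit_gaussian(train_real), 200, rng)
    datasets = {
        "real": train_real,
        "synthetic": synthetic,
        "hybrid": train_real + synthetic,
    }
    return {name: mean_absolute_error(train, test_real)
            for name, train in datasets.items()}


if __name__ == "__main__":
    for name, mae in compare().items():
        print(f"{name:9s} MAE = {mae:.2f}")
```

Scoring all three training sets against the same held-out real rows keeps the comparison fair; the synthetic rows are generated only from the real training split, so no test information leaks into them.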
Pages: 24
Related papers
50 in total
  • [41] USE OF A FUEL PERFORMANCE MODEL TO ANALYZE MATERIALS DATA
    HOMAN, FJ
    TRANSACTIONS OF THE AMERICAN NUCLEAR SOCIETY, 1971, 14 (02): : 629 - &
  • [42] Intelligent Breast Cancer Prediction Model Using Data Mining Techniques
    Shen, Runjie
    Yang, Yuanyuan
    Shao, Fengfeng
    2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 1, 2014, : 384 - 387
  • [43] Trajectory Prediction for Using Real Data and Real Meteorological Data
    Kim, Yong Kyun
    Han, Jong Wook
    Park, Hyodal
    UBIQUITOUS COMPUTING APPLICATION AND WIRELESS SENSOR, 2015, 331 : 89 - 103
  • [44] Using Partial Least Squares Regression to Analyze Cellular Response Data
    Kreeger, Pamela K.
    SCIENCE SIGNALING, 2013, 6 (271)
  • [45] Gaussian Process Regression for a PMV Prediction Model using Environmental Monitoring Data
    Yoon, Young Ran
    Moon, Hyeun Jun
    Kim, Sun Ho
    Kim, Jeong Won
    PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 2540 - 2545
  • [46] Inference for Multivariate Regression Model Based on Multiply Imputed Synthetic Data Generated via Posterior Predictive Sampling
    Moura, Ricardo
    Sinha, Bimal
    Coelho, Carlos A.
    APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2017, 1836
  • [47] Performance Prediction and Optimization of Ramjet for Projectiles Using Support Vector Regression Model
    Zhang N.
    Shi J.
    Wang Z.
    Zhao X.
    Binggong Xuebao/Acta Armamentarii, 2023, 44 (10): : 2944 - 2953
  • [48] Development of a Predictive Model for Progressive Supranuclear Palsy Using Real World Data
    Viscidi, E.
    Zabar, Y.
    Dam, T.
    Juneja, M.
    Kupferman, J.
    Kupelian, V.
    Eaton, S.
    Litvan, I.
    Hoglinger, G.
    MOVEMENT DISORDERS, 2019, 34 : S762 - S762
  • [49] Impact on Inference Model Performance for ML Tasks Using Real-Life Training Data and Synthetic Training Data from GANs
    Faltings, Ulrike
    Bettinger, Tobias
    Barth, Swen
    Schaefer, Michael
    INFORMATION, 2022, 13 (01)
  • [50] Performance study of model predictive control with reference prediction for real-time hybrid simulation
    Zeng, Chen
    Guo, Wei
    Shao, Ping
    JOURNAL OF VIBRATION AND CONTROL, 2024, 30 (7-8) : 1659 - 1673