Missing data analysis using machine learning methods to predict the performance of technical students

被引:3
|
作者
Melo Junior, Gilberto de [1 ]
Alcala, Symone G. Soares [2 ]
Furriel, Geovanne Pereira [1 ]
Vieira, Silvio L. [1 ]
机构
[1] Univ Fed Goias, Elect & Comp Engn, Goiania, Go, Brazil
[2] Univ Fed Goias, Fac Sci & Technol, Goiania, Go, Brazil
来源
关键词
Missing Data Treatment Methods; Machine Learning; Evaluation of algorithms; CLASSIFICATION;
D O I
10.5335/rbca.v12i2.10565
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Machine learning (ML) has become an emerging technology able to solve problems in many areas, including education, medicine, robotic and aerospace. ML is a specific field of artificial intelligence which designs computational models able to learn from data. However, to develop a ML model, it is necessary to ensure data quality, since real-world data is incomplete, noisy and inconsistent. This paper evaluates state-of-the-art missing data treatment methods using ML algorithms to classify the performance of technical high school students at the Federal Institute of Goias in Brazil. The aim is to provide an efficient computational tool to aid educational performance that allows the educators to verify the student's tendency to fail. The results indicate that ignoring and discarding method outperforms other missing data treatment methods. Moreover, the tests reveal that Sequential Minimal Optimization, Neural Networks and Bagging outperform the other ML algorithms, such as Naive Bayes and Decision tree, in terms of classification accuracy.
引用
收藏
页码:134 / 143
页数:10
相关论文
共 50 条
  • [1] Prediction of missing temperature data using different machine learning methods
    Okan Mert Katipoğlu
    [J]. Arabian Journal of Geosciences, 2022, 15 (1)
  • [2] ANALYSIS OF LArTPC DATA USING MACHINE LEARNING METHODS
    Falko, A.
    Gogota, O.
    Yermolenko, R.
    Kadenko, I.
    [J]. JOURNAL OF PHYSICAL STUDIES, 2024, 28 (01):
  • [3] Analysis of Fleet Data Using Machine Learning Methods
    Ebel, André
    Riemer, Thomas
    Reuss, Hans-Christian
    [J]. Tongji Daxue Xuebao/Journal of Tongji University, 2021, 49 : 186 - 193
  • [4] A Machine Learning Model to Predict the Performance of University Students
    Canagareddy, Derinsha
    Subarayadu, Khuslendra
    Hurbungs, Visham
    [J]. SMART AND SUSTAINABLE ENGINEERING FOR NEXT GENERATION APPLICATIONS, 2019, 561 : 313 - 322
  • [5] Using Machine Learning Methods to Understand Students' Performance in an Engineering Course
    Kwan, Wei Lek
    Pee, Gim-Yang Maggie
    Koh, Li Ling Apple
    Tan, Mei Xuan
    [J]. PROCEEDINGS OF THE 2022 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON 2022), 2022, : 537 - 540
  • [6] Using Machine Learning Methods to Predict Experimental High Throughput Screening Data
    Mballo, Cherif
    Makarenkov, Vladimir
    [J]. COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2010, 13 (05) : 430 - 441
  • [7] Missing Data Analysis Using Statistical and Machine Learning Methods in Facility-Based Maternal Health Records
    Memon S.M.Z.
    Wamala R.
    Kabano I.H.
    [J]. SN Computer Science, 3 (5)
  • [8] Data Oriented Financial Analysis using Machine Learning Methods
    Altan, Cisem
    Kalayci, Sacide
    Koroglu, Bilge
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 37 - 41
  • [9] Analysis of Machine Learning Based Imputation of Missing Data
    Rizvi, Syed Tahir Hussain
    Latif, Muhammad Yasir
    Amin, Muhammad Saad
    Telmoudi, Achraf Jabeur
    Shah, Nasir Ali
    [J]. CYBERNETICS AND SYSTEMS, 2023,
  • [10] Performance Of Soil Prediction Using Machine Learning For Data Clustering Methods
    Rajeshwari, M.
    Shunmuganathan, N.
    Sankarasubramanian, R.
    [J]. JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (02) : 825 - 831