Student Performance Prediction with Decision Tree Ensembles and Feature Selection Techniques

被引:0
|
作者
Ahmad, Amir [1 ]
Ray, Santosh [2 ]
Khan, Md. Tabrej [3 ]
Nawaz, Ali [1 ]
机构
[1] United Arab Emirates Univ, Coll Informat Technol, Al Ain, U Arab Emirates
[2] Liwa Coll, Fac Informat Technol, Abu Dhabi, U Arab Emirates
[3] Pacific Acad Higher Educ & Res Univ, Fac Comp Sci, Udaipur, Rajasthan, India
关键词
Student dropout prediction; classification; ensembles; decision trees; imbalanced class; feature selection; CLASSIFICATION; PROJECTION; SMOTE;
D O I
10.1142/S0219649225500169
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The prevalence of student dropout in academic settings is a serious issue that affects individuals and society as a whole. Timely intervention and support can be provided to such students if we get an accurate prediction of student performance. However, class imbalance and data complexity in education data are major challenges for traditional predictive analytics. Our research focusses on utilising machine learning techniques to predict student performance while handling imbalanced datasets. To address the imbalanced class problem, we employed both oversampling and undersampling techniques in our decision tree ensemble methods for the risk classification of prospective students. The effectiveness of classifiers was evaluated by varying the sizes of the ensembles and the oversampling and undersampling ratios. Additionally, we conducted experiments to integrate the feature selection processes with the best ensemble classifiers to further enhance the prediction. Based on the extensive experimentation, we concluded that ensemble methods such as Random Forest, Bagging, and Random Undersampling Boosting perform well in terms of performance measures such as Recall, Precision, F1-score, Area Under the Receiver Operating Characteristic Curve, and Geometric Mean. The F1-score of 0.849 produced by the Random Undersampling Boost classifier in conjunction with the Least Absolute Shrinkage and Selection Operator feature selection method indicates that this ensemble produces the best results.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] A New Method for Stance Detection Based on Feature Selection Techniques and Ensembles of Classifiers
    Vychegzhanin, Sergey
    Kotelnikov, Evgeny
    IEEE ACCESS, 2021, 9 : 134899 - 134915
  • [42] Feature-selection ability of the decision-tree algorithm and the impact of feature-selection/extraction on decision-tree results based on hyperspectral data
    Wang, Y. Y.
    Li, J.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2008, 29 (10) : 2993 - 3010
  • [43] Earthquake Prediction in California Using Feature Selection Techniques
    Roiz-Pagador, Joaquin
    Chacon-Maldonado, Andres
    Ruiz, Roberto
    Asencio-Cortes, Gualberto
    16TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2021), 2022, 1401 : 728 - 738
  • [44] Feature Selection Techniques for Gender Prediction from Blogs
    Shahana, P. H.
    Outman, Bini
    2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 355 - 359
  • [45] Evaluation of Feature Selection Techniques for Software Maintenance Prediction
    Nanda, Sheena
    Bala, Anju
    Saxena, Sharad
    2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND INFORMATION TECHNOLOGY FOR SUSTAINABLE SOLUTION (CSITSS-2017), 2017, : 76 - 81
  • [46] HARDWARE IMPLEMENTATION OF DECISION TREE ENSEMBLES
    Struharik, Rastislav J. R.
    Novak, Ladislav A.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2013, 22 (05)
  • [47] Decision tree simplification for classifier ensembles
    Windeatt, T
    Ardeshir, G
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (05) : 749 - 776
  • [48] Variable randomness in decision tree ensembles
    Liu, Fei Tony
    Ting, Kai Ming
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 81 - 90
  • [49] Feature Selection with Dynamic Classifier Ensembles
    Kiziloz, Hakan Ezgi
    Deniz, Ayca
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 2038 - 2043
  • [50] Predictive Modeling of Student Performance Using RFECV-RF for Feature Selection and Machine Learning Techniques
    Harif, Abdellatif
    Kassimi, Moulay Abdellah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 231 - 240