Student Performance Prediction with Decision Tree Ensembles and Feature Selection Techniques

被引:0
|
作者
Ahmad, Amir [1 ]
Ray, Santosh [2 ]
Khan, Md. Tabrej [3 ]
Nawaz, Ali [1 ]
机构
[1] United Arab Emirates Univ, Coll Informat Technol, Al Ain, U Arab Emirates
[2] Liwa Coll, Fac Informat Technol, Abu Dhabi, U Arab Emirates
[3] Pacific Acad Higher Educ & Res Univ, Fac Comp Sci, Udaipur, Rajasthan, India
关键词
Student dropout prediction; classification; ensembles; decision trees; imbalanced class; feature selection; CLASSIFICATION; PROJECTION; SMOTE;
D O I
10.1142/S0219649225500169
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The prevalence of student dropout in academic settings is a serious issue that affects individuals and society as a whole. Timely intervention and support can be provided to such students if we get an accurate prediction of student performance. However, class imbalance and data complexity in education data are major challenges for traditional predictive analytics. Our research focusses on utilising machine learning techniques to predict student performance while handling imbalanced datasets. To address the imbalanced class problem, we employed both oversampling and undersampling techniques in our decision tree ensemble methods for the risk classification of prospective students. The effectiveness of classifiers was evaluated by varying the sizes of the ensembles and the oversampling and undersampling ratios. Additionally, we conducted experiments to integrate the feature selection processes with the best ensemble classifiers to further enhance the prediction. Based on the extensive experimentation, we concluded that ensemble methods such as Random Forest, Bagging, and Random Undersampling Boosting perform well in terms of performance measures such as Recall, Precision, F1-score, Area Under the Receiver Operating Characteristic Curve, and Geometric Mean. The F1-score of 0.849 produced by the Random Undersampling Boost classifier in conjunction with the Least Absolute Shrinkage and Selection Operator feature selection method indicates that this ensemble produces the best results.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Credal-Decision-Tree-Based Ensembles for Spatial Prediction of Landslides
    Gui, Jingyun
    Perez-Rey, Ignacio
    Yao, Miao
    Zhao, Fasuo
    Chen, Wei
    WATER, 2023, 15 (03)
  • [32] Genetic Algorithm Based Feature Selection With Ensemble Methods For Student Academic Performance Prediction
    Farissi, Al
    Dahlan, Halina Mohamed
    Samsuryadi
    3RD FORUM IN RESEARCH, SCIENCE, AND TECHNOLOGY (FIRST 2019) INTERNATIONAL CONFERENCE, 2020, 1500
  • [33] Criteria Ensembles in Feature Selection
    Somol, Petr
    Grim, Jiri
    Pudil, Pavel
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2009, 5519 : 304 - 313
  • [34] Effective Feature Prediction Models for Student Performance
    Alsubhi, Bashayer
    Alharbi, Basma
    Aljojo, Nahla
    Banjar, Ameen
    Tashkandi, Araek
    Alghoson, Abdullah
    Al-Tirawi, Anas
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2023, 13 (05) : 11937 - 11944
  • [35] Relational tree ensembles and feature rankings
    Petkovic, Matej
    Ceci, Michelangelo
    Pio, Gianvito
    Skrlj, Blaz
    Kersting, Kristian
    Dzeroski, Sago
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [36] The influence of the pool of candidates on the performance of selection and combination techniques in ensembles
    Coelho, Guilhenne P.
    Von Zuben, Fernando J.
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 5132 - 5139
  • [37] Empirical evaluation of the performance of data sampling and feature selection techniques for software fault prediction
    Rathi, Sonika Chandrakant
    Misra, Sanjay
    Colomo-Palacios, Ricardo
    Adarsh, R.
    Neti, Lalita Bhanu Murthy
    Kumar, Lov
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
  • [38] Decision tree with optimal feature selection for hearing fault detection
    Nguyen, Ngoc-Tu
    Lee, Hong-Hee
    JOURNAL OF POWER ELECTRONICS, 2008, 8 (01) : 101 - 107
  • [39] Interactive Reinforcement Learning for Feature Selection With Decision Tree in the Loop
    Fan, Wei
    Liu, Kunpeng
    Liu, Hao
    Ge, Yong
    Xiong, Hui
    Fu, Yanjie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1624 - 1636
  • [40] Feature Selection Methods Based on Decision Rule and Tree Models
    Paja, Wieslaw
    INTELLIGENT DECISION TECHNOLOGIES 2016, PT II, 2016, 57 : 63 - 70