Ensemble Feature Selection Framework for Paddy Yield Prediction in Cauvery Basin using Machine Learning Classifiers

被引:0
|
作者
Sathya, P. [1 ]
Gnanasekaran, P. [2 ,3 ]
机构
[1] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Applicat, Chennai, India
[2] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Informat Technol, Chennai, India
[3] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Informat Technol, Chennai 600048, India
来源
COGENT ENGINEERING | 2023年 / 10卷 / 02期
关键词
classification; ensemble learning; feature selection; paddy; yield; INFORMATION; ALGORITHM;
D O I
10.1080/23311916.2023.2250061
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Machine learning technique involves a significant amount of time and the model performance reduces in multi-dimensional datasets because of redundant features. Feature selection is a significant step in machine learning and involves the selection of a subset of relevant data feature from larger data feature, which further enhances model performance by simplification. Crop yield prediction is a significant application of machine learning, and feature selection plays a crucial role in it. To predict paddy yield accurately, weather, soil, and crop attributes are essential factors. Therefore, feature selection techniques are employed to identify relevant and non-redundant attributes from a larger dataset, which simplifies the prediction model. In this study, an ensemble feature selection method is proposed that selects an optimized subset of attributes by combining various attribute subsets based on mutual information between attributes and between attributes and the class. The proposed ensemble approach is validated using five classification techniques, including K-nearest neighbor, Random Forest, Support Vector Machine, Naive Bayes, and Bagging. Several evaluation metrics such as Accuracy, Error rate, Kappa, Precision, Recall, Specificity, and F1 score are used to assess the performance of the ensemble approach and compare it with other feature selection techniques. The experimental results indicate that the proposed ensemble approach with the Random Forest classifier outperforms other classifiers, with Accuracy and Error rate values of 0.9491 and 0.0509, respectively.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Sarcopenia feature selection and risk prediction using machine learning
    Yoo, Jun-Il
    Park, Chan-Ho
    Kim, Hyeonmok
    [J]. JOURNAL OF BONE AND MINERAL RESEARCH, 2019, 34 : 145 - 145
  • [22] Prediction of Heart Failure by using Machine Learning and Feature Selection
    Aslam, Muhammad Haseeb
    Hussain, Syed Fawad
    [J]. 2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 160 - 165
  • [23] Stroke Treatment Prediction Using Features Selection Methods and Machine Learning Classifiers
    Chourib, I.
    Guillard, G.
    Farah, I. R.
    Solaiman, B.
    [J]. IRBM, 2022, 43 (06) : 678 - 686
  • [24] Prediction of Radiation Induced Lymphedema for Head & Neck Cancer Patients Using Ensemble Feature Selection and Machine Learning
    Teo, P.
    Rogacki, K.
    Gopalakrishnan, M.
    Mittal, B.
    Das, I.
    Abazeed, M.
    Gentile, M.
    [J]. MEDICAL PHYSICS, 2022, 49 (06) : E266 - E267
  • [25] Combining feature selection, feature learning and ensemble learning for software fault prediction
    Hung Duy Tran
    Le Thi My Hanh
    Nguyen Thanh Binh
    [J]. PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 78 - 85
  • [26] Prediction of functional outcomes of schizophrenia with genetic biomarkers using a bagging ensemble machine learning method with feature selection
    Lin, Eugene
    Lin, Chieh-Hsin
    Lane, Hsien-Yuan
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [27] Prediction of functional outcomes of schizophrenia with genetic biomarkers using a bagging ensemble machine learning method with feature selection
    Eugene Lin
    Chieh-Hsin Lin
    Hsien-Yuan Lane
    [J]. Scientific Reports, 11
  • [28] An Ensemble Machine Learning Framework for Cotton Crop Yield Prediction Using Weather Parameters: A Case Study of Pakistan
    Haider, Syed Tahseen
    Ge, Wenping
    Li, Jianqiang
    Rehman, Saif Ur
    Imran, Azhar
    Sharaf, Mohamed Abdel Fattah
    Haider, Syed Muhammad
    [J]. IEEE ACCESS, 2024, 12 : 124045 - 124061
  • [29] Android malware classification using optimum feature selection and ensemble machine learning
    Islam, Rejwana
    Sayed, Moinul Islam
    Saha, Sajal
    Hossain, Mohammad Jamal
    Masud, Md Abdul
    [J]. Internet of Things and Cyber-Physical Systems, 2023, 3 : 100 - 111
  • [30] Diabetes Mellitus Disease Prediction Using Machine Learning Classifiers with Oversampling and Feature Augmentation
    Ahamed, B. Shamreen
    Arya, Meenakshi S.
    Nancy, Auxilia Osvin V.
    [J]. ADVANCES IN HUMAN-COMPUTER INTERACTION, 2022, 2022