Prediction of Quality of Water According to a Random Forest Classifier

被引:0
|
作者
Alomani, Shahd Maadi [1 ]
Alhawiti, Najd Ibrahim [1 ]
Alhakamy, A'aeshah [1 ,2 ,3 ]
机构
[1] Univ Tabuk, Master Artificial Intelligence, Fac Comp & Informat Technol, Tabuk, Saudi Arabia
[2] Univ Tabuk, Ind Innovat & Robot Ctr IIRC, Tabuk, Saudi Arabia
[3] Univ Tabuk, Fac Comp & Informat Technol, Dept Comp Sci, Tabuk, Saudi Arabia
关键词
Big data; machine learning; classification; random forest; water quality; PySpark;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Potable or drinking water is a daily life necessity for humans. The safety of this water is a concern in many regions around the world, since polluted waters are increasing and causing the spread of disease among populations. Continuous management and evaluation of the water which is meant for drinking is very essential and must be taken seriously. Often, the quality of water is evaluated through regular laboratory testing and analysis which can be tiresome and time consuming. On the other hand, advanced technologies using big data with the help of machine learning can have better results in terms of potability evaluation. For this reason, several studies have been conducted on predicting the quality of water and the several factors and classification that affect the prediction model. In this study, a random forest model was developed using PySpark classification to predict the potability of river water by relying on ten different features: pH, hardness, presence of solids, presence of chloramines, presence of sulfate, conductivity, organic carbon, trihalomethanes, turbidity, and finally potability. In addition, The developed model was able to predict water potability classification with a 1.0 accuracy, and 1.0 F1-score.
引用
收藏
页码:892 / 899
页数:8
相关论文
共 50 条
  • [1] Outlier Prediction Using Random Forest Classifier
    Mohandoss, Divya Pramasani
    Shi, Yong
    Suo, Kun
    [J]. 2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 27 - 33
  • [2] Prediction with Confidence Based on a Random Forest Classifier
    Devetyarov, Dmitry
    Nouretdinov, Ilia
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 37 - 44
  • [3] Prediction of Incident Delirium Using a Random Forest classifier
    John P. Corradi
    Stephen Thompson
    Jeffrey F. Mather
    Christine M. Waszynski
    Robert S. Dicks
    [J]. Journal of Medical Systems, 2018, 42
  • [4] Prediction of Incident Delirium Using a Random Forest classifier
    Corradi, John P.
    Thompson, Stephen
    Mather, Jeffrey F.
    Waszynski, Christine M.
    Dicks, Robert S.
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (12)
  • [5] Football Match Result Prediction Using the Random Forest Classifier
    Pugsee, Pakawan
    Pattawong, Pattarachai
    [J]. PROCEEDINGS OF 2019 2ND INTERNATIONAL CONFERENCE ON BIG DATA TECHNOLOGIES (ICBDT 2019), 2019, : 154 - 158
  • [6] A RANDOM FOREST CLASSIFIER FOR THE PREDICTION OF TESTOSTERONE DEFICIENCY IN THE COMMUNITY SETTING
    Novaes, M.
    Carvalho, O. L.
    Tiraboschi, T.
    Ferreira, P. H.
    SilvaS, C.
    Zambrano, J. C.
    Ribeiro, A. P.
    Gomes, C.
    Miranda, E.
    Bessa Jr, J.
    [J]. JOURNAL OF SEXUAL MEDICINE, 2021, 18 (03): : S42 - S42
  • [7] Default Risk Prediction Using Random Forest and XGBoosting Classifier
    Sharma, Alok Kumar
    Li, Li-Hua
    Ahmad, Ramli
    [J]. 2021 INTERNATIONAL CONFERENCE ON SECURITY AND INFORMATION TECHNOLOGIES WITH AI, INTERNET COMPUTING AND BIG-DATA APPLICATIONS, 2023, 314 : 91 - 101
  • [8] A Random Forest Based Classifier for Error Prediction of Highly Individualized Products
    Groener, Gerd
    [J]. MACHINE LEARNING FOR CYBER PHYSICAL SYSTEMS, ML4CPS 2018, 2019, 9 : 26 - 35
  • [9] Link quality prediction based on random forest
    Liu, Linlan
    Gao, Shengrong
    Shu, Jian
    [J]. Tongxin Xuebao/Journal on Communications, 2019, 40 (04): : 202 - 211
  • [10] Thresholding a Random Forest Classifier
    Baumann, Florian
    Li, Fangda
    Ehlers, Arne
    Rosenhahn, Bodo
    [J]. ADVANCES IN VISUAL COMPUTING (ISVC 2014), PT II, 2014, 8888 : 95 - 106