Identifying schools at high-risk for elevated lead in drinking water using only publicly available data

被引:15
|
作者
Lobo, G. P. [1 ]
Laraway, J. [2 ]
Gadgil, A. J. [1 ]
机构
[1] Univ Calif Berkeley, Dept Civil & Environm Engn, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Environm Sci Policy & Management, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
Lead in school drinking water; Lead leaching; Machine learning; Environmental justice; Public data mining; SUPPLY SYSTEMS; TAP WATER; CORROSION; VARIABILITY; PB; MONOCHLORAMINE; ORTHOPHOSPHATE; DISINFECTION; BRASS; FULL;
D O I
10.1016/j.scitotenv.2021.150046
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Estimating the risk of lead contamination of schools' drinking water at the State level is a complex, important, and unexplored challenge. Variable water quality among water systems and changes in water chemistry during distribution affect lead dissolution rates from pipes and fittings. In addition, the locations of lead-bearing plumbing materials are uncertain. We tested the capability of six machine learning models to predict the likelihood of lead contamination of drinking water at the schools' taps using only publicly available datasets. The predictive features used in the models correspond to those with a proven correlation to the dominant, but commonly unavailable, factors that govern lead leaching: the presence of lead-bearing plumbing materials and water quality conducive to lead corrosion. By combining water chemistry data from public reports, socioeconomic information from the US census, and spatial features using Geographic Information Systems, we trained and tested models to estimate the likelihood of lead contaminated tap water in over 8,000 schools across California and Massachusetts. Our best-performing model was a Random Forest, with a 10-fold cross validation score of 0.88 for Massachusetts and 0.78 for California using the average Area Under the Receiver Operating Characteristic Curve (ROC AUC) metric. The model was then used to assign a lead leaching risk category to half of the schools across California (the other half was used for training). There was good agreement between the modeled risk categories and the actual lead leaching outcomes for every school; however, the model overestimated the lead leaching risk in up to 17% of the schools. This model is the first of its kind to offer a tool to predict the risk of lead leaching in schools at the State level. Further use of this model can help deploy limited resources more effectively to prevent childhood lead exposure from school drinking water. (c) 2021 Published by Elsevier B.V.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Transition age youth in publicly funded systems: Identifying high-risk youth for policy planning and improved service delivery
    Heflinger, Craig Anne
    Hoffman, Cheri
    JOURNAL OF BEHAVIORAL HEALTH SERVICES & RESEARCH, 2008, 35 (04): : 390 - 401
  • [22] Identifying high-risk commercial vehicle drivers using sociodemographic characteristics
    Sagar, Shraddha
    Stamatiadis, Nikiforos
    Wright, Samantha
    Cambron, Aaron
    ACCIDENT ANALYSIS AND PREVENTION, 2020, 143 (143):
  • [23] Transition Age Youth in Publicly Funded Systems: Identifying High-Risk Youth for Policy Planning and Improved Service Delivery
    Craig Anne Heflinger
    Cheri Hoffman
    The Journal of Behavioral Health Services & Research, 2008, 35 : 390 - 401
  • [24] Identifying electrode bridging from electrical distance distributions: A survey of publicly-available EEG data using a new method
    Alschuler, Daniel M.
    Tenke, Craig E.
    Bruder, Gerard E.
    Kayser, Juergen
    CLINICAL NEUROPHYSIOLOGY, 2014, 125 (03) : 484 - 490
  • [25] Quantification of unreported water use for supplemental crop irrigation in humid climates using publicly available agricultural data
    Sangha, Laljeet
    Shortridge, Julie
    AGRICULTURAL WATER MANAGEMENT, 2023, 287
  • [27] High-risk clones of Pseudomonas aeruginosa contaminate the drinking water networks of French cities
    Horikian, Ani
    Jeanvoine, Audrey
    Amarache, Abdallah
    Tourtet, Morgane
    Ory, Jerome
    Boulestreau, Helene
    van der Mee Marquet, Nathalie
    Lemaitre, Nadine
    Eveillard, Matthieu
    Lepelletier, Didier
    Bertrand, Xavier
    Valot, Benoit
    Hocquet, Didier
    NPJ CLEAN WATER, 2024, 7 (01)
  • [28] Effect of various drinking water on human micronucleus frequency in high-risk population of PHC
    Liu, E
    Zhang, QN
    Li, WG
    WORLD JOURNAL OF GASTROENTEROLOGY, 1998, 4 (02) : 183 - 184
  • [29] Identifying subgroups at high-risk of ICI associated cardiac adverse events: An approach using topological data analysis
    Heilbroner, Samuel
    ANNALS OF ONCOLOGY, 2021, 32 : S232 - S232
  • [30] High-Risk Lead Extraction Using a Hybrid Approach: The Blade and the Lightsaber
    Koneru, Jayanthi N.
    Ellenbogen, Kenneth A.
    JOURNAL OF CARDIOVASCULAR ELECTROPHYSIOLOGY, 2014, 25 (06) : 622 - 623