Machine Learning Models of Arsenic in Private Wells Throughout the Conterminous United States As a Tool for Exposure Assessment in Human Health Studies

被引:60
|
作者
Lombard, Melissa A. [11 ]
Bryan, Molly Scannell [1 ]
Jones, Daniel K. [2 ]
Bulka, Catherine [3 ]
Bradley, Paul M. [4 ]
Backer, Lorraine C. [5 ]
Focazio, Michael J. [6 ]
Silverman, Debra T. [7 ]
Toccalino, Patricia [8 ]
Argos, Maria [9 ]
Gribble, Matthew O. [10 ]
Ayotte, Joseph D. [11 ]
机构
[1] Univ Illinois, Inst Minor Hlth Res, Chicago, IL 60612 USA
[2] US Geol Survey, Utah Water Sci Ctr, West Valley City, UT 84119 USA
[3] Univ N Carolina, Chapel Hill, NC 27599 USA
[4] US Geol Survey, South Atlantic Water Sci Ctr, Columbia, SC 29210 USA
[5] Ctr Dis Control & Prevent, Natl Ctr Environm Hlth, Chamblee, GA 30341 USA
[6] US Geol Survey, Tox Subst Hydrol Program, Reston, VA 20192 USA
[7] NCI, Occupat & Environm Epidemiol Branch, Rockville, MD 20850 USA
[8] US Geol Survey, Northwest Pacific Isl Reg, Portland, OR 97232 USA
[9] Univ Illinois, Sch Publ Hlth, Chicago, IL 60612 USA
[10] Emory Univ, Gangarosa Dept Environm Hlth, Atlanta, GA 30322 USA
[11] US Geol Survey, New England Water Sci Ctr, Pembroke, NH 03275 USA
关键词
BLADDER-CANCER MORTALITY; DRINKING-WATER WELLS; CENTRAL VALLEY; CONTIGUOUS US; NEW-ENGLAND; GROUNDWATER; CONTAMINATION; ASSOCIATION; VARIABILITY; MECHANISMS;
D O I
10.1021/acs.est.0c05239
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Arsenic from geologic sources is widespread in groundwater within the United States (U.S.). In several areas, groundwater arsenic concentrations exceed the U.S. Environmental Protection Agency maximum contaminant level of 10 mu g per liter (mu g/L). However, this standard applies only to public-supply drinking water and not to private-supply, which is not federally regulated and is rarely monitored. As a result, arsenic exposure from private wells is a potentially substantial, but largely hidden, public health concern. Machine learning models using boosted regression trees (BRT) and random forest classification (RFC) techniques were developed to estimate probabilities and concentration ranges of arsenic in private wells throughout the conterminous U.S. Three BRT models were fit separately to estimate the probability of private well arsenic concentrations exceeding 1, 5, or 10 mu g/L whereas the RFC model estimates the most probable category (<= 5, >5 to <= 10, or >10 mu g/L). Overall, the models perform best at identifying areas with low concentrations of arsenic in private wells. The BRT 10 mu g/L model estimates for testing data have an overall accuracy of 91.2%, sensitivity of 33.9%, and specificity of 98.2%. Influential variables identified across all models included average annual precipitation and soil geochemistry. Models were developed in collaboration with public health experts to support U.S.-based studies focused on health effects from arsenic exposure.
引用
收藏
页码:5012 / 5023
页数:12
相关论文
共 9 条
  • [1] Assessing the Impact of Drought on Arsenic Exposure from Private Domestic Wells in the Conterminous United States
    Lombard, Melissa A.
    Daniel, Johnni
    Jeddy, Zuha
    Hay, Lauren E.
    Ayotte, Joseph D.
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2021, 55 (03) : 1822 - 1831
  • [2] Assessing the Impact of Drought on Arsenic Exposure from Private Domestic Wells in the Conterminous United States
    Lombard, Melissa A.
    Daniel, Johnni
    Jeddy, Zuha
    Hay, Lauren E.
    Ayotte, Joseph D.
    Environmental Science and Technology, 2021, 55 (03): : 1822 - 1831
  • [3] Arsenic in the shallow ground waters of conterminous United States: Assessment, health risks, and costs for MCL compliance
    Twarakavi, NKC
    Kaluarachchi, JJ
    JOURNAL OF THE AMERICAN WATER RESOURCES ASSOCIATION, 2006, 42 (02): : 275 - 294
  • [4] Evaluation of twelve evapotranspiration products from machine learning, remote sensing and land surface models over conterminous United States
    Xu, Tongren
    Guo, Zhixia
    Xia, Youlong
    Ferreira, Vagner G.
    Liu, Shaomin
    Wang, Kaicun
    Yao, Yunjun
    Zhang, Xiaojuan
    Zhao, Changsen
    JOURNAL OF HYDROLOGY, 2019, 578
  • [5] At the crossroads: Hazard assessment and reduction of health risks from arsenic in private well waters of the northeastern United States and Atlantic Canada
    Zheng, Yan
    Ayotte, Joseph D.
    SCIENCE OF THE TOTAL ENVIRONMENT, 2015, 505 : 1237 - 1247
  • [6] Development of a Machine Learning Modeling Tool for Predicting Human Immunodeficiency Virus Incidence Using Public Health Data From a County in the Southern United States
    Saldana, Carlos S.
    Burkhardt, Elizabeth
    Pennisi, Alfred
    Oliver, Kirsten
    Olmstead, John
    Holland, David P.
    Gettings, Jenna
    Mauck, Daniel
    Austin, David
    Wortley, Pascale
    Saldana Ochoa, Karla, V
    CLINICAL INFECTIOUS DISEASES, 2024, 79 (03) : 717 - 726
  • [7] The role of decision tree and machine learning models for outcome prediction of bupropion exposure: A nationwide analysis of more than 14 000 patients in the United States
    Mehrpour, Omid
    Saeedi, Farhad
    Vohra, Varun
    Abdollahi, Jafar
    Shirazi, Farshad M.
    Goss, Foster
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2023, 133 (01) : 98 - 110
  • [8] Techniques to Improve Ecological Interpretability of Black-Box Machine Learning Models: Case Study on Biological Health of Streams in the United States with Gradient Boosted Trees
    Welchowski T.
    Maloney K.O.
    Mitchell R.
    Schmid M.
    Journal of Agricultural, Biological and Environmental Statistics, 2022, 27 (1) : 175 - 197
  • [9] Enhancing Predictive Models for Assessing 5G Exposure Effects on Human Health and Cognition through Supervised Machine Learning: A Multi-Stage Feature Selection Approach
    Sofri, Tasneem
    Andrew, Allan Melvin
    Rahim, Hasliza A.
    Nishizaki, Hiromitsu
    Kamarudin, Latifah Munirah
    Wong, Peng Wen
    Soh, Ping Jack
    PRZEGLAD ELEKTROTECHNICZNY, 2024, 100 (06): : 122 - 128