Predicting emerging chemical content in consumer products using machine learning

被引:1
|
作者
Thornton, Luka Lila [1 ,2 ]
Carlson, David E. [1 ,3 ]
Wiesner, Mark R. [1 ,2 ]
机构
[1] Duke Univ, Dept Civil & Environm Engn, 121 Hudson Hall, Durham, NC 27708 USA
[2] Ctr Environm Implicat NanoTechnol CEINT, Durham, NC USA
[3] Duke Univ, Med Ctr, Dept Biostat & Bioinformat, 2424 Erwin Rd, Suite 1102 Hock Plaza, Durham, NC 27710 USA
基金
美国国家科学基金会;
关键词
Exposure modeling; Chemical function; Nanomaterials; Arti ficial intelligence; Cheminformatics; Environmental exposure; Consumer product safety; Nanotechnology; NANOPARTICLES;
D O I
10.1016/j.scitotenv.2022.154849
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
their consequent risk, we need to know their concentrations in products, or chemical weight fractions. Unfortunately, manufacturers rarely report comprehensive weight fraction data on product labels. The goal of this study was to evaluate the utility of machine learning strategies for predicting weight fractions when chemical constituent data are limited. A "data-poor" framework was developed and tested using a small dataset on consumer products containing engineered nanomaterials to represent emerging substances. A second, more traditional framework was applied to a "data-rich" product dataset comprised of bulk-scale organic chemicals for comparison purposes. Feature variables included chemical properties, functional use categories (e.g., antimicrobial), product categories (e.g., makeup), product matrix categories, and whether weight fractions were manufacturer-reported or experimentally obtained. Classification into three weight fraction bins was done using a random forest or nonlinear support vector classifier. An ablation study revealed that functional use data improved predictive performance when included alongside chemical property data, suggesting the utility of functional use categories in evaluating the safety and sustainability of emerging chemicals. Models could roughly stratify material-product observations into order of magnitude weight fractions with moderate success; the best of these achieved an average balanced accuracy of 73% on the nanomaterials product data. Framework comparisons also revealed a positive trend in sample size versus average balanced accuracy, suggesting great promise for machine learning approaches with continued investment in chemical data collection.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] A machine learning approach to predicting psychosis using semantic density and latent content analysis
    Neguine Rezaii
    Elaine Walker
    Phillip Wolff
    npj Schizophrenia, 5
  • [22] Predicting moisture content and husked rice yield using electrical parameters and machine learning
    Deng, An
    Xu, Yongyang
    Qiu, Weiqiang
    Li, Li
    Jin, Yinzhe
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2024, 40 (15): : 245 - 252
  • [23] Predicting moisture content during maize nixtamalization using machine learning with NIR spectroscopy
    Burns, Michael J.
    Renk, Jonathan S.
    Eickholt, David P.
    Gilbert, Amanda M.
    Hattery, Travis J.
    Holmes, Mark
    Anderson, Nickolas
    Waters, Amanda J.
    Kalambur, Sathya
    Flint-Garcia, Sherry A.
    Yandeau-Nelson, Marna D.
    Annor, George A.
    Hirsch, Candice N.
    THEORETICAL AND APPLIED GENETICS, 2021, 134 (11) : 3743 - 3757
  • [24] Predicting moisture content during maize nixtamalization using machine learning with NIR spectroscopy
    Michael J. Burns
    Jonathan S. Renk
    David P. Eickholt
    Amanda M. Gilbert
    Travis J. Hattery
    Mark Holmes
    Nickolas Anderson
    Amanda J. Waters
    Sathya Kalambur
    Sherry A. Flint-Garcia
    Marna D. Yandeau-Nelson
    George A. Annor
    Candice N. Hirsch
    Theoretical and Applied Genetics, 2021, 134 : 3743 - 3757
  • [25] Consumer Preference Elicitation of Complex Products Using Fuzzy Support Vector Machine Active Learning
    Huang, Dongling
    Luo, Lan
    MARKETING SCIENCE, 2016, 35 (03) : 445 - 464
  • [26] Predicting the neutral hydrogen content of galaxies from optical data using machine learning
    Rafieferantsoa, Mika
    Andrianomena, Sambatra
    Dave, Romeel
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2018, 479 (04) : 4509 - 4525
  • [27] A machine learning approach to predicting psychosis using semantic density and latent content analysis
    Rezaii, Neguine
    Walker, Elaine
    Wolff, Phillip
    NPJ SCHIZOPHRENIA, 2019, 5 (1):
  • [28] Consumer product prediction using machine learning
    Ajitha, P.
    Tamilvizhi, T.
    Sowjanya, K. Naga
    Surendran, R.
    Bala, Bhoomeshwar
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2023, 44 (03): : 565 - 574
  • [29] Predicting Risk of 30-Day Readmissions Using Two Emerging Machine Learning Methods
    Mahajan, Satish M.
    Mahajan, Amey S.
    King, Robert
    Negahban, Sahand
    NURSING INFORMATICS 2018: ICT TO IMPROVE QUALITY AND SAFETY AT THE POINT OF CARE, 2018, 250 : 250 - 255
  • [30] Predicting Chemical Reaction Barriers with a Machine Learning Model
    Singh, Aayush R.
    Rohr, Brian A.
    Gauthier, Joseph A.
    Norskov, Jens K.
    CATALYSIS LETTERS, 2019, 149 (09) : 2347 - 2354