Improved Machine Learning Models by Data Processing for Predicting Life-Cycle Environmental Impacts of Chemicals

被引:14
|
作者
You, Shijie [1 ]
Sun, Ye [1 ]
Wang, Xiuheng [1 ]
Ren, Nanqi [1 ]
Liu, Yanbiao [2 ]
机构
[1] Harbin Inst Technol, Sch Environm, State Key Lab Urban Water Resource & Environm, Harbin 150090, Peoples R China
[2] Donghua Univ, Coll Environm Sci & Engn, Text Pollut Controlling Engn Ctr Minist Ecol & Env, Shanghai 201620, Peoples R China
基金
中国国家自然科学基金;
关键词
life cycle assessment (LCA); machine learning; data processing; feature selection; weighted Euclidean distance; FEATURE-SELECTION; NEURAL-NETWORK;
D O I
10.1021/acs.est.2c04945
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Machine learning (ML) provides an efficient manner for rapid prediction of the life-cycle environmental impacts of chemicals, but challenges remain due to low prediction accuracy and poor interpretability of the models. To address these issues, we focused on data processing by using a mutual information-permutation importance (MI-PI) feature selection method to filter out irrelevant molecular descriptors from the input data, which improved the model interpretability by preserving the physicochemical meanings of original molecular descriptors without generation of new variables. We also applied a weighted Euclidean distance method to mine the data most relevant to the predicted targets by quantifying the contribution of each feature, thereby the prediction accuracy was improved. On the basis of above data processing, we developed artificial neural network (ANN) models for predicting the life-cycle environmental impacts of chemicals with R2 values of 0.81, 0.81, 0.84, 0.75, 0.73, and 0.86 for global warming, human health, metal depletion, freshwater ecotoxicity, particulate matter formation, and terrestrial acidification, respectively. The ML models were interpreted using the Shapley additive explanation method by quantifying the contribution of each input molecular descriptor to environmental impact categories. This work suggests that the combination of feature selection by MI-PI and source data selection based on weighted Euclidean distance has a promising potential to improve the accuracy and interpretability of the models for predicting the life-cycle environmental impacts of chemicals.
引用
收藏
页码:3434 / 3444
页数:11
相关论文
共 50 条
  • [31] Principles for the development and use of benchmarks for life-cycle related environmental impacts of buildings
    Luetzkendorf, T.
    Balouktsi, M.
    LIFE-CYCLE ANALYSIS AND ASSESSMENT IN CIVIL ENGINEERING: TOWARDS AN INTEGRATED VISION, 2019, : 783 - 790
  • [32] The Life-cycle Assessment and Environmental Impacts of Electricity Production in Porto Santo Island
    Torabi, Roham
    Rizzoli, Nicolo
    Arosio, Valeria
    Morgado-Dias, F.
    2017 INTERNATIONAL CONFERENCE IN ENERGY AND SUSTAINABILITY IN SMALL DEVELOPING ECONOMIES (ES2DE), 2017,
  • [33] Assessing relationships among life-cycle environmental impacts with dimension reduction techniques
    Gutierrez, Ester
    Lozano, Sebastian
    Moreira, M. Teresa
    Feijoo, Gumersindo
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2010, 91 (04) : 1002 - 1011
  • [34] Life-cycle assessment of airport pavement design alternatives for energy and environmental impacts
    Wang, Hao
    Thakkar, Chinmay
    Chen, Xiaodan
    Murrel, Scott
    JOURNAL OF CLEANER PRODUCTION, 2016, 133 : 163 - 171
  • [35] Analytical Review of Life-Cycle Environmental Impacts of Carbon Capture and Utilization Technologies
    Garcia-Garcia, Guillermo
    Fernandez, Marta Cruz
    Armstrong, Katy
    Woolass, Steven
    Styring, Peter
    CHEMSUSCHEM, 2021, 14 (04) : 995 - 1015
  • [36] Evaluating environmental impacts of pig slurry treatment technologies with a life-cycle perspective
    Yuan, Zengwei
    Pan, Xiao
    Chen, Tianming
    Liu, Xuewei
    Zhang, You
    Jiang, Songyan
    Sheng, Hu
    Zhang, Ling
    JOURNAL OF CLEANER PRODUCTION, 2018, 188 : 840 - 850
  • [37] Assessing the life-cycle environmental impacts of the wood pallet sector in the United States
    Alanya-Rosenbaum, S.
    Bergman, R. D.
    Gething, B.
    JOURNAL OF CLEANER PRODUCTION, 2021, 320
  • [38] Multi-objective Optimization of Product Life-Cycle Costs and Environmental Impacts
    Cerri, Daniele
    Taisch, Marco
    Terzi, Sergio
    ADVANCES IN PRODUCTION MANAGEMENT SYSTEMS: COMPETITIVE MANUFACTURING FOR INNOVATIVE PRODUCTS AND SERVICES, AMPS 2012, PT I, 2013, 397 : 391 - 396
  • [39] Considerations in assessing environmental impacts of essential metals in life-cycle impact analysis
    van Tilborg, W
    Van Assche, F
    Cook, M
    LIFE-CYCLE ASSESSMENT OF METALS: ISSUES AND RESEARCH DIRECTIONS, 2003, : 220 - 223
  • [40] An environmental life-cycle design tool for assessing impacts of CRT and LCD monitors
    Socolof, ML
    Swanson, MB
    Kincaid, LE
    Overly, JG
    Hart, KM
    Singh, D
    PROCEEDINGS OF THE 1999 IEEE INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND THE ENVIRONMENT, ISEE - 1999, 1999, : 232 - 237