Computational chromatography: A machine learning strategy for demixing individual chemical components in complex mixtures

被引:19
|
作者
Bajomo, Mary M. [1 ]
Ju, Yilong [1 ]
Zhou, Jingyi [2 ,4 ]
Elefterescu, Simina [3 ]
Farr, Corbin [1 ,5 ]
Zhao, Yiping [6 ]
Neumann, Oara [2 ]
Nordlander, Peter [2 ,7 ]
Patel, Ankit [8 ]
Halas, Naomi J. [1 ,2 ,9 ]
机构
[1] Rice Univ, Dept Chem, Houston, TX 77005 USA
[2] Rice Univ, Lab Nanophoton, Houston, TX 77005 USA
[3] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[4] Rice Univ, Dept Mat Sci & Nanoengn, Houston, TX 77005 USA
[5] Univ Houston, Dept Biochem, Houston, TX 77204 USA
[6] Univ Georgia, Dept Phys & Astron, Athens, GA 30602 USA
[7] Rice Univ, Dept Elect & Comp Engn, Houston, TX 77005 USA
[8] Rice Univ, Dept Phys & Astron, Houston, TX 77005 USA
[9] Baylor Coll Med, Dept Neurosci, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
surface-enhanced Raman scattering; polycyclic aromatic hydrocarbons; machine  learning; nanoparticles; nonnegative matrix factorization; POLYCYCLIC AROMATIC-HYDROCARBONS; ENHANCED RAMAN-SPECTROSCOPY; ALGORITHMS; SERS; SCATTERING;
D O I
10.1073/pnas.2211406119
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Surface-enhanced Raman spectroscopy (SERS) holds exceptional promise as a stream-lined chemical detection strategy for biological and environmental contaminants com-pared with current laboratory methods. Priority pollutants such as polycyclic aromatic hydrocarbons (PAHs), detectable in water and soil worldwide and known to induce multiple adverse health effects upon human exposure, are typically found in multi -component mixtures. By combining the molecular fingerprinting capabilities of SERS with the signal separation and detection capabilities of machine learning (ML), we examine whether individual PAHs can be identified through an analysis of the SERS spectra of multicomponent PAH mixtures. We have developed an unsupervised ML method we call Characteristic Peak Extraction, a dimensionality reduction algorithm that extracts characteristic SERS peaks based on counts of detected peaks of the mixture. By analyzing the SERS spectra of two-component and four-component PAH mixtures where the concentration ratios of the various components vary, this algorithm is able to extract the spectra of each unknown component in the mixture of unknowns, which is then subsequently identified against a SERS spectral library of PAHs. Combining the molecular fingerprinting capabilities of SERS with the signal separation and detection capabilities of ML, this effort is a step toward the computational demixing of unknown chemical components occurring in complex multicomponent mixtures.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Computational material discovery: Machine learning approach to discover new chemical structures
    Takeda, Seiji
    Masuda, Koji
    Hsu, Hsiang Han
    Heroux, Jean
    Hama, Toshiyuki
    Piunova, Victoria
    Pitera, Jed
    Nakano, Daiju
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [22] Computational Fluency and Strategy Choice Predict Individual and Cross-National Differences in Complex Arithmetic
    Vasilyeva, Marina
    Laski, Elida V.
    Shen, Chen
    DEVELOPMENTAL PSYCHOLOGY, 2015, 51 (10) : 1489 - 1500
  • [23] Human cell line toxicity of binary and ternary chemical mixtures in comparison to individual toxic effects of their components
    Malich, G
    Markovic, B
    Winder, C
    ARCHIVES OF ENVIRONMENTAL CONTAMINATION AND TOXICOLOGY, 1998, 35 (03) : 370 - 376
  • [24] Human Cell Line Toxicity of Binary and Ternary Chemical Mixtures in Comparison to Individual Toxic Effects of Their Components
    G. Malich
    B. Markovic
    C. Winder
    Archives of Environmental Contamination and Toxicology, 1998, 35 : 370 - 376
  • [25] Machine Learning Data Augmentation as a Tool to Enhance Quantitative Composition-Activity Relationships of Complex Mixtures. A New Application to Dissect the Role of Main Chemical Components in Bioactive Essential Oils
    Ragno, Alessio
    Baldisserotto, Anna
    Antonini, Lorenzo
    Sabatino, Manuela
    Sapienza, Filippo
    Baldini, Erika
    Buzzi, Raissa
    Vertuani, Silvia
    Manfredini, Stefano
    MOLECULES, 2021, 26 (20):
  • [26] A machine learning methodology for reliability evaluation of complex chemical production systems
    Zhao, Fanrui
    Wu, Jinkui
    Zhao, Yuanpei
    Ji, Xu
    Zhou, Li
    Sun, Zhongping
    RSC ADVANCES, 2020, 10 (34) : 20374 - 20384
  • [27] Physiologically based modeling of the maximal effect of metabolic interactions on the kinetics of components of complex chemical mixtures
    Haddad, S
    Charest-Tardif, G
    Krishnan, K
    JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH-PART A, 2000, 61 (03) : 209 - 223
  • [28] CHEMICAL IONIZATION MASS-SPECTROMETRY FOR SELECTIVE DETECTION AND ANALYSIS OF COMPONENTS IN COMPLEX-MIXTURES
    MUNSON, B
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1989, 197 : 52 - PETR
  • [29] Leveraging high-throughput molecular simulations and machine learning for the design of chemical mixtures
    Chew, Alex K.
    Afzal, Mohammad Atif Faiz
    Kaplan, Zachary
    Collins, Eric M.
    Gattani, Suraj
    Misra, Mayank
    Chandrasekaran, Anand
    Leswing, Karl
    Halls, Mathew D.
    NPJ COMPUTATIONAL MATERIALS, 2025, 11 (01)
  • [30] Quantitative analysis of pyrolysis characteristics and chemical components of tobacco materials based on machine learning
    Wu, Zhifeng
    Zhang, Qi
    Yu, Hongxiao
    Fu, Lili
    Yang, Zhen
    Lu, Yan
    Guo, Zhongya
    Li, Yasen
    Zhou, Xiansheng
    Liu, Yingjie
    Wang, Le
    FRONTIERS IN CHEMISTRY, 2024, 12