Computational chromatography: A machine learning strategy for demixing individual chemical components in complex mixtures

被引:19
|
作者
Bajomo, Mary M. [1 ]
Ju, Yilong [1 ]
Zhou, Jingyi [2 ,4 ]
Elefterescu, Simina [3 ]
Farr, Corbin [1 ,5 ]
Zhao, Yiping [6 ]
Neumann, Oara [2 ]
Nordlander, Peter [2 ,7 ]
Patel, Ankit [8 ]
Halas, Naomi J. [1 ,2 ,9 ]
机构
[1] Rice Univ, Dept Chem, Houston, TX 77005 USA
[2] Rice Univ, Lab Nanophoton, Houston, TX 77005 USA
[3] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[4] Rice Univ, Dept Mat Sci & Nanoengn, Houston, TX 77005 USA
[5] Univ Houston, Dept Biochem, Houston, TX 77204 USA
[6] Univ Georgia, Dept Phys & Astron, Athens, GA 30602 USA
[7] Rice Univ, Dept Elect & Comp Engn, Houston, TX 77005 USA
[8] Rice Univ, Dept Phys & Astron, Houston, TX 77005 USA
[9] Baylor Coll Med, Dept Neurosci, Houston, TX 77030 USA
基金
美国国家卫生研究院;
关键词
surface-enhanced Raman scattering; polycyclic aromatic hydrocarbons; machine  learning; nanoparticles; nonnegative matrix factorization; POLYCYCLIC AROMATIC-HYDROCARBONS; ENHANCED RAMAN-SPECTROSCOPY; ALGORITHMS; SERS; SCATTERING;
D O I
10.1073/pnas.2211406119
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Surface-enhanced Raman spectroscopy (SERS) holds exceptional promise as a stream-lined chemical detection strategy for biological and environmental contaminants com-pared with current laboratory methods. Priority pollutants such as polycyclic aromatic hydrocarbons (PAHs), detectable in water and soil worldwide and known to induce multiple adverse health effects upon human exposure, are typically found in multi -component mixtures. By combining the molecular fingerprinting capabilities of SERS with the signal separation and detection capabilities of machine learning (ML), we examine whether individual PAHs can be identified through an analysis of the SERS spectra of multicomponent PAH mixtures. We have developed an unsupervised ML method we call Characteristic Peak Extraction, a dimensionality reduction algorithm that extracts characteristic SERS peaks based on counts of detected peaks of the mixture. By analyzing the SERS spectra of two-component and four-component PAH mixtures where the concentration ratios of the various components vary, this algorithm is able to extract the spectra of each unknown component in the mixture of unknowns, which is then subsequently identified against a SERS spectral library of PAHs. Combining the molecular fingerprinting capabilities of SERS with the signal separation and detection capabilities of ML, this effort is a step toward the computational demixing of unknown chemical components occurring in complex multicomponent mixtures.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Data-Quality-Navigated Machine Learning Strategy with Chemical Intuition to Improve Generalization
    Yang, Songran
    Sun, Ming
    Shi, Chaojie
    Liu, Yiran
    Guo, Yanzhi
    Liu, Yijing
    Lu, Zhiyun
    Huang, Yan
    Pu, Xuemei
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, 20 (23) : 10633 - 10648
  • [42] Multi-objective optimisation with hybrid machine learning strategy for complex catalytic processes
    Tai, Xin Yee
    Ocone, Raffaella
    Christie, Steven D. R.
    Xuan, Jin
    ENERGY AND AI, 2022, 7
  • [43] QMaC: A Quantum Mechanics/Machine Learning-based Computational Tool for Chemical Product Design
    Liu, Qilei
    Tang, Kun
    Zhang, Jinyuan
    Feng, Yixuan
    Xu, Chenyang
    Liu, Linlin
    Du, Jian
    Zhang, Lei
    30TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, PTS A-C, 2020, 48 : 1807 - 1812
  • [44] Regression prediction of tobacco chemical components during curing based on color quantification and machine learning
    Yang Meng
    Qiang Xu
    Guangqing Chen
    Jianjun Liu
    Shuoye Zhou
    Yanling Zhang
    Aiguo Wang
    Jianwei Wang
    Ding Yan
    Xianjie Cai
    Junying Li
    Xuchu Chen
    Qiuying Li
    Qiang Zeng
    Weimin Guo
    Yuanhui Wang
    Scientific Reports, 14 (1)
  • [45] TOXICOLOGIC EVALUATION OF INDIVIDUAL CHEMICAL-COMPOUNDS AND THEIR COMPLEX-MIXTURES USING A MOTILE CELL TEST OBJECT
    KAYUMOV, RI
    ESKOV, AP
    AREFEV, IM
    LAPPO, VG
    ROTENBERG, YS
    BULLETIN OF EXPERIMENTAL BIOLOGY AND MEDICINE, 1988, 105 (01) : 59 - 62
  • [46] Liquid chromatography tandem mass spectrometry of free base alkyl porphyrins for the characterization of the macrocyclic substituents in components of complex mixtures
    Rosell-Melé, A
    Carter, JF
    Maxwell, JR
    RAPID COMMUNICATIONS IN MASS SPECTROMETRY, 1999, 13 (07) : 568 - 573
  • [47] ANALYSIS OF COMPLEX CHEMICAL-MIXTURES BY GAS-CHROMATOGRAPHY FOURIER-TRANSFORM INFRARED-SPECTROSCOPY
    DOUMENQ, P
    GUILIANO, M
    MILLE, G
    ANALUSIS, 1989, 17 (1-2) : 39 - 49
  • [48] Machine Learning Analysis of Raman Spectra To Quantify the Organic Constituents in Complex Organic-Mineral Mixtures
    Zarei, Mahsa
    Solomatova, Natalia V.
    Aghaei, Hoda
    Rothwell, Austin
    Wiens, Jeffrey
    Melo, Luke
    Good, Travis G.
    Shokatian, Sadegh
    Grant, Edward
    ANALYTICAL CHEMISTRY, 2023, 95 (43) : 15908 - 15916
  • [49] A Bagging Strategy-Based Kernel Extreme Learning Machine for Complex Network Intrusion Detection
    Yin, Shoulin
    Li, Hang
    Laghari, Asif Ali
    Karim, Shahid
    Jumani, Awais Khan
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2021, 8 (33)
  • [50] Supervised Machine Learning for Understanding and Improving the Computational Performance of Chemical Production Scheduling MIP Models br
    Kim, Boeun
    Maravelias, Christos T.
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2022, 61 (46) : 17124 - 17136