Machine learning-based quantitative structure–retention relationship models for predicting the retention indices of volatile organic pollutants

被引:0
|
作者
B. Sepehri
R. Ghavami
S. Farahbakhsh
R. Ahmadi
机构
[1] University of Kurdistan,Chemometrics Laboratory, Department of Chemistry, Faculty of Science
关键词
Quantitative structure–retention relationship; Volatile organic compounds; Epsilon-support vector regression; Deep learning; R software;
D O I
暂无
中图分类号
学科分类号
摘要
In this research, a dataset including 206 volatile organic compounds was used to develop quantitative structure–retention relationship models for predicting the retention indices of volatile organic compounds on DB-5 stationary phase. A total of 141 molecules were put in train set to build models and 65 molecules were put in test set to validate models, externally. By using stepwise-multiple linear regression, two descriptors including X1sol (solvation connectivity index chi-1) and AAC (mean information index on atomic composition) were selected to create linear and nonlinear quantitative structure–retention relationship models. Multiple linear regression, epsilon-support vector regression and deep learning-based artificial neural network were used as modeling techniques. All models were validated by calculating several statistical parameters for both train and test sets that show created models have high predictive power. R2 values for the test set of multiple linear regression, epsilon-support vector regression and deep learning-based artificial neural network models were 0.90, 0.94 and 0.94, respectively. Results show the Van der Waals interactions of molecules with methyl groups in DB-5 stationary phase and the electrostatic interactions of atoms with partial negative charge in molecules with the hydrogen atoms of phenyl groups in DB-5 stationary phase are responsible for the separation of volatile organic compounds in DB-5 stationary phase. Finally, these created models were used to predict the retention indices of 694 volatile organic compounds that had no retention index data on DB-5 stationary phase.
引用
收藏
页码:1457 / 1466
页数:9
相关论文
共 50 条
  • [41] Quantitative Structure-retention Relationship Study of Volatile Components from Rosa Banksiae Ait
    程利平
    包晓净
    王根礼
    结构化学, 2012, 31 (08) : 1201 - 1211
  • [42] Improvement of quantitative structure-retention relationship models for chromatographic retention prediction of peptides applying individual local partial least squares models
    Andries, Jan P. M.
    Goodarzi, Mohammad
    Vander Heyden, Yvan
    TALANTA, 2020, 219
  • [43] Development of machine learning-based quantitative structure-activity relationship models for predicting plasma half-lives of drugs in six common food animal species
    Wu, Pei-Yu
    Chou, Wei-Chun
    Wu, Xue
    Kamineni, Venkata N.
    Kuchimanchi, Yashas
    Tell, Lisa A.
    Maunsell, Fiona P.
    Lin, Zhoumeng
    TOXICOLOGICAL SCIENCES, 2024, 203 (01) : 52 - 66
  • [44] Prediction of retention data of phenolic compounds by quantitative structure retention relationship models under reverse-phase liquid chromatography
    Vinci, Roberto Lagana
    Arena, Katia
    Rigano, Francesca
    Cacciola, Francesco
    Dugo, Paola
    Mondello, Luigi
    JOURNAL OF CHROMATOGRAPHY A, 2024, 1730
  • [45] Quantitative structure-retention relationship for the Kovats retention indices of a large set of terpenes: A combined data splitting-feature selection strategy
    Hemmateenejad, Bahram
    Javadnia, Katayoun
    Elyasi, Maryam
    ANALYTICA CHIMICA ACTA, 2007, 592 (01) : 72 - 81
  • [46] Cross-column density functional theory-based quantitative structure-retention relationship model development powered by machine learning
    Mazraedoost, Sargol
    Zuvela, Petar
    Ulenberg, Szymon
    Baczek, Tomasz
    Liu, J. Jay
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2024, 416 (12) : 2951 - 2968
  • [47] Exploration and Evaluation of Machine Learning-Based Models for Predicting Enzymatic Reactions
    Watanabe, Naoki
    Murata, Masahiro
    Ogawa, Teppei
    Vavricka, Christopher J.
    Kondo, Akihiko
    Ogino, Chiaki
    Araki, Michihiro
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (03) : 1833 - 1843
  • [48] Machine learning-based models for genomic predicting neoadjuvant Machine learning-based models for genomic predicting neoadjuvant chemotherapeutic sensitivity in cervical cancer chemotherapeutic sensitivity in cervical cancer
    Guo, Lu
    Wang, Wei
    Xie, Xiaodong
    Wang, Shuihua
    Zhang, Yudong
    BIOMEDICINE & PHARMACOTHERAPY, 2023, 159
  • [49] Quantitative structure-retention relationship studies for predicting the gas chromatography retention indices of polycyclic aromatic hydrocarbons - Quasi-length of carbon chain and pseudo-conjugated system surface
    Kang, JJ
    Cao, CZ
    Li, ZL
    JOURNAL OF CHROMATOGRAPHY A, 1998, 799 (1-2) : 361 - 367
  • [50] Quantitative structure-retention relationship study on the binding of organic solvents to the corn protein, zein
    Zagyi, M.
    Cserhati, T.
    JOURNAL OF LIQUID CHROMATOGRAPHY & RELATED TECHNOLOGIES, 2007, 30 (03) : 351 - 362