Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure-Activity Modeling and Dataset Comparison

被引:104
|
作者
Kireeva, N. [1 ,2 ]
Baskin, I. I. [1 ,3 ]
Gaspar, H. A. [1 ]
Horvath, D. [1 ]
Marcou, G. [1 ]
Varnek, A. [1 ]
机构
[1] Univ Strasbourg, Lab Infochim, CNRS, UMR 7177, F-67000 Strasbourg, France
[2] Inst Phys Chem & Electrochem RAS, Moscow 119991, Russia
[3] Moscow MV Lomonosov State Univ, Dept Chem, Moscow 119991, Russia
关键词
Generative topographic maps; Dimensionality reduction; Manifold learning; Data visualization; Predicting activity profiles; Comparison of databases; Bhattacharyya kernel; LIKELIHOOD;
D O I
10.1002/minf.201100163
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Here, the utility of Generative Topographic Maps (GTM) for data visualization, structure-activity modeling and database comparison is evaluated, on hand of subsets of the Database of Useful Decoys (DUD). Unlike other popular dimensionality reduction approaches like Principal Component Analysis, Sammon Mapping or Self-Organizing Maps, the great advantage of GTMs is providing data probability distribution functions (PDF), both in the high-dimensional space defined by molecular descriptors and in 2D latent space. PDFs for the molecules of different activity classes were successfully used to build classification models in the framework of the Bayesian approach. Because PDFs are represented by a mixture of Gaussian functions, the Bhattacharyya kernel has been proposed as a measure of the overlap of datasets, which leads to an elegant method of global comparison of chemical libraries.
引用
收藏
页码:301 / 312
页数:12
相关论文
共 35 条
  • [31] Comparison between 5,10,15,20-tetraaryl- and 5,15-diarylporphyrins as photosensitizers: Synthesis, photodynamic activity, and quantitative structure-activity relationship modeling
    Banfi, Stefano
    Caruso, Enrico
    Buccafurni, Loredana
    Murano, Roberto
    Monti, Elena
    Gariboldi, Marzia
    Papa, Ester
    Gramatica, Paola
    JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (11) : 3293 - 3304
  • [32] Application of quantitative structure-activity relationships (QSARs) for modeling drug and chemical transport across the human placenta barrier: A multivariate data analysis approach
    Giaginis, Costas
    Zira, Athina
    Tsantili-Kakoulidou, Anna
    Theocharis, Stamatios
    TOXICOLOGY LETTERS, 2009, 189 : S258 - S258
  • [33] Modeling Chemical Interaction Profiles: II. Molecular Docking, Spectral Data-Activity Relationship, and Structure-Activity Relationship Models for Potent and Weak Inhibitors of Cytochrome P450 CYP3A4 Isozyme
    Tie, Yunfeng
    McPhail, Brooks
    Hong, Huixiao
    Pearce, Bruce A.
    Schnackenberg, Laura K.
    Ge, Weigong
    Buzatu, Dan A.
    Wilkes, Jon G.
    Fuscoe, James C.
    Tong, Weida
    Fowler, Bruce A.
    Beger, Richard D.
    Demchuk, Eugene
    MOLECULES, 2012, 17 (03) : 3407 - 3460
  • [34] Modeling Chemical Interaction Profiles: I. Spectral Data-Activity Relationship and Structure-Activity Relationship Models for Inhibitors and Non-inhibitors of Cytochrome P450 CYP3A4 and CYP2D6 Isozymes
    McPhail, Brooks
    Tie, Yunfeng
    Hong, Huixiao
    Pearce, Bruce A.
    Schnackenberg, Laura K.
    Ge, Weigong
    Valerio, Luis G., Jr.
    Fuscoe, James C.
    Tong, Weida
    Buzatu, Dan A.
    Wilkes, Jon G.
    Fowler, Bruce A.
    Demchuk, Eugene
    Beger, Richard D.
    MOLECULES, 2012, 17 (03): : 3383 - 3406
  • [35] Deep Learning-Based Structure-Activity Relationship Modeling for Multi-Category Toxicity Classification: A Case Study of 10K Tox21 Chemicals With High-Throughput Cell-Based Androgen Receptor Bioassay Data
    Idakwo, Gabriel
    Thangapandian, Sundar
    Luttrell, Joseph
    Zhou, Zhaoxian
    Zhang, Chaoyang
    Gong, Ping
    FRONTIERS IN PHYSIOLOGY, 2019, 10