Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure-Activity Modeling and Dataset Comparison

被引:104
|
作者
Kireeva, N. [1 ,2 ]
Baskin, I. I. [1 ,3 ]
Gaspar, H. A. [1 ]
Horvath, D. [1 ]
Marcou, G. [1 ]
Varnek, A. [1 ]
机构
[1] Univ Strasbourg, Lab Infochim, CNRS, UMR 7177, F-67000 Strasbourg, France
[2] Inst Phys Chem & Electrochem RAS, Moscow 119991, Russia
[3] Moscow MV Lomonosov State Univ, Dept Chem, Moscow 119991, Russia
关键词
Generative topographic maps; Dimensionality reduction; Manifold learning; Data visualization; Predicting activity profiles; Comparison of databases; Bhattacharyya kernel; LIKELIHOOD;
D O I
10.1002/minf.201100163
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Here, the utility of Generative Topographic Maps (GTM) for data visualization, structure-activity modeling and database comparison is evaluated, on hand of subsets of the Database of Useful Decoys (DUD). Unlike other popular dimensionality reduction approaches like Principal Component Analysis, Sammon Mapping or Self-Organizing Maps, the great advantage of GTMs is providing data probability distribution functions (PDF), both in the high-dimensional space defined by molecular descriptors and in 2D latent space. PDFs for the molecules of different activity classes were successfully used to build classification models in the framework of the Bayesian approach. Because PDFs are represented by a mixture of Gaussian functions, the Bhattacharyya kernel has been proposed as a measure of the overlap of datasets, which leads to an elegant method of global comparison of chemical libraries.
引用
收藏
页码:301 / 312
页数:12
相关论文
共 35 条
  • [1] Generative topographic mapping: Universal tool for chemical space analysis
    Varnek, Alexandre
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [2] Sparse Generative Topographic Mapping for Both Data Visualization and Clustering
    Kaneko, Hiromasa
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2018, 58 (12) : 2528 - 2535
  • [3] Visualization of tree-structured data through generative topographic mapping
    Gianniotis, Nikolaos
    Tino, Peter
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (08): : 1468 - 1493
  • [4] Chemical Data Visualization and Analysis with Incremental Generative Topographic Mapping: Big Data Challenge
    Gaspar, Helena A.
    Baskin, Igor I.
    Marcou, Gilles
    Horvath, Dragos
    Varnek, Alexandre
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (01) : 84 - 94
  • [5] Generative Topographic Mapping Approach to Modeling and Chemical Space Visualization of Human Intestinal Transporters
    Gimadiev T.R.
    Madzhidov T.I.
    Marcou G.
    Varnek A.
    BioNanoScience, 2016, 6 (4) : 464 - 472
  • [6] Data Visualization, Regression, Applicability Domains and Inverse Analysis Based on Generative Topographic Mapping
    Kaneko, Hiromasa
    MOLECULAR INFORMATICS, 2019, 38 (03)
  • [7] Data Visualization & Clustering: Generative Topographic Mapping Similarity Assessment Allied to Graph Theory Clustering
    Escobar, Matheus de Souza
    Kaneko, Hiromasa
    Funatsu, Kimito
    FRONTIERS IN MOLECULAR DESIGN AND CHEMIAL INFORMATION SCIENCE - HERMAN SKOLNIK AWARD SYMPOSIUM 2015: JURGEN BAJORATH, 2016, 1222 : 175 - 210
  • [8] NONLINEAR MAPPING FOR STRUCTURE-ACTIVITY AND STRUCTURE-PROPERTY MODELING
    DOMINE, D
    DEVILLERS, J
    CHASTRETTE, M
    KARCHER, W
    JOURNAL OF CHEMOMETRICS, 1993, 7 (04) : 227 - 242
  • [9] A comparison of methods for modeling quantitative structure-activity relationships
    Sutherland, JJ
    O'Brien, LA
    Weaver, DF
    JOURNAL OF MEDICINAL CHEMISTRY, 2004, 47 (22) : 5541 - 5554
  • [10] Comparison of Data Representation Languages in the Structure-Activity Problem
    Gusakova, S. M.
    Dobrinin, D. A.
    Kharchevnikova, N., V
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2019, 53 (05) : 225 - 233