Multidimensional scaling and visualization of large molecular similarity tables

被引:0
|
作者
Agrafiotis, DK [1 ]
Rassokhin, DN [1 ]
Lobanov, VS [1 ]
机构
[1] 3 Dimens Pharmaceut Inc, Exton, PA 19341 USA
关键词
data mining; data analysis; pattern recognition; dimensionality reduction; feature extraction; nonlinear mapping; Sammon mapping; multidimensional scaling; neural network; combinatorial chemistry; molecular similarity; molecular diversity; conformational analysis; distance geometry;
D O I
10.1002/1096-987X(20010415)22:5<488::AID-JCC1020>3.0.CO;2-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Multidimensional scaling (MDS) is a collection of statistical techniques that attempt to embed a set of patterns described by means of a dissimilarity matrix into a low-dimensional display plane in a way that preserves their original pairwise interrelationships as closely as possible. Unfortunately, current MDS algorithms are notoriously slow, and their use is limited to small data sets. In this article, we present a family of algorithms that combine nonlinear mapping techniques with neural networks, and make possible the scaling of very large data sets that are intractable with conventional methodologies. The method employs a nonlinear mapping algorithm to project a small random sample, and then "learns" the underlying transform using one or more multilayer perceptrons. The distinct advantage of this approach is that it captures the nonlinear mapping relationship in an explicit function, and allows the scaling of additional patterns as they become available, without the need to reconstruct the entire map. A novel encoding scheme is described, allowing this methodology to be used with a wide variety of input data representations and similarity functions. The potential of the algorithm is illustrated in the analysis of two combinatorial libraries and an ensemble of molecular conformations. The method is particularly useful for extracting low-dimensional Cartesian coordinate vectors from large binary spaces, such as those encountered in the analysis of large chemical data sets. (C) 2001 John Wiley & Sons, Inc.
引用
收藏
页码:488 / 500
页数:13
相关论文
共 50 条
  • [1] Visualization of rule's similarity using multidimensional scaling
    Tsumoto, S
    Hirano, S
    [J]. THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 339 - 346
  • [2] Multidimensional Scaling Visualization Using Parametric Similarity Indices
    Tenreiro Machado, J. A.
    Lopes, Antonio M.
    Galhano, Alexandra M.
    [J]. ENTROPY, 2015, 17 (04): : 1775 - 1794
  • [3] MULTIDIMENSIONAL-SCALING OF SIMILARITY
    TORGERSON, WS
    [J]. PSYCHOMETRIKA, 1965, 30 (04) : 379 - 393
  • [4] Visualization Methodology for Multidimensional Scaling
    Andreas Buja
    Deborah F. Swayne
    [J]. Journal of Classification, 2002, 19 : 7 - 43
  • [5] Visualization methodology for multidimensional scaling
    Buja, A
    Swayne, DF
    [J]. JOURNAL OF CLASSIFICATION, 2002, 19 (01) : 7 - 43
  • [6] Data visualization with multidimensional scaling
    Buja, Andreas
    Swayne, Deborah F.
    Littman, Michael L.
    Dean, Nathaniel
    Hofmann, Heike
    Chen, Lisha
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2008, 17 (02) : 444 - 472
  • [7] Multidimensional scaling and visualization of patterns in global large-scale accidents
    Lopes, Antonio M.
    Machado, J. A. Tenreiro
    [J]. CHAOS SOLITONS & FRACTALS, 2022, 157
  • [8] Multidimensional scaling and visualization of patterns in global large-scale accidents
    Lopes, Antonio M.
    Tenreiro Machado, J. A.
    [J]. CHAOS SOLITONS & FRACTALS, 2022, 157
  • [9] Interactive visualization system with multidimensional scaling
    Song, M
    Lin, X
    [J]. ASIST 2002: PROCEEDINGS OF THE 65TH ASIST ANNUAL MEETING, VOL 39, 2002, 2002, 39 : 562 - 563
  • [10] Multidimensional scaling visualization of earthquake phenomena
    António M. Lopes
    J. A. Tenreiro Machado
    C. M. A. Pinto
    A. M. S. F. Galhano
    [J]. Journal of Seismology, 2014, 18 : 163 - 179