Visualization strategies to aid interpretation of high-dimensional genotoxicity data

被引:1
|
作者
Dertinger, Stephen D. [1 ]
Briggs, Erica [1 ]
Hussien, Yusuf [2 ]
Bryce, Steven M. [1 ]
Avlasevich, Svetlana L. [1 ]
Conrad, Adam [1 ]
Johnson, George E. [2 ]
Williams, Andrew [3 ]
Bemis, Jeffrey C. [1 ]
机构
[1] Litron Labs, 3500 Winton Pl, Rochester, NY 14623 USA
[2] Swansea Univ, Inst Life Sci, Swansea, Wales
[3] Environm Hlth Sci & Res Bur, Hlth Canada, Ottawa, ON, Canada
关键词
dimensionality reduction; hierarchical clustering; multidimensional scaling; parallel coordinate plots; principal component analysis; spider plots; ToxPi; t-distributed stochastic neighbor embedding; uniform manifold approximation; VITRO MICRONUCLEUS TEST; AURORA KINASE INHIBITOR; DOUBLE-STRAND BREAKS; INDUCED ER STRESS; A GENE MUTATION; IN-VITRO; DNA-DAMAGE; CELL-DEATH; CHROMOSOME SEGREGATION; NONGENOTOXIC CHEMICALS;
D O I
10.1002/em.22604
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow (R) assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure of TK6 cells to each of 126 diverse chemicals over a range of concentrations. Obviously, challenges associated with visualizing seven biomarker responses were further complicated whenever there was a desire to represent the entire 126 chemical data set as opposed to results from a single chemical. Scatter plots, spider plots, parallel coordinate plots, hierarchical clustering, principal component analysis, toxicological prioritization index, multidimensional scaling, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are each considered in turn. Our report provides a comparative analysis of these techniques. In an era where multiplexed assays and machine learning algorithms are becoming the norm, stakeholders should find some of these visualization strategies useful for efficiently and effectively interpreting their high-dimensional data.
引用
收藏
页码:156 / 178
页数:23
相关论文
共 50 条
  • [1] High-dimensional data visualization
    Tang, Lin
    NATURE METHODS, 2020, 17 (02) : 129 - 129
  • [2] High-dimensional data visualization
    Lin Tang
    Nature Methods, 2020, 17 : 129 - 129
  • [3] Visualization strategies and techniques for high-dimensional spatio-temporal data
    Schmidt, B
    Streit, U
    Uhlenkuken, C
    GEOGRAPHICAL INFORMATION '97: FROM RESEARCH TO APPLICATION THROUGH COOPERATION, VOLS 1 AND 2, 1997, : 248 - 253
  • [4] Dynamic visualization of high-dimensional data
    Eric D. Sun
    Rong Ma
    James Zou
    Nature Computational Science, 2023, 3 : 86 - 100
  • [5] Visualization for high-dimensional data: VisHD
    Yang, CC
    Chiang, CC
    Hung, YP
    Lee, GC
    Ninth International Conference on Information Visualisation, Proceedings, 2005, : 692 - 696
  • [6] Visualization and data mining of high-dimensional data
    Inselberg, A
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2002, 60 (1-2) : 147 - 159
  • [7] Semisupervised visualization of high-dimensional data
    Kouropteva O.
    Okun O.
    Pietikäinen M.
    Pattern Recognition and Image Analysis, 2007, 17 (04) : 612 - 620
  • [8] Dynamic visualization of high-dimensional data
    Sun, Eric D.
    Ma, Rong
    Zou, James
    NATURE COMPUTATIONAL SCIENCE, 2023, 3 (01): : 86 - +
  • [9] Missing data in interactive high-dimensional data visualization
    Swayne, DF
    Buja, A
    COMPUTATIONAL STATISTICS, 1998, 13 (01) : 15 - 26
  • [10] Principles of high-dimensional data visualization in astronomy
    Goodman, A. A.
    ASTRONOMISCHE NACHRICHTEN, 2012, 333 (5-6) : 505 - 514