Visualization strategies to aid interpretation of high-dimensional genotoxicity data

被引:1
|
作者
Dertinger, Stephen D. [1 ]
Briggs, Erica [1 ]
Hussien, Yusuf [2 ]
Bryce, Steven M. [1 ]
Avlasevich, Svetlana L. [1 ]
Conrad, Adam [1 ]
Johnson, George E. [2 ]
Williams, Andrew [3 ]
Bemis, Jeffrey C. [1 ]
机构
[1] Litron Labs, 3500 Winton Pl, Rochester, NY 14623 USA
[2] Swansea Univ, Inst Life Sci, Swansea, Wales
[3] Environm Hlth Sci & Res Bur, Hlth Canada, Ottawa, ON, Canada
关键词
dimensionality reduction; hierarchical clustering; multidimensional scaling; parallel coordinate plots; principal component analysis; spider plots; ToxPi; t-distributed stochastic neighbor embedding; uniform manifold approximation; VITRO MICRONUCLEUS TEST; AURORA KINASE INHIBITOR; DOUBLE-STRAND BREAKS; INDUCED ER STRESS; A GENE MUTATION; IN-VITRO; DNA-DAMAGE; CELL-DEATH; CHROMOSOME SEGREGATION; NONGENOTOXIC CHEMICALS;
D O I
10.1002/em.22604
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow (R) assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure of TK6 cells to each of 126 diverse chemicals over a range of concentrations. Obviously, challenges associated with visualizing seven biomarker responses were further complicated whenever there was a desire to represent the entire 126 chemical data set as opposed to results from a single chemical. Scatter plots, spider plots, parallel coordinate plots, hierarchical clustering, principal component analysis, toxicological prioritization index, multidimensional scaling, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are each considered in turn. Our report provides a comparative analysis of these techniques. In an era where multiplexed assays and machine learning algorithms are becoming the norm, stakeholders should find some of these visualization strategies useful for efficiently and effectively interpreting their high-dimensional data.
引用
收藏
页码:156 / 178
页数:23
相关论文
共 50 条
  • [41] Network-based Clustering and Embedding for High-Dimensional Data Visualization
    Zhang, Hengyuan
    Chen, Xiaowu
    2013 INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS (CAD/GRAPHICS), 2013, : 290 - 297
  • [42] Focused multidimensional scaling: interactive visualization for exploration of high-dimensional data
    Urpa, Lea M.
    Anders, Simon
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [43] Focused multidimensional scaling: interactive visualization for exploration of high-dimensional data
    Lea M. Urpa
    Simon Anders
    BMC Bioinformatics, 20
  • [44] Multidimensional scaling with discrimination coefficients for supervised visualization of high-dimensional data
    Daniel Berrar
    Georg Ohmayer
    Neural Computing and Applications, 2011, 20 : 1211 - 1218
  • [45] Very Fast Interactive Visualization of Large Sets of High-dimensional Data
    Dzwinel, Witold
    Wcislo, Rafal
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 572 - 581
  • [46] Dynamic visualization of statistical learning in the context of high-dimensional textual data
    Greenacre, Michael
    Hastie, Trevor
    JOURNAL OF WEB SEMANTICS, 2010, 8 (2-3): : 163 - 168
  • [47] Visualization of high-dimensional data using an association of multidimensional scaling to clustering
    Naud, A
    2004 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2004, : 252 - 255
  • [48] PolarViz: a discriminating visualization and visual analytics tool for high-dimensional data
    Wang, Yan Chao
    Zhang, Qian
    Lin, Feng
    Goh, Chi Keong
    Seah, Hock Soon
    VISUAL COMPUTER, 2019, 35 (11): : 1567 - 1582
  • [49] High-dimensional data analysis with subspace comparison using matrix visualization
    Wang, Junpeng
    Liu, Xiaotong
    Shen, Han-Wei
    INFORMATION VISUALIZATION, 2019, 18 (01) : 94 - 109
  • [50] DD-HDS: A method for visualization and exploration of high-dimensional data
    Lespinats, Sylvain
    Verleysen, Michel
    Giron, Alain
    Fertil, Bernard
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (05): : 1265 - 1279