Visualization strategies to aid interpretation of high-dimensional genotoxicity data

被引:1
|
作者
Dertinger, Stephen D. [1 ]
Briggs, Erica [1 ]
Hussien, Yusuf [2 ]
Bryce, Steven M. [1 ]
Avlasevich, Svetlana L. [1 ]
Conrad, Adam [1 ]
Johnson, George E. [2 ]
Williams, Andrew [3 ]
Bemis, Jeffrey C. [1 ]
机构
[1] Litron Labs, 3500 Winton Pl, Rochester, NY 14623 USA
[2] Swansea Univ, Inst Life Sci, Swansea, Wales
[3] Environm Hlth Sci & Res Bur, Hlth Canada, Ottawa, ON, Canada
关键词
dimensionality reduction; hierarchical clustering; multidimensional scaling; parallel coordinate plots; principal component analysis; spider plots; ToxPi; t-distributed stochastic neighbor embedding; uniform manifold approximation; VITRO MICRONUCLEUS TEST; AURORA KINASE INHIBITOR; DOUBLE-STRAND BREAKS; INDUCED ER STRESS; A GENE MUTATION; IN-VITRO; DNA-DAMAGE; CELL-DEATH; CHROMOSOME SEGREGATION; NONGENOTOXIC CHEMICALS;
D O I
10.1002/em.22604
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow (R) assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure of TK6 cells to each of 126 diverse chemicals over a range of concentrations. Obviously, challenges associated with visualizing seven biomarker responses were further complicated whenever there was a desire to represent the entire 126 chemical data set as opposed to results from a single chemical. Scatter plots, spider plots, parallel coordinate plots, hierarchical clustering, principal component analysis, toxicological prioritization index, multidimensional scaling, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are each considered in turn. Our report provides a comparative analysis of these techniques. In an era where multiplexed assays and machine learning algorithms are becoming the norm, stakeholders should find some of these visualization strategies useful for efficiently and effectively interpreting their high-dimensional data.
引用
收藏
页码:156 / 178
页数:23
相关论文
共 50 条
  • [31] Quality Metrics in High-Dimensional Data Visualization: An Overview and Systematization
    Bertini, Enrico
    Tatu, Andrada
    Keim, Daniel
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (12) : 2203 - 2212
  • [32] SCALABLE VISUALIZATION FOR HIGH-DIMENSIONAL SINGLE-CELL DATA
    Kim, Juho
    Russell, Nate
    Peng, Jian
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2017, 2017, : 623 - 634
  • [33] Supervised model-based visualization of high-dimensional data
    Kontkanen, Petri
    Lahtinen, Jussi
    Myllymäki, Petri
    Silander, Tomi
    Tirri, Henry
    Intelligent Data Analysis, 2000, 4 (3-4) : 213 - 227
  • [34] Beyond Multidimensional Data in Model Visualization: High-Dimensional and Complex Nonnumeric Data
    Villa-Vialaneix, Nathalie
    Ruiz-Gazen, Anne
    STATISTICAL ANALYSIS AND DATA MINING, 2015, 8 (04) : 232 - 239
  • [35] CytoSPADE: high-performance analysis and visualization of high-dimensional cytometry data
    Linderman, Michael D.
    Bjornson, Zach
    Simonds, Erin F.
    Qiu, Peng
    Bruggner, Robert V.
    Sheode, Ketaki
    Meng, Teresa H.
    Plevritis, Sylvia K.
    Nolan, Garry P.
    BIOINFORMATICS, 2012, 28 (18) : 2400 - 2401
  • [36] Visualization and unsupervised predictive clustering of high-dimensional multimodal neuroimaging data
    Mwangi, Benson
    Soares, Jair C.
    Hasan, Khader M.
    JOURNAL OF NEUROSCIENCE METHODS, 2014, 236 : 19 - 25
  • [37] Multidimensional scaling with discrimination coefficients for supervised visualization of high-dimensional data
    Berrar, Daniel
    Ohmayer, Georg
    NEURAL COMPUTING & APPLICATIONS, 2011, 20 (08): : 1211 - 1218
  • [38] The state-of-the-art on tours for dynamic visualization of high-dimensional data
    Lee, Stuart
    Cook, Dianne
    da Silva, Natalia
    Laa, Ursula
    Spyrison, Nicholas
    Wang, Earo
    Zhang, H. Sherry
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (04)
  • [39] Burning Sage: Reversing the Curse of Dimensionality in the Visualization of High-Dimensional Data
    Laa, Ursula
    Cook, Dianne
    Lee, Stuart
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (01) : 40 - 49
  • [40] Histogram Equalization and Specification for High-dimensional Data Visualization using RadViz
    Wang, Yan-Chao
    Zhang, Qian
    Lin, Feng
    Goh, Chi-Keong
    Wang, Xuan
    Seah, Hock-Soon
    CGI'17: PROCEEDINGS OF THE COMPUTER GRAPHICS INTERNATIONAL CONFERENCE, 2017,