GeoWaVe: geometric median clustering with weighted voting for ensemble clustering of cytometry data

被引:2
|
作者
Burton, Ross J. [1 ,2 ]
Cuff, Simone M. [1 ]
Morgan, Matt P. [2 ]
Artemiou, Andreas [3 ]
Eberl, Matthias [1 ,4 ]
机构
[1] Cardiff Univ, Sch Med, Div Infect & Immun, Cardiff F14 4XN, Wales
[2] Cardiff & Vale Univ Hlth Board, Univ Hosp Wales, Adult Crit Care, Cardiff CF14 4XW, Wales
[3] Cardiff Univ, Sch Math, Cardiff CF24 4AG, Wales
[4] Cardiff Univ, Syst Immun Res Inst, Cardiff F14 4XN, Wales
基金
英国惠康基金; 英国医学研究理事会;
关键词
SINGLE; FLOW;
D O I
10.1093/bioinformatics/btac751
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
MotivationClustering is an unsupervised method for identifying structure in unlabelled data. In the context of cytometry, it is typically used to categorize cells into subpopulations of similar phenotypes. However, clustering is greatly dependent on hyperparameters and the data to which it is applied as each algorithm makes different assumptions and generates a different 'view' of the dataset. As such, the choice of clustering algorithm can significantly influence results, and there is often not one preferred method but different insights to be obtained from different methods. To overcome these limitations, consensus approaches are needed that directly address the effect of competing algorithms. To the best of our knowledge, consensus clustering algorithms designed specifically for the analysis of cytometry data are lacking.ResultsWe present a novel ensemble clustering methodology based on geometric median clustering with weighted voting (GeoWaVe). Compared to graph ensemble clustering methods that have gained popularity in single-cell RNA sequencing analysis, GeoWaVe performed favourably on different sets of high-dimensional mass and flow cytometry data. Our findings provide proof of concept for the power of consensus methods to make the analysis, visualization and interpretation of cytometry data more robust and reproducible. The wide availability of ensemble clustering methods is likely to have a profound impact on our understanding of cellular responses, clinical conditions and therapeutic and diagnostic options.Availability and implementationGeoWaVe is available as part of the CytoCluster package and published on the Python Package Index . Benchmarking data described are available from .Supplementary informationare available at Bioinformatics online.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Clustering cancer gene expression data by projective clustering ensemble
    Yu, Xianxue
    Yu, Guoxian
    Wang, Jun
    PLOS ONE, 2017, 12 (02):
  • [22] Complementary ensemble clustering of biomedical data
    Fodeh, Samah Jamal
    Brandt, Cynthia
    Thai Binh Luong
    Haddad, Ali
    Schultz, Martin
    Murphy, Terrence
    Krauthammer, Michael
    JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (03) : 436 - 443
  • [23] Weighted-object ensemble clustering: methods and analysis
    Ren, Yazhou
    Domeniconi, Carlotta
    Zhang, Guoji
    Yu, Guoxian
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (02) : 661 - 689
  • [24] Weighted-object ensemble clustering: methods and analysis
    Yazhou Ren
    Carlotta Domeniconi
    Guoji Zhang
    Guoxian Yu
    Knowledge and Information Systems, 2017, 51 : 661 - 689
  • [25] Double Weighted Ensemble Clustering for Cancer Subtypes Analysis
    Zhang, Xin
    Huo, Hua
    IEEE ACCESS, 2022, 10 : 41477 - 41488
  • [26] A Novel Approach for Geometric Clustering based on Tensor Voting Framework
    Lombardi, Gabriele
    Rozza, Alessandro
    Casiraghi, Elena
    Campadelli, Paola
    Neural Nets WIRN11, 2011, 234 : 129 - 138
  • [27] Clustering via geometric median shift over Riemannian manifolds
    Wang, Yang
    Huang, Xiaodi
    Wu, Lin
    INFORMATION SCIENCES, 2013, 220 : 292 - 305
  • [28] Spam Detection using Dynamic Weighted Voting based on Clustering
    Saeedian, Mehmoush Famil
    Beigy, Hamid
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 122 - 126
  • [29] Weighted Topological Clustering for Categorical Data
    Rogovschi, Nicoleta
    Nadif, Mohamed
    NEURAL INFORMATION PROCESSING, PT I, 2011, 7062 : 599 - +
  • [30] Weighted clustering of called ACGH data
    Van Wieringen, Wessel
    de Wiel, Mark A. Van
    Ylstra, Bauke
    CELLULAR ONCOLOGY, 2007, 29 (02) : 121 - 121