AutoClassWeb: a simple web interface for Bayesian clustering of omics data

被引:0
|
作者
Poulain, Pierre [1 ]
Camadro, Jean-Michel [1 ]
机构
[1] Univ Paris Cite, CNRS, Inst Jacques Monod, F-75013 Paris, France
关键词
Clustering; Genomics; Proteomics; Bayesian; Autoclass; Machine learning;
D O I
10.1186/s13104-022-06129-6
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Objective: Data clustering is a common exploration step in the omics era, notably in genomics and proteomics where many genes or proteins can be quantified from one or more experiments. Bayesian clustering is a powerful unsupervised algorithm that can classify several thousands of genes or proteins. AutoClass C, its original implementation, handles missing data, automatically determines the best number of clusters but is not user-friendly. Results: We developed an online tool called AutoClassWeb, which provides an easy-to-use and simple web interface for Bayesian clustering with AutoClass. Input data are entered as TSV files and quality controlled. Results are provided in formats that ease further analyses with spreadsheet programs or with programming languages, such as Python or R. AutoClassWeb is implemented in Python and is published under the 3-Clauses BSD license. The source code is available at https://github.com/pierrepo/autoclassweb along with a detailed documentation.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] AutoClassWeb: a simple web interface for Bayesian clustering of omics data
    Pierre Poulain
    Jean-Michel Camadro
    BMC Research Notes, 15
  • [2] A Bayesian Nonparametric Model for Integrative Clustering of Omics Data
    Peneva, Iliana
    Savage, Richard S.
    BAYESIAN STATISTICS AND NEW GENERATIONS, BAYSM 2018, 2019, 296 : 105 - 114
  • [3] Generalized Bayesian Factor Analysis for Integrative Clustering with Applications to Multi-Omics Data
    Min, Eun Jeong
    Chang, Changgee
    Long, Qi
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 109 - 119
  • [4] Web Databases for Omics Data in Tomato
    Suzuki, Ayako
    Suwabe, Keita
    Yano, Kentaro
    JOURNAL OF THE JAPANESE SOCIETY FOR HORTICULTURAL SCIENCE, 2009, 78 (01): : 23 - 31
  • [5] A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data
    Mo, Qianxing
    Shen, Ronglai
    Guo, Cui
    Vannucci, Marina
    Chan, Keith S.
    Hilsenbeck, Susan G.
    BIOSTATISTICS, 2018, 19 (01) : 71 - 86
  • [6] Bayesian feedback in data clustering
    Jain, A. K.
    Mallapragada, Pavan K.
    Law, Martin
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 374 - +
  • [7] BAYESIAN CLUSTERING OF DATA SETS
    MENZEFRICKE, U
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1981, 10 (01): : 65 - 77
  • [8] Bayesian clustering with uncertain data
    Nicholls, Kath
    Kirk, Paul D. W.
    Wallace, Chris
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (09)
  • [9] Integrative Clustering Analysis for Omics Data with Missingness
    Zhao, Yinqi
    Darst, Burcu
    Conti, David V.
    GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 806 - 806
  • [10] Clustering multilayer omics data using MuNCut
    Hidalgo, Sebastian J. Teran
    Ma, Shuangge
    BMC GENOMICS, 2018, 19