iBBiG: iterative binary bi-clustering of gene sets

被引:35
|
作者
Gusenleitner, Daniel [1 ]
Howe, Eleanor A. [1 ,2 ]
Bentink, Stefan [1 ,3 ]
Quackenbush, John [1 ,3 ,4 ]
Culhane, Aedin C. [1 ,3 ]
机构
[1] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02115 USA
[2] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
[3] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[4] Dana Farber Canc Inst, Dept Canc Biol, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
ENRICHMENT ANALYSIS; BIOLOGICAL PROCESSES; MICROARRAY DATA; EXPRESSION DATA; DISEASES; CCL5;
D O I
10.1093/bioinformatics/bts438
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods. Results: We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set-phenotype association that predicted tumor metastases within tumor subtypes.
引用
收藏
页码:2484 / 2492
页数:9
相关论文
共 50 条
  • [21] SUBIC: A Supervised Bi-Clustering Approach for Precision Medicine
    Nezhad, Milad Zafar
    Zhu, Dongxiao
    Sadati, Najibesadat
    Yang, Kai
    Levy, Phillip
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 755 - 760
  • [22] Network-aided Bi-Clustering for discovering cancer subtypes
    Yu, Guoxian
    Yu, Xianxue
    Wang, Jun
    SCIENTIFIC REPORTS, 2017, 7
  • [23] LINEAR COHERENT BI-CLUSTERING VIA BEAM SEARCHING AND SAMPLE SET CLUSTERING
    Shi, Yi
    Hasan, Maryam
    Cai, Zhipeng
    Lin, Guohui
    Schuurmans, Dale
    DISCRETE MATHEMATICS ALGORITHMS AND APPLICATIONS, 2012, 4 (02)
  • [24] Analyzing movement trajectories using a Markov bi-clustering method
    Erez, Keren
    Goldberger, Jacob
    Sosnik, Ronen
    Shemesh, Moshe
    Rothstein, Susan
    Abeles, Moshe
    JOURNAL OF COMPUTATIONAL NEUROSCIENCE, 2009, 27 (03) : 543 - 552
  • [25] FunCC: A new bi-clustering algorithm for functional data with misalignment
    Galvani, Marta
    Torti, Agostino
    Menafoglio, Alessandra
    Vantini, Simone
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 160
  • [26] Bi-MARS: A Bi-clustering based Memetic Algorithm for Recommender Systems
    Bansal, Saumya
    Baliyan, Niyati
    APPLIED SOFT COMPUTING, 2020, 97
  • [27] Bi-clustering continuous data with self-organizing map
    Benabdeslem, Khalid
    Allab, Kais
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1551 - 1562
  • [28] Network-aided Bi-Clustering for discovering cancer subtypes
    Guoxian Yu
    Xianxue Yu
    Jun Wang
    Scientific Reports, 7
  • [29] Analyzing movement trajectories using a Markov bi-clustering method
    Keren Erez
    Jacob Goldberger
    Ronen Sosnik
    Moshe Shemesh
    Susan Rothstein
    Moshe Abeles
    Journal of Computational Neuroscience, 2009, 27 : 543 - 552
  • [30] Bi-clustering continuous data with self-organizing map
    Khalid Benabdeslem
    Kais Allab
    Neural Computing and Applications, 2013, 22 : 1551 - 1562