BicPAMS: software for biological data analysis with pattern-based biclustering

Cited by: 47
Authors
Henriques, Rui [1]
Ferreira, Francisco L.
Madeira, Sara C. [1]
Affiliations
[1] Univ Lisbon, INESC ID, Lisbon, Portugal
Source
BMC BIOINFORMATICS | 2017 / Vol. 18
Keywords
GENE-EXPRESSION DATA; ALGORITHMS;
DOI
10.1186/s12859-017-1493-3
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Biclustering has been largely applied for the unsupervised analysis of biological data, being recognised today as a key technique to discover putative modules in both expression data (subsets of genes correlated in subsets of conditions) and network data (groups of coherently interconnected biological entities). However, given its computational complexity, only recent breakthroughs in pattern-based biclustering enabled efficient searches without the restrictions that state-of-the-art biclustering algorithms place on the structure and homogeneity of biclusters. As a result, pattern-based biclustering provides an unprecedented opportunity to discover non-trivial yet meaningful biological modules with putative functions, whose coherency and tolerance to noise can be tuned and made problem-specific.
Methods: To enable the effective use of pattern-based biclustering by the scientific community, we developed BicPAMS (Biclustering based on PAttern Mining Software), a software tool that: 1) makes available state-of-the-art pattern-based biclustering algorithms (BicPAM (Henriques and Madeira, Alg Mol Biol 9: 27, 2014), BicNET (Henriques and Madeira, Alg Mol Biol 11: 23, 2016), BicSPAM (Henriques and Madeira, BMC Bioinforma 15: 130, 2014), BiC2PAM (Henriques and Madeira, Alg Mol Biol 11: 1-30, 2016), BiP (Henriques and Madeira, IEEE/ACM Trans Comput Biol Bioinforma, 2015), DeBi (Serin and Vingron, AMB 6:1-12, 2011) and BiModule (Okada et al., IPSJ Trans Bioinf 48(SIG5):39-48, 2007)); 2) consistently integrates their dispersed contributions; 3) further explores additional accuracy and efficiency gains; and 4) makes available graphical and application programming interfaces.
Results: Results on both synthetic and real data confirm the relevance of BicPAMS for biological data analysis, highlighting its essential role for the discovery of putative modules with non-trivial yet biologically significant functions from expression and network data.
Conclusions: BicPAMS is the first biclustering tool offering the possibility to: 1) parametrically customize the structure, coherency and quality of biclusters; 2) analyze large-scale biological networks; and 3) tackle the restrictive assumptions placed by state-of-the-art biclustering algorithms. These contributions are shown to be key for an adequate, complete and user-assisted unsupervised analysis of biological data.
Pages: 16