SJARACNe: a scalable software tool for gene network reverse engineering from big data

Cited by: 31
Authors
Khatamian, Alireza [1 ]
Paull, Evan O. [2 ]
Califano, Andrea [2 ]
Yu, Jiyang [1 ]
Affiliations
[1] St Jude Childrens Res Hosp, Dept Computat Biol, 332 N Lauderdale St, Memphis, TN 38105 USA
[2] Columbia Univ, Dept Syst Biol, New York, NY 10032 USA
Funding
National Institutes of Health (USA)
DOI
10.1093/bioinformatics/bty907
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010 ; 081704 ;
Abstract
Over the last two decades, we have observed an exponential increase in the number of generated array or sequencing-based transcriptomic profiles. Reverse engineering of biological networks from high-throughput gene expression profiles has been one of the grand challenges in systems biology. The Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe) represents one of the most effective and widely-used tools to address this challenge. However, existing ARACNe implementations do not efficiently process big input data with thousands of samples. Here we present an improved implementation of the algorithm, SJARACNe, to solve this big data problem, based on sophisticated software engineering. The new scalable SJARACNe package achieves a dramatic improvement in computational performance in both time and memory usage and implements new features while preserving the network inference accuracy of the original algorithm. Given that large-sampled transcriptomic data is increasingly available and ARACNe is extremely demanding for network reconstruction, the scalable SJARACNe will allow even researchers with modest computational resources to efficiently construct complex regulatory and signaling networks from thousands of gene expression profiles.

Availability and implementation: SJARACNe is implemented in C++ (computational core) and Python (pipelining scripting wrapper, >= 3.6.1). It is freely available at https://github.com/jyyulab/SJARACNe.

Supplementary information: Supplementary data are available at Bioinformatics online.
Pages: 2165-2166
Page count: 2