Reactome graph database: Efficient access to complex pathway data

Cited by: 168
Authors
Fabregat, Antonio [1, 2]
Korninger, Florian [1]
Viteri, Guilherme [1]
Sidiropoulos, Konstantinos [1]
Marin-Garcia, Pablo [3, 4]
Ping, Peipei [5, 6]
Wu, Guanming [7]
Stein, Lincoln [8, 9]
D'Eustachio, Peter [10]
Hermjakob, Henning [1, 11]
Affiliations
[1] European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, England
[2] Open Targets, Wellcome Genome Campus, Hinxton, England
[3] Univ Valencia, Fdn Invest INCLIVA, Valencia, Spain
[4] Inst Med Genom, Valencia, Spain
[5] Univ Calif Los Angeles, NIH BD2K Ctr Excellence, Los Angeles, CA USA
[6] Univ Calif Los Angeles, Dept Physiol Med & Bioinformat, Los Angeles, CA USA
[7] Oregon Hlth & Sci Univ, Portland, OR 97201 USA
[8] Ontario Inst Canc Res, Toronto, ON, Canada
[9] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[10] NYU, Langone Med Ctr, New York, NY USA
[11] Natl Ctr Prot Sci, Beijing Inst Radiat Med, Beijing Proteome Res Ctr, State Key Lab Prote, Beijing, Peoples R China
Funding
US National Institutes of Health
DOI
10.1371/journal.pcbi.1005968
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high-quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because queries that traverse highly interconnected data perform poorly. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its query language, Cypher, provide efficient access to the complex Reactome data model, facilitating easy traversal and knowledge discovery. The adoption of this technology greatly improved query efficiency, reducing the average query time by 93%. The web service built on top of the graph database provides programmatic access to Reactome data through object-oriented queries, and also supports more complex queries that take advantage of the new underlying graph-based data storage. By adopting graph database technology, we are providing a high-performance pathway data resource to the community. The Reactome graph database use case shows the power of NoSQL database engines for complex biological data types.
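The contrast the abstract draws between relational and graph storage can be illustrated with a minimal sketch. It is not Reactome's implementation and does not use Neo4j or Cypher: the identifiers, the `HAS_EVENT_ROWS` table, and all function names are hypothetical toy stand-ins. The relational-style function rescans the whole parent/child table once per level of nesting (roughly what a chain of self-joins does), while the graph-style function follows stored edges outward from each node.

```python
from collections import deque

# Hypothetical toy data (not real Reactome identifiers): each row is a
# (parent, child) pair, as a relational "has event" table might store it.
HAS_EVENT_ROWS = [
    ("Pathway:A", "Pathway:B"),
    ("Pathway:A", "Reaction:r1"),
    ("Pathway:B", "Reaction:r2"),
    ("Pathway:B", "Pathway:C"),
    ("Pathway:C", "Reaction:r3"),
]


def descendants_by_table_scan(rows, root):
    """Relational-style lookup: one full pass over the table per level."""
    found = set()
    frontier = {root}
    passes = 0
    while frontier:
        passes += 1
        # Every pass inspects every row, whether or not it is relevant.
        next_frontier = {c for p, c in rows if p in frontier} - found
        found |= next_frontier
        frontier = next_frontier
    return found, passes


def build_adjacency(rows):
    """Store each node's outgoing edges with the node itself."""
    adjacency = {}
    for parent, child in rows:
        adjacency.setdefault(parent, []).append(child)
    return adjacency


def descendants_by_traversal(adjacency, root):
    """Graph-style lookup: breadth-first walk that only touches reachable edges."""
    found = set()
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in adjacency.get(node, []):
            if child not in found:
                found.add(child)
                queue.append(child)
    return found


if __name__ == "__main__":
    scanned, passes = descendants_by_table_scan(HAS_EVENT_ROWS, "Pathway:A")
    walked = descendants_by_traversal(build_adjacency(HAS_EVENT_ROWS), "Pathway:A")
    print(sorted(scanned) == sorted(walked), passes)
```

Both functions return the same set of descendants; the difference is the work done per level. On a deeply nested, highly interconnected hierarchy, the repeated scans grow with table size, whereas the traversal cost depends only on the edges actually reachable from the starting node.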
Pages: 13