Reactome graph database: Efficient access to complex pathway data

被引:168
|
作者
Fabregat, Antonio [1 ,2 ]
Korninger, Florian [1 ]
Viteri, Guilherme [1 ]
Sidiropoulos, Konstantinos [1 ]
Marin-Garcia, Pablo [3 ,4 ]
Ping, Peipei [5 ,6 ]
Wu, Guanming [7 ]
Stein, Lincoln [8 ,9 ]
D'Eustachio, Peter [10 ]
Hermjakob, Henning [1 ,11 ]
机构
[1] European Bioinformat Inst, European Mol Biol Lab, Wellcome Genome Campus, Hinxton, England
[2] Open Targets, Wellcome Genome Campus, Hinxton, England
[3] Univ Valencia, Fdn Invest INCLIVA, Valencia, Spain
[4] Inst Med Genom, Valencia, Spain
[5] Univ Calif Los Angeles, NIH BD2K Ctr Excellence, Los Angeles, CA USA
[6] Univ Calif Los Angeles, Dept Physiol Med & Bioinformat, Los Angeles, CA USA
[7] Oregon Hlth & Sci Univ, Portland, OR 97201 USA
[8] Ontario Inst Canc Res, Toronto, ON, Canada
[9] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[10] NYU, Langone Med Ctr, New York, NY USA
[11] Natl Ctr Prot Sci, Beijing Inst Radiat Med, Beijing Proteome Res Ctr, State Key Lab Prote, Beijing, Peoples R China
基金
美国国家卫生研究院;
关键词
D O I
10.1371/journal.pcbi.1005968
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Reactome is a free, open-source, open-data, curated and peer-reviewed knowledgebase of biomolecular pathways. One of its main priorities is to provide easy and efficient access to its high quality curated data. At present, biological pathway databases typically store their contents in relational databases. This limits access efficiency because there are performance issues associated with queries traversing highly interconnected data. The same data in a graph database can be queried more efficiently. Here we present the rationale behind the adoption of a graph database (Neo4j) as well as the new ContentService (REST API) that provides access to these data. The Neo4j graph database and its query language, Cypher, provide efficient access to the complex Reactome data model, facilitating easy traversal and knowledge discovery. The adoption of this technology greatly improved query efficiency, reducing the average query time by 93%. The web service built on top of the graph database provides programmatic access to Reactome data by object oriented queries, but also supports more complex queries that take advantage of the new underlying graph-based data storage. By adopting graph database technology we are providing a high performance pathway data resource to the community. The Reactome graph database use case shows the power of NoSQL database engines for complex biological data types.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A Privacy Mechanism for Access Controlled Graph Data
    Arshad, Muhammad U.
    Felemban, Muhamad
    Pervaiz, Zahid
    Ghafoor, Arif
    Aref, Walid G.
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2019, 16 (05) : 819 - 832
  • [22] A Database Approach for Accelerate Video Data Access
    Zhou, Wenjing
    Xie, Xiangwei
    Li, Hui
    Zhang, Xiao
    Wang, Shan
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 45 - 57
  • [23] EFFICIENT GEOMETRIC ACCESS TO A MULTIREPRESENTATION GEO-DATABASE
    TAMMINEN, M
    GEO-PROCESSING, 1984, 2 (02): : 177 - 196
  • [24] Efficient access to and retrieval from a shape image database
    Wang, J
    Yang, WJ
    Acharya, R
    IEEE WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO LIBRARIES - PROCEEDINGS, 1998, : 63 - 67
  • [25] Data-Efficient Graph Learning
    Ding, Kaize
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22663 - 22663
  • [26] Data organization and access for efficient data mining
    Dunkel, B
    Soparkar, N
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 522 - 529
  • [27] Expression and Efficient Processing of Fuzzy Queries in a Graph Database Context
    Pivert, Olivier
    Smits, Gregory
    Thion, Virginie
    2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [28] Efficient graph spanning structures for large database image retrieval
    Mocanu, Bogdan
    Tapu, Ruxandra
    Zaharia, Titus
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 594 - 598
  • [29] From database to knowledge graph - using data in chemistry
    Menon, Angiras
    Krdzavac, Nenad B.
    Kraft, Markus
    CURRENT OPINION IN CHEMICAL ENGINEERING, 2019, 26 : 33 - 37
  • [30] A Distributed Graph Database for the Data Management of IoT Systems
    Ueta, Kosei
    Xue, Xiaoyong
    Nakamoto, Yukikazu
    Murakami, Sena
    2016 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2016, : 299 - 304