Querying semantic catalogues of biomedical databases

被引:7
|
作者
Pereira, Arnaldo [1 ]
Almeida, Joao Rafael [1 ,2 ]
Lopes, Rui Pedro [3 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, DETI, IEETA, LASI, Aveiro, Portugal
[2] Univ A Coruna, Dept Computat, La Coruna, Spain
[3] Polytech Inst Braganca, CeDRI, Braganca, Portugal
关键词
Biomedical data; Knowledge bases; Semantic data; Linked data; Information extraction; Natural language interfaces; Question answering; LINKED DATA; PLATFORM; CHALLENGES; DISCOVERY;
D O I
10.1016/j.jbi.2022.104272
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: Secondary use of health data is a valuable source of knowledge that boosts observational studies, leading to important discoveries in the medical and biomedical sciences. The fundamental guiding principle for performing a successful observational study is the research question and the approach in advance of executing a study. However, in multi-centre studies, finding suitable datasets to support the study is challenging, time-consuming, and sometimes impossible without a deep understanding of each dataset.Methods: We propose a strategy for retrieving biomedical datasets of interest that were semantically annotated, using an interface built by applying a methodology for transforming natural language questions into formal language queries. The advantages of creating biomedical semantic data are enhanced by using natural language interfaces to issue complex queries without manipulating a logical query language.Results: Our methodology was validated using Alzheimer's disease datasets published in a European platform for sharing and reusing biomedical data. We converted data to semantic information format using biomedical on-tologies in everyday use in the biomedical community and published it as a FAIR endpoint. We have considered natural language questions of three types: single-concept questions, questions with exclusion criteria, and multi-concept questions. Finally, we analysed the performance of the question-answering module we used and its limitations. The source code is publicly available at https:// bioinformatics-ua.github.io/BioKBQA/.Conclusion: We propose a strategy for using information extracted from biomedical data and transformed into a semantic format using open biomedical ontologies. Our method uses natural language to formulate questions to be answered by this semantic data without the direct use of formal query languages.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] KaBOB: ontology-based semantic integration of biomedical databases
    Kevin M Livingston
    Michael Bada
    William A Baumgartner
    Lawrence E Hunter
    BMC Bioinformatics, 16
  • [22] Integration and Querying of Heterogeneous Omics Semantic Annotations for Biomedical and Biomolecular Knowledge Discovery
    Irshad, Omer
    Khan, Muhammad Usman Ghani
    CURRENT BIOINFORMATICS, 2020, 15 (01) : 41 - 58
  • [23] Word image based latent semantic indexing for conceptual querying in document image databases
    Banerjee, Sameek
    Harit, Gaurav
    Chaudhury, Santanu
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 1208 - 1212
  • [24] SPARQLGraph: a web-based platform for graphically querying biological Semantic Web databases
    Dominik Schweiger
    Zlatko Trajanoski
    Stephan Pabinger
    BMC Bioinformatics, 15
  • [25] SPARQLGraph: a web-based platform for graphically querying biological Semantic Web databases
    Schweiger, Dominik
    Trajanoski, Zlatko
    Pabinger, Stephan
    BMC BIOINFORMATICS, 2014, 15
  • [26] Querying Probabilistic Preferences in Databases
    Kenig, Batya
    Kimelfeld, Benny
    Ping, Haoyue
    Stoyanovich, Julia
    PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017, : 21 - 36
  • [27] Querying databases with knowledge domains
    Ng, W
    2000 INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM - PROCEEDINGS, 2000, : 65 - 72
  • [28] Querying and computing with BioCyc databases
    Krummenacker, M
    Paley, S
    Mueller, L
    Yan, T
    Karp, PD
    BIOINFORMATICS, 2005, 21 (16) : 3454 - 3455
  • [29] Querying Databases by Snapping Blocks
    Silva, Yasin N.
    Chon, Jaime
    2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2015, : 1472 - 1475
  • [30] Querying documents in object databases
    Abiteboul S.
    Cluet S.
    Christophides V.
    Milo T.
    Moerkotte G.
    Siméon J.
    International Journal on Digital Libraries, 1997, 1 (1) : 5 - 19