Querying semantic catalogues of biomedical databases

被引:7
|
作者
Pereira, Arnaldo [1 ]
Almeida, Joao Rafael [1 ,2 ]
Lopes, Rui Pedro [3 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, DETI, IEETA, LASI, Aveiro, Portugal
[2] Univ A Coruna, Dept Computat, La Coruna, Spain
[3] Polytech Inst Braganca, CeDRI, Braganca, Portugal
关键词
Biomedical data; Knowledge bases; Semantic data; Linked data; Information extraction; Natural language interfaces; Question answering; LINKED DATA; PLATFORM; CHALLENGES; DISCOVERY;
D O I
10.1016/j.jbi.2022.104272
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: Secondary use of health data is a valuable source of knowledge that boosts observational studies, leading to important discoveries in the medical and biomedical sciences. The fundamental guiding principle for performing a successful observational study is the research question and the approach in advance of executing a study. However, in multi-centre studies, finding suitable datasets to support the study is challenging, time-consuming, and sometimes impossible without a deep understanding of each dataset.Methods: We propose a strategy for retrieving biomedical datasets of interest that were semantically annotated, using an interface built by applying a methodology for transforming natural language questions into formal language queries. The advantages of creating biomedical semantic data are enhanced by using natural language interfaces to issue complex queries without manipulating a logical query language.Results: Our methodology was validated using Alzheimer's disease datasets published in a European platform for sharing and reusing biomedical data. We converted data to semantic information format using biomedical on-tologies in everyday use in the biomedical community and published it as a FAIR endpoint. We have considered natural language questions of three types: single-concept questions, questions with exclusion criteria, and multi-concept questions. Finally, we analysed the performance of the question-answering module we used and its limitations. The source code is publicly available at https:// bioinformatics-ua.github.io/BioKBQA/.Conclusion: We propose a strategy for using information extracted from biomedical data and transformed into a semantic format using open biomedical ontologies. Our method uses natural language to formulate questions to be answered by this semantic data without the direct use of formal query languages.
引用
收藏
页数:12
相关论文
共 50 条
  • [11] Querying Databases with Taxonomies
    Martinenghi, Davide
    Torlone, Riccardo
    CONCEPTUAL MODELING - ER 2010, 2010, 6412 : 377 - +
  • [12] On querying ontologies and databases
    Bulskov, H
    Knappe, R
    Andreasen, T
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 191 - 202
  • [13] Querying multidimensional databases
    Cabibbo, L
    Torlone, R
    DATABASE PROGRAMMING LANGUAGES, 1998, 1369 : 319 - 335
  • [14] QUERYING OBJECT DATABASES
    LOOMIS, MES
    JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1994, 7 (03): : 56 - &
  • [15] Querying XML Databases
    de Sousa, AA
    Pereira, JL
    Carvalho, JA
    XXII INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY, PROCEEDINGS, 2002, : 142 - 150
  • [16] QUERYING LOGICAL DATABASES
    VARDI, MY
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1986, 33 (02) : 142 - 160
  • [17] Querying inconsistent databases
    Greco, S
    Zumpano, E
    LOGIC FOR PROGRAMMING AND AUTOMATED REASONING, PROCEEDINGS, 2000, 1955 : 308 - 325
  • [18] QUERYING INDEPENDENT DATABASES
    BUNEMAN, OP
    DAVIDSON, SB
    WATTERS, A
    INFORMATION SCIENCES, 1990, 52 (01) : 1 - 34
  • [19] Using semantic web technologies for knowledge-driven querying of biomedical data
    O'Connor, Martin
    Shankar, Ravi
    Tu, Samson
    Nyulas, Csongor
    Parrish, Dave
    Musen, Mark
    Das, Amar
    ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2007, 4594 : 267 - 276
  • [20] KaBOB: ontology-based semantic integration of biomedical databases
    Livingston, Kevin M.
    Bada, Michael
    Baumgartner, William A., Jr.
    Hunter, Lawrence E.
    BMC BIOINFORMATICS, 2015, 16