Querying semantic catalogues of biomedical databases

被引:7
|
作者
Pereira, Arnaldo [1 ]
Almeida, Joao Rafael [1 ,2 ]
Lopes, Rui Pedro [3 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, DETI, IEETA, LASI, Aveiro, Portugal
[2] Univ A Coruna, Dept Computat, La Coruna, Spain
[3] Polytech Inst Braganca, CeDRI, Braganca, Portugal
关键词
Biomedical data; Knowledge bases; Semantic data; Linked data; Information extraction; Natural language interfaces; Question answering; LINKED DATA; PLATFORM; CHALLENGES; DISCOVERY;
D O I
10.1016/j.jbi.2022.104272
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: Secondary use of health data is a valuable source of knowledge that boosts observational studies, leading to important discoveries in the medical and biomedical sciences. The fundamental guiding principle for performing a successful observational study is the research question and the approach in advance of executing a study. However, in multi-centre studies, finding suitable datasets to support the study is challenging, time-consuming, and sometimes impossible without a deep understanding of each dataset.Methods: We propose a strategy for retrieving biomedical datasets of interest that were semantically annotated, using an interface built by applying a methodology for transforming natural language questions into formal language queries. The advantages of creating biomedical semantic data are enhanced by using natural language interfaces to issue complex queries without manipulating a logical query language.Results: Our methodology was validated using Alzheimer's disease datasets published in a European platform for sharing and reusing biomedical data. We converted data to semantic information format using biomedical on-tologies in everyday use in the biomedical community and published it as a FAIR endpoint. We have considered natural language questions of three types: single-concept questions, questions with exclusion criteria, and multi-concept questions. Finally, we analysed the performance of the question-answering module we used and its limitations. The source code is publicly available at https:// bioinformatics-ua.github.io/BioKBQA/.Conclusion: We propose a strategy for using information extracted from biomedical data and transformed into a semantic format using open biomedical ontologies. Our method uses natural language to formulate questions to be answered by this semantic data without the direct use of formal query languages.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Semantic Data Visualisation for Biomedical Database Catalogues
    Pereira, Arnaldo
    Almeida, Joao Rafael
    Lopes, Rui Pedro
    Oliveira, Jose Luis
    HEALTHCARE, 2022, 10 (11)
  • [2] Analysis and semantic querying in large biomedical image datasets
    Kumar, Vijay S.
    Narayanan, Sivaramakrishnan
    Kurc, Tahsin
    Kong, Jun
    Gurcan, Metin N.
    Saltz, Joel H.
    COMPUTER, 2008, 41 (04) : 52 - +
  • [3] Semantic Data Querying Over NoSQL Databases with Apache Spark
    Hassan, Mahmudul
    Bansal, Srividya K.
    2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI), 2018, : 364 - 371
  • [4] An integrated framework for enhancing the semantic transformation, editing and querying of relational databases
    Vavliakis, Konstantinos N.
    Symeonidis, Andreas L.
    Karagiannis, Georgios T.
    Mitkas, Pericles A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) : 3844 - 3856
  • [5] Robust service-based semantic querying to distributed heterogeneous databases
    Buil-Aranda, Carlos
    Corcho, Oscar
    Krause, Amy
    PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, 2009, : 74 - +
  • [6] Astronomical catalogues - Simultaneous querying and matching
    Adorf, HM
    Lemson, G
    Voges, W
    Enke, H
    Steinmetz, M
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XIII, 2004, 314 : 281 - 284
  • [7] Integration and Querying of Genomic and Proteomic Semantic Annotations for Biomedical Knowledge Extraction
    Masseroli, Marco
    Canakoglu, Arif
    Ceri, Stefano
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (02) : 209 - 219
  • [8] From biomedical knowledge graph construction to semantic querying: a comprehensive approach
    Wang, Ling
    Hao, Haoyu
    Yan, Xue
    Zhou, Tie Hua
    Ryu, Keun Ho
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [9] Querying faceted databases
    Ross, KA
    Janevski, A
    SEMANTIC WEB AND DATABASES, 2005, 3372 : 199 - 218
  • [10] Querying graph databases
    Flesca, S
    Greco, S
    ADVANCES IN DATABSE TECHNOLOGY-EDBT 2000, PROCEEDINGS, 2000, 1777 : 510 - 524