Document Retrieval System for Biomedical Question Answering

被引:0
|
作者
Bolat, Harun [1 ]
Sen, Baha [1 ]
机构
[1] Ankara Yildirim Beyazit Univ, Comp Engn Dept, TR-06010 Ankara, Turkiye
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 06期
关键词
information retrieval; document retrieval; biomedical question answering; search engine; natural language processing;
D O I
10.3390/app14062613
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Featured Application In the biomedical field, accessing data by classical methods is getting more difficult day by day, as it is in any other field, due to the data growth rate. Different methods are needed to access the desired data more quickly. In particular, more specific methods need to be developed for question answering systems. In this study, a model is proposed for the document retrieval and answer extraction modules which are a part of biomedical question answering systems.Abstract In this paper, we describe our biomedical document retrieval system and answers extraction module, which is part of the biomedical question answering system. Approximately 26.5 million PubMed articles are indexed as a corpus with the Apache Lucene text search engine. Our proposed system consists of three parts. The first part is the question analysis module, which analyzes the question and enriches it with biomedical concepts related to its wording. The second part of the system is the document retrieval module. In this step, the proposed system is tested using different information retrieval models, like the Vector Space Model, Okapi BM25, and Query Likelihood. The third part is the document re-ranking module, which is responsible for re-arranging the documents retrieved in the previous step. For this study, we tested our proposed system with 6B training questions from the BioASQ challenge task. We obtained the best MAP score on the document retrieval phase when we used Query Likelihood with the Dirichlet Smoothing model. We used the sequential dependence model at the re-rank phase, but this model produced a worse MAP score than the previous phase. In similarity calculation, we included the Named Entity Recognition (NER), UMLS Concept Unique Identifiers (CUI), and UMLS Semantic Types of the words in the question to find the sentences containing the answer. Using this approach, we observed a performance enhancement of roughly 25% for the top 20 outcomes, surpassing another method employed in this study, which relies solely on textual similarity.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Structured retrieval for question answering
    Bilotti, Matthew W.
    Ogilvie, Paul
    Callan, Jamie
    Nyberg, Eric
    [J]. Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'07, 2007, : 351 - 358
  • [22] A passage retrieval method based on probabilistic information retrieval model and UMLS concepts in biomedical question answering
    Sarrouti, Mourad
    Ouatik El Alaoui, Said
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 68 : 96 - 103
  • [23] BioAMA: Towards an End to End BioMedical Question Answering System
    Sharma, Vasu
    Kulkarni, Nitish
    Potharaju, Srividya Pranavi
    Bayomi, Gabriel
    Nyberg, Eric
    Mitamura, Teruko
    [J]. SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), 2018, : 109 - 117
  • [24] A Biomedical Question Answering System Based on SNOMED-CT
    Zhu, Xinhua
    Yang, Xuechen
    Chen, Hongchao
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 16 - 28
  • [25] Interactive Document Expansion for Answer Extraction of Question Answering System
    Fukumoto, Junichi
    Aburai, Noriaki
    Yamanishi, Ryosuke
    [J]. 17TH INTERNATIONAL CONFERENCE IN KNOWLEDGE BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS - KES2013, 2013, 22 : 991 - 1000
  • [26] Information retrieval system using UNL for Multilingual Question Answering
    Goel, Kanu
    Bhatia, Parteek
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 888 - 892
  • [27] KnowReQA: A Knowledge-aware Retrieval Question Answering System
    Wang, Chuanrui
    Bai, Jun
    Zhang, Xiaofeng
    Yan, Cen
    Ouyang, Yuanxin
    Rong, Wenge
    Xiong, Zhang
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2022, 13368 : 709 - 721
  • [28] Using a passage retrieval system to support question answering process
    Llopis, F
    Vicedo, JL
    Ferrández, A
    [J]. COMPUTATIONAL SCIENCE-ICCS 2002, PT I, PROCEEDINGS, 2002, 2329 : 61 - 69
  • [29] Flexible Classification, Question-Answering and Retrieval with Siamese Neural Networks for Biomedical Texts
    Menad, Safaa
    Abdeddaim, Said
    Soualmia, Lina F.
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS, FQAS 2023, 2023, 14113 : 27 - 38
  • [30] An Efficient Document Retrieval for Korean Open-Domain Question Answering Based on ColBERT
    Kang, Byungha
    Kim, Yeonghwa
    Shin, Youhyun
    Mourtzis, Dimitris
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (24):