Document Retrieval System for Biomedical Question Answering

Cited by: 0
Authors
Bolat, Harun [1]
Sen, Baha [1]
Affiliation
[1] Ankara Yildirim Beyazit Univ, Comp Engn Dept, TR-06010 Ankara, Turkiye
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 06
Keywords
information retrieval; document retrieval; biomedical question answering; search engine; natural language processing
DOI
10.3390/app14062613
Chinese Library Classification (CLC) number
O6 [Chemistry]
Subject classification code
0703
Abstract
Featured Application: In the biomedical field, as in every other field, the rate of data growth makes it steadily harder to access data with classical methods. Different methods are needed to reach the desired data more quickly, and question answering systems in particular call for more specific methods. This study proposes a model for the document retrieval and answer extraction modules that form part of a biomedical question answering system.

Abstract: In this paper, we describe our biomedical document retrieval system and answer extraction module, which are part of a biomedical question answering system. Approximately 26.5 million PubMed articles are indexed as a corpus with the Apache Lucene text search engine. The proposed system consists of three parts. The first is the question analysis module, which analyzes the question and enriches it with biomedical concepts related to its wording. The second is the document retrieval module, in which the system is tested with different information retrieval models: the Vector Space Model, Okapi BM25, and Query Likelihood. The third is the document re-ranking module, which re-orders the documents retrieved in the previous step. We tested the proposed system with the training questions of BioASQ challenge task 6B. In the document retrieval phase, the best MAP score was obtained with Query Likelihood with Dirichlet smoothing. In the re-ranking phase we used the sequential dependence model, but it produced a worse MAP score than the retrieval phase alone. To find the sentences containing the answer, the similarity calculation included the Named Entity Recognition (NER) tags, UMLS Concept Unique Identifiers (CUI), and UMLS Semantic Types of the words in the question. With this approach, we observed a performance improvement of roughly 25% for the top 20 results over the other method used in this study, which relies solely on textual similarity.
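The abstract names Query Likelihood with Dirichlet smoothing as the best-performing retrieval model and MAP as the evaluation measure. The following is a minimal sketch of both ideas on a toy corpus, not the authors' Lucene-based implementation. The function names, the three example documents, and the smoothing value mu = 2000 are assumptions made for illustration only. The average precision shown is the standard textbook definition; the official BioASQ measure may normalise differently.

```python
import math
from collections import Counter

# Hypothetical toy corpus (not from the paper): doc id -> list of tokens.
DOCS = {
    "d1": "aspirin reduces fever and pain".split(),
    "d2": "the gene brca1 is linked to breast cancer".split(),
    "d3": "aspirin and breast cancer risk".split(),
}


def dirichlet_ql_score(query_terms, doc_terms, coll_tf, coll_len, mu=2000.0):
    """Log query likelihood of one document with Dirichlet smoothing.

    p(t | d) = (tf(t, d) + mu * p(t | C)) / (|d| + mu)
    """
    doc_tf = Counter(doc_terms)
    doc_len = len(doc_terms)
    score = 0.0
    for term in query_terms:
        p_coll = coll_tf.get(term, 0) / coll_len
        if p_coll == 0.0:
            # Simplification: ignore query terms absent from the collection.
            continue
        score += math.log((doc_tf[term] + mu * p_coll) / (doc_len + mu))
    return score


def rank_documents(query_terms, docs, mu=2000.0):
    """Return doc ids sorted by descending query likelihood."""
    coll_tf = Counter(t for tokens in docs.values() for t in tokens)
    coll_len = sum(len(tokens) for tokens in docs.values())
    scores = {
        doc_id: dirichlet_ql_score(query_terms, tokens, coll_tf, coll_len, mu)
        for doc_id, tokens in docs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


def average_precision(ranked_ids, relevant_ids):
    """Standard AP: mean of precision@k over the ranks k of relevant hits."""
    if not relevant_ids:
        return 0.0
    hits, total = 0, 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            hits += 1
            total += hits / rank
    return total / len(relevant_ids)


def mean_average_precision(runs):
    """MAP over (ranked_ids, relevant_ids) pairs, one pair per question."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


if __name__ == "__main__":
    ranking = rank_documents(["aspirin", "cancer"], DOCS)
    print(ranking, average_precision(ranking, {"d3", "d2"}))
```

In this sketch, larger values of mu pull every document's term probabilities toward the collection statistics, which mainly helps short documents whose raw counts are unreliable. Swapping in a different scoring function while keeping `average_precision` fixed is one way to compare retrieval models on the same question set.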
Pages: 18