External features enriched model for biomedical question answering

被引:14
|
作者
Xu, Gezheng [1 ,2 ]
Rong, Wenge [1 ,3 ]
Wang, Yanmeng [4 ]
Ouyang, Yuanxin [1 ,3 ]
Xiong, Zhang [1 ,3 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[2] Beihang Univ, Sino French Engineer Sch, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, 37 Xueyuan Rd, Beijing 100191, Peoples R China
[4] Ping Technol, Xinyuannanlu 3, Beijing 100027, Peoples R China
基金
中国国家自然科学基金;
关键词
Biomedical question answering; Feature fusion; Pre-trained language model; POS; NER; NAMED ENTITY RECOGNITION;
D O I
10.1186/s12859-021-04176-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundBiomedical question answering (QA) is a sub-task of natural language processing in a specific domain, which aims to answer a question in the biomedical field based on one or more related passages and can provide people with accurate healthcare-related information. Recently, a lot of approaches based on the neural network and large scale pre-trained language model have largely improved its performance. However, considering the lexical characteristics of biomedical corpus and its small scale dataset, there is still much improvement room for biomedical QA tasks. ResultsInspired by the importance of syntactic and lexical features in the biomedical corpus, we proposed a new framework to extract external features, such as part-of-speech and named-entity recognition, and fused them with the original text representation encoded by pre-trained language model, to enhance the biomedical question answering performance. Our model achieves an overall improvement of all three metrics on BioASQ 6b, 7b, and 8b factoid question answering tasks.ConclusionsThe experiments on BioASQ question answering dataset demonstrated the effectiveness of our external feature-enriched framework. It is proven by the experiments conducted that external lexical and syntactic features can improve Pre-trained Language Model's performance in biomedical domain question answering task.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] External features enriched model for biomedical question answering
    Gezheng Xu
    Wenge Rong
    Yanmeng Wang
    Yuanxin Ouyang
    Zhang Xiong
    [J]. BMC Bioinformatics, 22
  • [2] Biomedical question answering: A survey
    Athenikos, Sofia J.
    Han, Hyoil
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2010, 99 (01) : 1 - 24
  • [3] Question Answering in the Biomedical Domain
    Nguyen, Vincent
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 54 - 63
  • [4] Word embeddings and external resources for answer processing in biomedical factoid question answering
    Dimitriadis, Dimitris
    Tsoumakas, Grigorios
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 92
  • [5] Pre-trained Language Model for Biomedical Question Answering
    Yoon, Wonjin
    Lee, Jinhyuk
    Kim, Donghyeon
    Jeong, Minbyul
    Kang, Jaewoo
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 727 - 740
  • [6] Improving Biomedical Question Answering by Data Augmentation and Model Weighting
    Du, Yongping
    Yan, Jingya
    Lu, Yuxuan
    Zhao, Yiliang
    Jin, Xingnan
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1114 - 1124
  • [7] Question Processing and Clustering in INDOC: A Biomedical Question Answering System
    Sondhi, Parikshit
    Raj, Purushottam
    Kumar, V. Vinod
    Mittal, Ankush
    [J]. EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01)
  • [8] Biomedical question answering using semantic relations
    Dimitar Hristovski
    Dejan Dinevski
    Andrej Kastrin
    Thomas C Rindflesch
    [J]. BMC Bioinformatics, 16
  • [9] Document Retrieval System for Biomedical Question Answering
    Bolat, Harun
    Sen, Baha
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [10] Data Augmentation for Biomedical Factoid Question Answering
    Pappas, Dimitris
    Malakasiotis, Prodromos
    Androutsopoulos, Ion
    [J]. PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 63 - 81