relSCAN - A system for extracting chemical-induced disease relation from biomedical literature

Cited by: 8
Authors
Onye, Stanley Chika [1]
Akkeles, Arif [2]
Dimililer, Nazife [3]
Affiliations
[1] Eastern Mediterranean Univ, Fac Arts & Sci, Dept Appl Math & Comp Sci, Via Mersin 10, Famagusta, North Cyprus, Turkey
[2] Eastern Mediterranean Univ, Fac Arts & Sci, Dept Math, Via Mersin 10, Famagusta, North Cyprus, Turkey
[3] Eastern Mediterranean Univ, Sch Comp & Technol, Dept Informat Technol, Via Mersin 10, TR-99628 Famagusta, North Cyprus, Turkey
Keywords
Chemical disease relation; Chemical-induced diseases; Relation extraction; Classifier ensemble; SVM; J48 decision tree
DOI
10.1016/j.jbi.2018.09.018
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper proposes an effective and robust approach for Chemical-Induced Disease (CID) relation extraction from PubMed articles. The study was performed on the corpus of the BioCreative V Track 3 Chemical Disease Relation (CDR) task. The proposed system, named relSCAN, is an efficient two-phase CID relation extraction system that classifies relation instances at the Co-occurrence and Non-Co-occurrence mention levels. We call the case in which a chemical mention and a disease mention occur in the same sentence 'Co-occurrence', and every other case 'Non-Co-occurrence'. In the first phase, the relation instances are constructed at both mention levels. In the second phase, we use a hybrid feature set to classify the relation instances at both mention levels with a combination of two Machine Learning (ML) classifiers: a Support Vector Machine (SVM) and a J48 decision tree. The system is entirely corpus-dependent and does not rely on external resources to boost its performance. We achieved good results, comparable with those of other state-of-the-art CID relation extraction systems on the BioCreative V corpus. Furthermore, our system achieves the best performance at the Non-Co-occurrence mention level.
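To make the distinction between the two mention levels concrete, the following is a minimal sketch of the first phase only (constructing relation instances), written under stated assumptions. The input layout (a list of sentences, each carrying sets of chemical and disease concept IDs), the function name `build_relation_instances`, and the IDs `C1`, `D1`, and so on are all hypothetical and do not come from the paper; the abstract does not say how relSCAN represents documents or how it pairs mentions, so this should not be read as the authors' implementation.

```python
from itertools import product


def build_relation_instances(sentences):
    """Pair every chemical with every disease in one document.

    `sentences` is a hypothetical structure: a list of dicts, each with
    a "chemicals" set and a "diseases" set of concept IDs found in that
    sentence. A pair is labelled "co-occurrence" if at least one sentence
    contains both concepts, and "non-co-occurrence" otherwise.
    """
    # Collect every concept mentioned anywhere in the document.
    chemicals = set().union(*(s["chemicals"] for s in sentences))
    diseases = set().union(*(s["diseases"] for s in sentences))

    instances = {}
    for chem, dis in product(sorted(chemicals), sorted(diseases)):
        same_sentence = any(
            chem in s["chemicals"] and dis in s["diseases"] for s in sentences
        )
        instances[(chem, dis)] = (
            "co-occurrence" if same_sentence else "non-co-occurrence"
        )
    return instances


# Toy document with three sentences (illustrative IDs only).
document = [
    {"chemicals": {"C1"}, "diseases": {"D1"}},
    {"chemicals": {"C2"}, "diseases": set()},
    {"chemicals": set(), "diseases": {"D2"}},
]
instances = build_relation_instances(document)
```

In this toy document only the pair (`C1`, `D1`) shares a sentence, so it is the only co-occurrence instance; the remaining three pairs are non-co-occurrence instances. The second phase (feature extraction and the SVM and J48 combination) is deliberately left out, because the abstract does not specify the features or how the two classifiers are combined.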
Pages: 79-87
Number of pages: 9