A context-based ABC model for literature-based discovery

被引:9
|
作者
Kim, Yong Hwan [1 ]
Song, Min [2 ]
机构
[1] CheongJu Univ, Div Humanities, Cheongju, South Korea
[2] Yonsei Univ, Dept Lib & Informat Sci, Seoul, South Korea
来源
PLOS ONE | 2019年 / 14卷 / 04期
基金
新加坡国家研究基金会;
关键词
LATERAL-SCLEROSIS; MOUSE MODEL; TAU; PHOSPHORYLATION; MUTATIONS; DEMENTIA; GENES;
D O I
10.1371/journal.pone.0215313
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background In the literature-based discovery, considerable research has been done based on the ABC model developed by Swanson. ABC model hypothesizes that there is a meaningful relation between entity A extracted from document set 1 and entity C extracted from document set 2 through B entities that appear commonly in both document sets. The results of ABC model are relations among entity A, B, and C, which is referred as paths. A path allows for hypothesizing the relationship between entity A and entity C, or helps discover entity B as a new evidence for the relationship between entity A and entity C. The co-occurrence based approach of ABC model is a well-known approach to automatic hypothesis generation by creating various paths. However, the co-occurrence based ABC model has a limitation, in that biological context is not considered. It focuses only on matching of B entity which commonly appears in relation between two entities. Therefore, the paths extracted by the co-occurrence based ABC model tend to include a lot of irrelevant paths, meaning that expert verification is essential. Methods In order to overcome this limitation of the co-occurrence based ABC model, we propose a context-based approach to connecting one entity relation to another, modifying the ABC model using biological contexts. In this study, we defined four biological context elements: cell, drug, disease, and organism. Based on these biological context, we propose two extended ABC models: a context-based ABC model and a context-assignment-based ABC model. In order to measure the performance of the both proposed models, we examined the relevance of the B entities between the well-known relations "APOE-MAPT" as well as "FUS-TARDBP". Each relation means interaction between neurodegenerative disease associated with proteins. The interaction between APOE and MAPT is known to play a crucial role in Alzheimer's disease as APOE affects tau-mediated neurodegeneration. It has been shown that mutation in FUS and TARDBP are associated with amyotrophic lateral sclerosis(ALS), a motor neuron disease by leading to neuronal cell death. Using these two relations, we compared both of proposed models to co-occurrence based ABC model. Results The precision of B entities by co-occurrence based ABC model was 27.1% for "APOE-MAPT" and 22.1% for "FUS-TARDBP",respectively. In context-based ABC model, precision of extracted B entities was 71.4% for "APOE-MAPT", and 77.9% for "FUS-TARDBP". Context-assignment based ABC model achieved 89% and 97.5% precision for the two relations, respectively. Both proposed models achieved a higher precision than co-occurrence-based ABC model.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Literature-Based Discovery
    Ruch, Patrick
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (07): : 1506 - 1508
  • [2] Validating discovery in literature-based discovery
    Kostoff, Ronald N.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (04) : 448 - 450
  • [3] Literature-based discovery by an enhanced information retrieval model
    Seki, Kazuhiro
    Mostafa, Javed
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2007, 4755 : 185 - +
  • [4] Context-driven automatic subgraph creation for literature-based discovery
    Cameron, Delroy
    Kavuluru, Ramakanth
    Rindflesch, Thomas C.
    Sheth, Amit P.
    Thirunarayan, Krishnaprasad
    Bodenreider, Olivier
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 54 : 141 - 157
  • [5] Validating discovery in literature-based discovery - Response
    Pratt, Wanda
    Yetisgen-Yildiz, Meliha
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (04) : 450 - 452
  • [6] Literature-Based Discovery: Beyond the ABCs
    Smalheiser, Neil R.
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2012, 63 (02): : 218 - 224
  • [7] Literature-based discovery by lexical statistics
    Lindsay, RK
    Gordon, MD
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1999, 50 (07): : 574 - 587
  • [8] Literature-based discovery by lexical statistics
    Mental Health Research Institute, University of Michigan Medical School, 205 Zina Pitcher Place, Ann Arbor, MI, United States
    [J]. J. Am. Soc. Inf. Sci, 7 (574-587):
  • [9] A compound correlation model for disjoint literature-based knowledge discovery
    Huang, Shuiqing
    He, Lin
    Yang, Bo
    Zhang, Ming
    [J]. ASLIB PROCEEDINGS, 2012, 64 (04): : 423 - 436
  • [10] Literature-based discovery: New trends and techniques
    Smalheiser, NR
    Palmer, CL
    Swanson, DR
    Srinivasan, P
    Hearst, M
    [J]. ASIST 2003: PROCEEDINGS OF THE 66TH ASIST ANNUAL MEETING, VOL 40, 2003: HUMANIZING INFORMATION TECHNOLOGY: FROM IDEAS TO BITS AND BACK, 2003, 40 : 497 - 497