Exploring relation types for literature-based discovery

被引:21
|
作者
Preiss, Judita [1 ]
Stevenson, Mark [1 ]
Gaizauskas, Robert [1 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
literature based discovery; text mining; knowledge discovery; natural language processing; BIOMEDICAL LITERATURE; MEDICAL LITERATURES; FISH OIL; CONNECTIONS; MAGNESIUM; MIGRAINE; IMPLICIT; RAYNAUDS;
D O I
10.1093/jamia/ocv002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Literature-based discovery (LBD) aims to identify "hidden knowledge" in the medical literature by: (1) analyzing documents to identify pairs of explicitly related concepts (terms), then (2) hypothesizing novel relations between pairs of unrelated concepts that are implicitly related via a shared concept to which both are explicitly related. Many LBD approaches use simple techniques to identify semantically weak relations between concepts, for example, document co-occurrence. These generate huge numbers of hypotheses, difficult for humans to assess. More complex techniques rely on linguistic analysis, for example, shallow parsing, to identify semantically stronger relations. Such approaches generate fewer hypotheses, but may miss hidden knowledge. The authors investigate this trade-off in detail, comparing techniques for identifying related concepts to discover which are most suitable for LBD. Materials and methods A generic LBD system that can utilize a range of relation types was developed. Experiments were carried out comparing a number of techniques for identifying relations. Two approaches were used for evaluation: replication of existing discoveries and the "time slicing" approach.(1) Results Previous LBD discoveries could be replicated using relations based either on document co-occurrence or linguistic analysis. Using relations based on linguistic analysis generated many fewer hypotheses, but a significantly greater proportion of them were candidates for hidden knowledge. Discussion and Conclusion The use of linguistic analysis-based relations improves accuracy of LBD without overly damaging coverage. LBD systems often generate huge numbers of hypotheses, which are infeasible to manually review. Improving their accuracy has the potential to make these systems significantly more usable.
引用
收藏
页码:987 / 992
页数:6
相关论文
共 50 条
  • [1] Literature-Based Discovery
    Ruch, Patrick
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (07): : 1506 - 1508
  • [2] A Directional Recognition Algorithm of Semantic Relation for Literature-Based Discovery
    Liu, Xiaoyong
    Fu, Hui
    Jiang, Chaoyong
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2016, PT II, 2016, 9713 : 281 - 288
  • [3] Validating discovery in literature-based discovery
    Kostoff, Ronald N.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (04) : 448 - 450
  • [4] Exploring a deep learning neural architecture for closed Literature-based discovery
    Cuffy, Clint
    McInnes, Bridget T.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 143
  • [5] Exploring Self-Supervised Graph Learning in Literature-Based Discovery
    Ding, Juncheng
    Jin, Wei
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 53 - 62
  • [6] Validating discovery in literature-based discovery - Response
    Pratt, Wanda
    Yetisgen-Yildiz, Meliha
    JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (04) : 450 - 452
  • [7] Literature-based discovery by lexical statistics
    Lindsay, RK
    Gordon, MD
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1999, 50 (07): : 574 - 587
  • [8] Literature-Based Discovery: Beyond the ABCs
    Smalheiser, Neil R.
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2012, 63 (02): : 218 - 224
  • [9] Literature-based discovery by lexical statistics
    Mental Health Research Institute, University of Michigan Medical School, 205 Zina Pitcher Place, Ann Arbor, MI, United States
    J. Am. Soc. Inf. Sci., 7 (574-587):
  • [10] Literature-based discovery: New trends and techniques
    Smalheiser, NR
    Palmer, CL
    Swanson, DR
    Srinivasan, P
    Hearst, M
    ASIST 2003: PROCEEDINGS OF THE 66TH ASIST ANNUAL MEETING, VOL 40, 2003: HUMANIZING INFORMATION TECHNOLOGY: FROM IDEAS TO BITS AND BACK, 2003, 40 : 497 - 497