Mining impactful discoveries from the biomedical literature

被引:0
|
作者
Moreau, Erwan [1 ,2 ]
Hardiman, Orla [3 ]
Heverin, Mark [3 ]
O'Sullivan, Declan [1 ,2 ]
机构
[1] Trinity Coll Dublin, Adapt Ctr, Dublin, Ireland
[2] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
[3] Trinity Coll Dublin, Sch Med, Dublin, Ireland
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
Literature-based discovery; Evaluation; Benchmark dataset; Time-sliced method; KNOWLEDGE; MEDLINE; MODELS;
D O I
10.1186/s12859-024-05881-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundLiterature-based discovery (LBD) aims to help researchers to identify relations between concepts which are worthy of further investigation by text-mining the biomedical literature. While the LBD literature is rich and the field is considered mature, standard practice in the evaluation of LBD methods is methodologically poor and has not progressed on par with the domain. The lack of properly designed and decent-sized benchmark dataset hinders the progress of the field and its development into applications usable by biomedical experts.ResultsThis work presents a method for mining past discoveries from the biomedical literature. It leverages the impact made by a discovery, using descriptive statistics to detect surges in the prevalence of a relation across time. The validity of the method is tested against a baseline representing the state-of-the-art "time-sliced" method.ConclusionsThis method allows the collection of a large amount of time-stamped discoveries. These can be used for LBD evaluation, alleviating the long-standing issue of inadequate evaluation. It might also pave the way for more fine-grained LBD methods, which could exploit the diversity of these past discoveries to train supervised models. Finally the dataset (or some future version of it inspired by our method) could be used as a methodological tool for systematic reviews. We provide an online exploration tool in this perspective, available at https://brainmend.adaptcentre.ie/.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] TermInformer: unsupervised term mining and analysis in biomedical literature
    Tiwari, Prayag
    Uprety, Sagar
    Dehdashti, Shahram
    Hossain, M. Shamim
    NEURAL COMPUTING & APPLICATIONS, 2020,
  • [42] Mining Biomedical Literature: An Open Source and Modular Approach
    Almeida, Hayda
    Jean-Louis, Ludovic
    Meurs, Marie-Jean
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2016, 2016, 9673 : 168 - 179
  • [43] Use of figures in literature mining for biomedical digital libraries
    Chen, Nawei
    Shatkay, Hagit
    Blostein, Dorothea
    SECOND INTERNATIONAL CONFERENCE ON DOCUMENT IMAGE ANALYSIS FOR LIBRARIES, PROCEEDINGS, 2006, : 180 - +
  • [44] Text Mining for Discovering Implicit Relationships in Biomedical Literature
    Petric, Ingrid
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2010, 34 (02): : 261 - 262
  • [45] MBA: a literature mining system for extracting biomedical abbreviations
    Yun Xu
    ZhiHao Wang
    YiMing Lei
    YuZhong Zhao
    Yu Xue
    BMC Bioinformatics, 10
  • [46] Getting to the (c)ore of knowledge: mining biomedical literature
    de Bruijn, B
    Martin, J
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2002, 67 (1-3) : 7 - 18
  • [47] MBA: a literature mining system for extracting biomedical abbreviations
    Xu, Yun
    Wang, ZhiHao
    Lei, YiMing
    Zhao, YuZhong
    Xue, Yu
    BMC BIOINFORMATICS, 2009, 10
  • [48] Large-Scale Text Mining of Biomedical Literature
    Ginter, Filip
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2013, (116): : 43 - 44
  • [49] Dependency-Based Relation Mining for Biomedical Literature
    Rinaldi, Fabio
    Schneider, Gerold
    Kaljurand, Kaarel
    Hess, Michael
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2860 - 2864
  • [50] Mining and modeling linkage information from citation context for improving biomedical literature retrieval
    Yin, Xiaoshi
    Huang, Jimmy Xiangji
    Li, Zhoujun
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (01) : 53 - 67