Mining impactful discoveries from the biomedical literature

被引:0
|
作者
Moreau, Erwan [1 ,2 ]
Hardiman, Orla [3 ]
Heverin, Mark [3 ]
O'Sullivan, Declan [1 ,2 ]
机构
[1] Trinity Coll Dublin, Adapt Ctr, Dublin, Ireland
[2] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
[3] Trinity Coll Dublin, Sch Med, Dublin, Ireland
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
Literature-based discovery; Evaluation; Benchmark dataset; Time-sliced method; KNOWLEDGE; MEDLINE; MODELS;
D O I
10.1186/s12859-024-05881-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundLiterature-based discovery (LBD) aims to help researchers to identify relations between concepts which are worthy of further investigation by text-mining the biomedical literature. While the LBD literature is rich and the field is considered mature, standard practice in the evaluation of LBD methods is methodologically poor and has not progressed on par with the domain. The lack of properly designed and decent-sized benchmark dataset hinders the progress of the field and its development into applications usable by biomedical experts.ResultsThis work presents a method for mining past discoveries from the biomedical literature. It leverages the impact made by a discovery, using descriptive statistics to detect surges in the prevalence of a relation across time. The validity of the method is tested against a baseline representing the state-of-the-art "time-sliced" method.ConclusionsThis method allows the collection of a large amount of time-stamped discoveries. These can be used for LBD evaluation, alleviating the long-standing issue of inadequate evaluation. It might also pave the way for more fine-grained LBD methods, which could exploit the diversity of these past discoveries to train supervised models. Finally the dataset (or some future version of it inspired by our method) could be used as a methodological tool for systematic reviews. We provide an online exploration tool in this perspective, available at https://brainmend.adaptcentre.ie/.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Prototypical case mining from biomedical literature for bootstrapping a case base
    Isabelle Bichindaritz
    Applied Intelligence, 2008, 28 : 222 - 237
  • [22] An Unsupervised Text Mining Method for Relation Extraction from Biomedical Literature
    Quan, Changqin
    Wang, Meng
    Ren, Fuji
    PLOS ONE, 2014, 9 (07):
  • [23] Prototypical case mining from biomedical literature for bootstrapping a case base
    Bichindaritz, Isabelle
    APPLIED INTELLIGENCE, 2008, 28 (03) : 222 - 237
  • [24] Biomedical Text Mining for Concept Identification from Traditional Medicine Literature
    Javed, Zeeshan
    Afzal, Hammad
    2014 INTERNATIONAL CONFERENCE ON OPEN SOURCE SYSTEMS AND TECHNOLOGIES (ICOSST), 2014, : 206 - 211
  • [25] Mining protein interactions from biomedical literature using semantic similarity
    Schmitt, Charles
    Cox, Steven
    Christopherson, Laura
    Scott, Erick
    Firrincieli, Stephen
    Baker, Nancy
    Tutubalina, Elena
    Tropsha, Alexander
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [26] Web-Based Biomedical Literature Mining
    安建福
    薛惠平
    陈瑛
    吴建国
    章鲁
    Journal of Shanghai Jiaotong University(Science), 2012, 17 (04) : 494 - 499
  • [27] Mining the biomedical literature in the genomic era: An overview
    Shatkay, H
    Feldman, R
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) : 821 - 855
  • [28] Terminology-driven mining of biomedical literature
    Nenadic, G
    Spasic, I
    Ananiadou, S
    BIOINFORMATICS, 2003, 19 (08) : 938 - 943
  • [29] Web-based biomedical literature mining
    Jian-Fu, An
    Hui-Ping, Xue
    Ying, Chen
    Jian-Guo, Wu
    Lu, Zhang
    Journal of Shanghai Jiaotong University (Science), 2012, 17 (04) : 494 - 499
  • [30] Mining generalized association rules on biomedical literature
    Berardi, M
    Lapi, M
    Leo, P
    Loglisci, C
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 500 - 509