Mining impactful discoveries from the biomedical literature

被引:0
|
作者
Moreau, Erwan [1 ,2 ]
Hardiman, Orla [3 ]
Heverin, Mark [3 ]
O'Sullivan, Declan [1 ,2 ]
机构
[1] Trinity Coll Dublin, Adapt Ctr, Dublin, Ireland
[2] Trinity Coll Dublin, Sch Comp Sci & Stat, Dublin, Ireland
[3] Trinity Coll Dublin, Sch Med, Dublin, Ireland
来源
BMC BIOINFORMATICS | 2024年 / 25卷 / 01期
关键词
Literature-based discovery; Evaluation; Benchmark dataset; Time-sliced method; KNOWLEDGE; MEDLINE; MODELS;
D O I
10.1186/s12859-024-05881-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundLiterature-based discovery (LBD) aims to help researchers to identify relations between concepts which are worthy of further investigation by text-mining the biomedical literature. While the LBD literature is rich and the field is considered mature, standard practice in the evaluation of LBD methods is methodologically poor and has not progressed on par with the domain. The lack of properly designed and decent-sized benchmark dataset hinders the progress of the field and its development into applications usable by biomedical experts.ResultsThis work presents a method for mining past discoveries from the biomedical literature. It leverages the impact made by a discovery, using descriptive statistics to detect surges in the prevalence of a relation across time. The validity of the method is tested against a baseline representing the state-of-the-art "time-sliced" method.ConclusionsThis method allows the collection of a large amount of time-stamped discoveries. These can be used for LBD evaluation, alleviating the long-standing issue of inadequate evaluation. It might also pave the way for more fine-grained LBD methods, which could exploit the diversity of these past discoveries to train supervised models. Finally the dataset (or some future version of it inspired by our method) could be used as a methodological tool for systematic reviews. We provide an online exploration tool in this perspective, available at https://brainmend.adaptcentre.ie/.
引用
收藏
页数:20
相关论文
共 50 条
  • [11] Mining Faces from Biomedical Literature using Deep Learning
    Dawson, Mitchell
    Zisserman, Andrew
    Nellaker, Christoffer
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 562 - 567
  • [12] Impactful COVID-19 discoveries from China are neglected in the media
    Yinxian Zhang
    Scientometrics, 2023, 128 (8) : 4523 - 4539
  • [13] Impactful COVID-19 discoveries from China are neglected in the media
    Zhang, Yinxian
    SCIENTOMETRICS, 2023, 128 (08) : 4523 - 4539
  • [14] Recent advances in biomedical literature mining
    Zhao, Sendong
    Su, Chang
    Lu, Zhiyong
    Wang, Fei
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [15] A statistical framework for biomedical literature mining
    Chung, Dongjun
    Lawson, Andrew
    Zheng, W. Jim
    STATISTICS IN MEDICINE, 2017, 36 (22) : 3461 - 3474
  • [16] Mining biomarker information in biomedical literature
    Erfan Younesi
    Luca Toldo
    Bernd Müller
    Christoph M Friedrich
    Natalia Novac
    Alexander Scheer
    Martin Hofmann-Apitius
    Juliane Fluck
    BMC Medical Informatics and Decision Making, 12
  • [17] Mining biomarker information in biomedical literature
    Younesi, Erfan
    Toldo, Luca
    Mueller, Bernd
    Friedrich, Christoph M.
    Novac, Natalia
    Scheer, Alexander
    Hofmann-Apitius, Martin
    Fluck, Juliane
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [18] Biomedical discoveries
    不详
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1998, 280 (15): : 1298 - 1298
  • [19] ON BIOMEDICAL DISCOVERIES
    MENDELSOHN, E
    SWAZEY, JP
    REISER, SJ
    SCIENCE, 1966, 153 (3741) : 1194 - +
  • [20] Mining protein interaction from biomedical literature with relation kernel method
    Eom, Jae-Hong
    Zhang, Byoung Tak
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 642 - 647