Automatic classification of experimental models in biomedical literature to support searching for alternative methods to animal experiments

被引:0
|
作者
Neves, Mariana [1 ]
Klippert, Antonina [1 ,2 ]
Knoespel, Fanny [1 ]
Rudeck, Juliane [1 ]
Stolz, Ailine [1 ]
Ban, Zsofia [1 ]
Becker, Markus [1 ]
Diederich, Kai [1 ]
Grune, Barbara [1 ]
Kahnau, Pia [1 ]
Ohnesorge, Nils [1 ]
Pucher, Johannes [1 ]
Schoenfelder, Gilbert [1 ,3 ]
Bert, Bettina [1 ]
Butzke, Daniel [1 ]
机构
[1] German Fed Inst Risk Assessment BfR, German Ctr Protect Lab Anim Bf3R, Berlin, Germany
[2] Nuvisan ICB GmbH, Mullerstr 178, D-13353 Berlin, Germany
[3] Charite Univ Med Berlin, Inst Clin Pharmacol & Toxicol, Charite Pl 1, D-10117 Berlin, Germany
关键词
Alternatives to animal experiments; Corpus annotation; Text classification; Replacement; RECOGNITION;
D O I
10.1186/s13326-023-00292-w
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Current animal protection laws require replacement of animal experiments with alternative methods, whenever such methods are suitable to reach the intended scientific objective. However, searching for alternative methods in the scientific literature is a time-consuming task that requires careful screening of an enormously large number of experimental biomedical publications. The identification of potentially relevant methods, e.g. organ or cell culture models, or computer simulations, can be supported with text mining tools specifically built for this purpose. Such tools are trained (or fine tuned) on relevant data sets labeled by human experts. We developed the GoldHamster corpus, composed of 1,600 PubMed (Medline) articles (titles and abstracts), in which we manually identified the used experimental model according to a set of eight labels, namely: "in vivo", "organs", "primary cells", "immortal cell lines", "invertebrates", "humans", "in silico" and "other" (models). We recruited 13 annotators with expertise in the biomedical domain and assigned each article to two individuals. Four additional rounds of annotation aimed at improving the quality of the annotations with disagreements in the first round. Furthermore, we conducted various machine learning experiments based on supervised learning to evaluate the corpus for our classification task. We obtained more than 7,000 document-level annotations for the above labels. After the first round of annotation, the inter-annotator agreement (kappa coefficient) varied among labels, and ranged from 0.42 (for "others") to 0.82 (for "invertebrates"), with an overall score of 0.62. All disagreements were resolved in the subsequent rounds of annotation. The best-performing machine learning experiment used the PubMedBERT pre-trained model with fine-tuning to our corpus, which gained an overall f-score of 0.83. We obtained a corpus with high agreement for all labels, and our evaluation demonstrated that our corpus is suitable for training reliable predictive models for automatic classification of biomedical literature according to the used experimental models. Our SMAFIRA - "Smart feature-based interactive" - search tool (https://smafira.bf3r.de) will employ this classifier for supporting the retrieval of alternative methods to animal experiments. The corpus is available for download (https://doi.org/10.5281/zenodo.7152295), as well as the source code (https://github.com/mariananeves/goldhamster) and the model (https://huggingface.co/SMAFIRA/goldhamster).
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Alternative methods to animal experiments. What can they afford in the safety testing of chemical substances under REACH?
    Lilienblum, W.
    BUNDESGESUNDHEITSBLATT-GESUNDHEITSFORSCHUNG-GESUNDHEITSSCHUTZ, 2008, 51 (12) : 1434 - 1443
  • [42] Experimental Skin-Wound Methods and Healing-Assessment in Animal Models: A Review
    Ozaydin, Isa
    Aydin, Ugur
    PAKISTAN VETERINARY JOURNAL, 2023, 43 (03) : 396 - 404
  • [43] ECG features and methods for automatic classification of ventricular premature and ischemic heartbeats: A comprehensive experimental study
    Lucie Maršánová
    Marina Ronzhina
    Radovan Smíšek
    Martin Vítek
    Andrea Němcová
    Lukas Smital
    Marie Nováková
    Scientific Reports, 7
  • [44] ECG features and methods for automatic classification of ventricular premature and ischemic heartbeats: A comprehensive experimental study
    Marsanova, Lucie
    Ronzhina, Marina
    Smisek, Radovan
    Vitek, Martin
    Nemcova, Andrea
    Smital, Lukas
    Novakova, Marie
    SCIENTIFIC REPORTS, 2017, 7
  • [45] Using classification models for the generation of disease-specific medications from biomedical literature and clinical data repository
    Wang, Liqin
    Haug, Peter J.
    Del Fiol, Guilherme
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 69 : 259 - 266
  • [46] VETERINARY PROTECTION AND INDUSTRY - ANIMAL-EXPERIMENTAL RESTRICTIONS - COMMON FOUNDATION FOR THE ADVANCEMENT OF ALTERNATIVE METHODS FOUNDED
    不详
    PRAKTISCHE TIERARZT, 1986, 67 (05): : 425 - 425
  • [47] Deep Learning to improve Experimental Sensitivity and Generative Models for Monte Carlo simulations for searching for New Physics in LHC experiments
    Salt, Jose
    Balanza, Raul
    Garcia, Azael
    Ander Gomez, Jon
    Gonzalez de la Hoz, Santiago
    Lozano, Julio
    Ruiz de Austri, Roberto
    Villaplana, Miguel
    26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS, CHEP 2023, 2024, 295
  • [48] Evaluating the osseointegration of nanostructured titanium implants in animal models: Current experimental methods and perspectives (Review)
    Babuska, Vaclav
    Moztarzadeh, Omid
    Kubikova, Tereza
    Moztarzadeh, Amin
    Hrusak, Daniel
    Tonar, Zbynek
    BIOINTERPHASES, 2016, 11 (03)
  • [49] Comparative Analysis of Experimental Methods to Quantify Animal Activity in Caenorhabditis elegans Models of Mitochondrial Disease
    Lavorato, Manuela
    Mathew, Neal D.
    Shah, Nina
    Nakamaru-Ogiso, Eiko
    Falk, Marni J.
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2021, (170):
  • [50] Methods for Automatic Image-Based Classification of Winged Insects Using Computational Techniques A Systematic Literature Review
    Rebelo, Allan Rodrigues
    Garcia Fagundes, Joao Marcos
    Digiampietri, Luciano Antonio
    Biscaro, Helton Hideraldo
    PROCEEDINGS OF 16TH BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS ON DIGITAL TRANSFORMATION AND INNOVATION, SBSI 2020, 2020,