Risk of bias assessment in preclinical literature using natural language processing

被引:11
|
作者
Wang, Qianying [1 ]
Liao, Jing [1 ]
Lapata, Mirella [2 ]
Macleod, Malcolm [1 ]
机构
[1] Univ Edinburgh, Ctr Clin Brain Sci, 49 Little France Crescent, Edinburgh EH16 4SB, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
automatic assessment; natural language processing; preclinical research synthesis; risk of bias;
D O I
10.1002/jrsm.1533
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing animal experiments with yes/no annotations for five risk of bias items. We implement a series of models including baselines (support vector machine, logistic regression, random forest), neural models (convolutional neural network, recurrent neural network with attention, hierarchical neural network) and models using BERT with two strategies (document chunk pooling and sentence extraction). We tune hyperparameters to obtain the highest F1 scores for each risk of bias item on the validation set and compare evaluation results on the test set to our previous regular expression approach. The F1 scores of best models on test set are 82.0% for random allocation, 81.6% for blinded assessment of outcome, 82.6% for conflict of interests, 91.4% for compliance with animal welfare regulations and 46.6% for reporting animals excluded from analysis. Our models significantly outperform regular expressions for four risk of bias items. For random allocation, blinded assessment of outcome, conflict of interests and animal exclusions, neural models achieve good performance; for animal welfare regulations, BERT model with a sentence extraction strategy works better. Convolutional neural networks are the overall best models. The tool is publicly available which may contribute to the future monitoring of risk of bias reporting for research improvement activities.
引用
收藏
页码:368 / 380
页数:13
相关论文
共 50 条
  • [21] The Gender Gap Tracker: Using Natural Language Processing to measure gender bias in media
    Asr, Fatemeh Torabi
    Mazraeh, Mohammad
    Lopes, Alexandre
    Gautam, Vasundhara
    Gonzales, Junette
    Rao, Prashanth
    Taboada, Maite
    PLOS ONE, 2021, 16 (01):
  • [22] Personality in just a few words: Assessment using natural language processing
    Sikstrom, Sverker
    Valaviciute, Ieva
    Kajonius, Petri
    PERSONALITY AND INDIVIDUAL DIFFERENCES, 2025, 238
  • [23] Development of a natural language processing pipeline for assessment of cardiovascular risk in myeloproliferative neoplasms
    Duminuco, Andrea
    Au Yeung, Joshua
    Vaghela, Raj
    Virdee, Sukhraj
    Woodley, Claire
    Asirvatham, Susan
    Curto-Garcia, Natalia
    Sriskandarajah, Priya
    O'Sullivan, Jennifer
    de Lavallade, Hugues
    Radia, Deepti
    Kordasti, Shahram
    Palumbo, Giuseppe
    Harrison, Claire
    Harrington, Patrick
    HEMASPHERE, 2024, 8 (08):
  • [24] Quantitative Topic Analysis of Materials Science Literature Using Natural Language Processing
    Choi, Jaewoong
    Lee, Byungju
    ACS APPLIED MATERIALS & INTERFACES, 2023, 16 (02) : 1957 - 1968
  • [25] Using Natural Language Processing to Predict Risk in Electronic Health Records
    Duy Van Le
    Montgomery, James
    Kirkby, Kenneth
    Scanlan, Joel
    MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 574 - 578
  • [26] Harmonization of Stroke Risk Prediction Variables Using Natural Language Processing
    Mallya, Pratheek
    Zhao, Juan
    Hong, Chuan
    Henao, Ricardo
    Wojdyla, Daniel
    Schibler, Tony
    Manchanda, Vihaan
    Pencina, Michael
    Hall, Jennifer L.
    STROKE, 2024, 55
  • [27] Identifying Patients with Hypoglycemia Using Natural Language Processing: A Systematic Literature Review
    Zheng, Yaguang
    Dickson, Victoria Vaughan
    Blecker, Saul
    Ng, Jason M.
    Rice, Brynne Campbell
    Shenkar, Liat
    Mortejo, Marie Claire R.
    Johnson, Stephen B.
    NURSING RESEARCH, 2022, 71 (03) : S33 - S33
  • [28] Phishing Email Detection Using Natural Language Processing Techniques: A Literature Survey
    Salloum, Said
    Gaber, Tarek
    Vadera, Sunil
    Shaalan, Khaled
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 19 - 28
  • [29] A Systematic Literature Review on Using Natural Language Processing in Software Requirements Engineering
    Necula, Sabina-Cristiana
    Dumitriu, Florin
    Greavu-Serban, Valerica
    ELECTRONICS, 2024, 13 (11)
  • [30] Analysing quality of textual requirements using Natural Language Processing: A Literature Review
    Kocerka, Jerzy
    Krzeslak, Micha
    Galuszka, Adam
    2018 23RD INTERNATIONAL CONFERENCE ON METHODS & MODELS IN AUTOMATION & ROBOTICS (MMAR), 2018, : 876 - 880