Risk of bias assessment in preclinical literature using natural language processing

Cited by: 11
Authors
Wang, Qianying [1]
Liao, Jing [1]
Lapata, Mirella [2]
Macleod, Malcolm [1]
Affiliations
[1] Univ Edinburgh, Ctr Clin Brain Sci, 49 Little France Crescent, Edinburgh EH16 4SB, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
Keywords
automatic assessment; natural language processing; preclinical research synthesis; risk of bias
DOI
10.1002/jrsm.1533
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing animal experiments with yes/no annotations for five risk of bias items. We implement a series of models including baselines (support vector machine, logistic regression, random forest), neural models (convolutional neural network, recurrent neural network with attention, hierarchical neural network) and models using BERT with two strategies (document chunk pooling and sentence extraction). We tune hyperparameters to obtain the highest F1 score for each risk of bias item on the validation set and compare evaluation results on the test set with our previous regular expression approach. The F1 scores of the best models on the test set are 82.0% for random allocation, 81.6% for blinded assessment of outcome, 82.6% for conflict of interests, 91.4% for compliance with animal welfare regulations and 46.6% for reporting animals excluded from analysis. Our models significantly outperform regular expressions for four risk of bias items. For random allocation, blinded assessment of outcome, conflict of interests and animal exclusions, neural models achieve good performance; for animal welfare regulations, the BERT model with a sentence extraction strategy works better. Convolutional neural networks are the overall best models. The tool is publicly available, which may contribute to the future monitoring of risk of bias reporting for research improvement activities.
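To make the evaluation setup in the abstract concrete, the following is a minimal sketch of scoring a regular-expression baseline against yes/no annotations with F1. Everything in it is an assumption for illustration: the pattern `RANDOM_ALLOCATION`, the helper names `predict_reported` and `f1_score`, and the five toy sentences are hypothetical and are not the expressions, code, or data used by the authors.

```python
import re

# Hypothetical pattern for the "random allocation" item, for illustration only;
# it is NOT the regular expression used in the paper.
RANDOM_ALLOCATION = re.compile(r"\brandom(?:ly|i[sz]ed|i[sz]ation)?\b", re.IGNORECASE)


def predict_reported(text: str) -> bool:
    """Return True if the text appears to report random allocation."""
    return RANDOM_ALLOCATION.search(text) is not None


def f1_score(gold, pred) -> float:
    """F1 for the positive ("yes") class from parallel lists of booleans."""
    tp = sum(1 for g, p in zip(gold, pred) if g and p)
    fp = sum(1 for g, p in zip(gold, pred) if not g and p)
    fn = sum(1 for g, p in zip(gold, pred) if g and not p)
    if tp == 0:
        return 0.0  # avoids division by zero when nothing is correctly found
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy annotated sentences (invented); real inputs would be full-text publications.
documents = [
    ("Animals were randomly assigned to treatment groups.", True),
    ("Mice were randomised to vehicle or drug.", True),
    ("Rats were allocated to groups by the investigator.", False),
    ("Random-effects meta-analysis was used.", False),  # regex false positive
    ("Group allocation used a computer-generated sequence.", True),  # regex miss
]

gold = [label for _, label in documents]
pred = [predict_reported(text) for text, _ in documents]
score = f1_score(gold, pred)
print(f"F1 = {score:.3f}")
```

The two deliberately awkward sentences show why a keyword pattern can both over-match and under-match, which is the kind of gap a learned classifier is intended to close.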
Pages: 368-380
Page count: 13