Risk of bias assessment in preclinical literature using natural language processing

被引:11
|
作者
Wang, Qianying [1 ]
Liao, Jing [1 ]
Lapata, Mirella [2 ]
Macleod, Malcolm [1 ]
机构
[1] Univ Edinburgh, Ctr Clin Brain Sci, 49 Little France Crescent, Edinburgh EH16 4SB, Midlothian, Scotland
[2] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
关键词
automatic assessment; natural language processing; preclinical research synthesis; risk of bias;
D O I
10.1002/jrsm.1533
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We sought to apply natural language processing to the task of automatic risk of bias assessment in preclinical literature, which could speed the process of systematic review, provide information to guide research improvement activity, and support translation from preclinical to clinical research. We use 7840 full-text publications describing animal experiments with yes/no annotations for five risk of bias items. We implement a series of models including baselines (support vector machine, logistic regression, random forest), neural models (convolutional neural network, recurrent neural network with attention, hierarchical neural network) and models using BERT with two strategies (document chunk pooling and sentence extraction). We tune hyperparameters to obtain the highest F1 scores for each risk of bias item on the validation set and compare evaluation results on the test set to our previous regular expression approach. The F1 scores of best models on test set are 82.0% for random allocation, 81.6% for blinded assessment of outcome, 82.6% for conflict of interests, 91.4% for compliance with animal welfare regulations and 46.6% for reporting animals excluded from analysis. Our models significantly outperform regular expressions for four risk of bias items. For random allocation, blinded assessment of outcome, conflict of interests and animal exclusions, neural models achieve good performance; for animal welfare regulations, BERT model with a sentence extraction strategy works better. Convolutional neural networks are the overall best models. The tool is publicly available which may contribute to the future monitoring of risk of bias reporting for research improvement activities.
引用
收藏
页码:368 / 380
页数:13
相关论文
共 50 条
  • [1] Mitigating Gender Bias in Natural Language Processing: Literature Review
    Sun, Tony
    Gaut, Andrew
    Tang, Shirlyn
    Huang, Yuxin
    ElSherief, Mai
    Zhao, Jieyu
    Mirza, Diba
    Belding, Elizabeth
    Chang, Kai-Wei
    Wang, William Yang
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1630 - 1640
  • [2] Textual data transformations using natural language processing for risk assessment
    Kamil, Mohammad Zaid
    Taleb-Berrouane, Mohammed
    Khan, Faisal
    Amyotte, Paul
    Ahmed, Salim
    RISK ANALYSIS, 2023, 43 (10) : 2033 - 2052
  • [3] Development of a Fall Risk Assessment Dashboard using Natural Language Processing
    Burningham, Z.
    Leng, J.
    Bell, J.
    Callaway-Lane, C.
    Wingard, S.
    Johnson, M.
    Ganz, D. A.
    Douglas, J.
    Kramer, J.
    JOURNAL OF THE AMERICAN GERIATRICS SOCIETY, 2024, 72 : S180 - S180
  • [4] Using Clinical Notes and Natural Language Processing for Automated HIV Risk Assessment
    Feller, Daniel J.
    Zucker, Jason
    Yin, Michael T.
    Gordon, Peter
    Elhadad, Noemie
    JAIDS-JOURNAL OF ACQUIRED IMMUNE DEFICIENCY SYNDROMES, 2018, 77 (02) : 160 - 166
  • [5] Toward Bias Analysis Using Tweets and Natural Language Processing
    Tankard, Earl, Jr.
    Flowers, Christopher
    Li, Jiang
    Rawat, Danda B.
    2021 IEEE 18TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2021,
  • [6] Five sources of bias in natural language processing
    Hovy, Dirk
    Prabhumoye, Shrimai
    LANGUAGE AND LINGUISTICS COMPASS, 2021, 15 (08):
  • [7] Screening for Depression Using Natural Language Processing:Literature Review
    Teferra, Bazen Gashaw
    Rueda, Alice
    Pang, Hilary
    Valenzano, Richard
    Samavi, Reza
    Krishnan, Sridhar
    Bhat, Venkat
    INTERACTIVE JOURNAL OF MEDICAL RESEARCH, 2024, 13
  • [8] Retrieving reproductive biology literature using natural language processing
    Farhan, R
    Aplin, JD
    Attwood, TK
    Wood, MM
    Sibley, CP
    PLACENTA, 2005, 26 (8-9) : A22 - A22
  • [9] Automatic Assessment of Mathematical Creativity using Natural Language Processing
    Marrone, Rebecca
    Cropley, David H.
    Wang, Z.
    CREATIVITY RESEARCH JOURNAL, 2023, 35 (04) : 661 - 676
  • [10] Natural Language Processing Risk Assessment Application Developed for Marble Quarries
    Eker, Hasan
    APPLIED SCIENCES-BASEL, 2024, 14 (19):