Development and validation of a machine-learning algorithm to predict the relevance of scientific articles within the field of teratology

被引:2
|
作者
Habets, Philippe C. [1 ]
van IJzendoorn, David G. P. [1 ]
Vinkers, Christiaan H. [1 ]
Harmark, Linda [2 ]
de Vries, Loes C. [2 ]
Otte, Willem M. [1 ,3 ]
机构
[1] DeepDoc Acad, Rotterdam, Netherlands
[2] Netherlands Pharmacovigilance Ctr Lareb, sHertogenbosch, Netherlands
[3] Univ Med Ctr UMC Utrecht, Dept Child Neurol, UMC Utrecht Brain Ctr, Utrecht, Netherlands
关键词
Pharmacovigilance; TIS; Deep learning; Literature screening;
D O I
10.1016/j.reprotox.2022.09.001
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Dutch Teratology Information Service Lareb counsels healthcare professionals and patients about medication use during pregnancy and lactation. To keep the evidence up to date, employees perform a standardized weekly PubMed query where relevant literature is identified manually. We aimed to develop an accurate machinelearning algorithm to predict the relevance of PubMed entries, thereby reducing the labor-intensive task of manually screening the articles. We fine-tuned a pre-trained natural language processing transformer model to identify relevant entries. We split 15,540 labeled entries into case-control-balanced train, validation, and test datasets. Additionally, we externally validated the model prospectively with 1288 labeled entries obtained from weekly queries after developing the model. This dataset was also independently labeled by a team of six experienced human raters to evaluate our model's performance. The validation of our machine learning model on the retrospectively collected outheld dataset obtained an area under the sensitivity-versus-specificity curve of 89.3 % (CI: 88.2- 90.4). In the prospective external validation of the model, our model classified relevant literature with a sensitivity versus specificity curve area of 87.4 % (CI: 85.0-89.8). Our model achieved a higher sensitivity than the human raters' team without sacrificing too much specificity. The team of human raters showed weak to moderate levels of agreement in their article classifications (kappa range 0.40-0.64). The human selection of the latest relevant literature is indispensable to keep the teratology information up to date. We show that automatic preselection of relevant abstracts using machine learning is possible without sacrificing the selection performance.
引用
收藏
页码:150 / 154
页数:5
相关论文
共 50 条
  • [1] Development and validation of a machine-learning algorithm to predict the relevance of scientific articles in teratology
    de Vriesa, Loes C.
    Habets, Philippe C.
    van IJzendoorn, David G. P.
    Vinkers, Christiaan H.
    Otte, Willem M.
    Harmark, Linda
    [J]. NEUROTOXICOLOGY AND TERATOLOGY, 2022, 92
  • [2] Development and Validation of a Machine-Learning Model to Predict Early Recurrence of Intrahepatic Cholangiocarcinoma
    Laura Alaimo
    Henrique A. Lima
    Zorays Moazzam
    Yutaka Endo
    Jason Yang
    Andrea Ruzzenente
    Alfredo Guglielmi
    Luca Aldrighetti
    Matthew Weiss
    Todd W. Bauer
    Sorin Alexandrescu
    George A. Poultsides
    Shishir K. Maithel
    Hugo P. Marques
    Guillaume Martel
    Carlo Pulitano
    Feng Shen
    François Cauchy
    Bas Groot Koerkamp
    Itaru Endo
    Minoru Kitago
    Timothy M. Pawlik
    [J]. Annals of Surgical Oncology, 2023, 30 : 5406 - 5415
  • [3] Development and validation of echocardiography-based machine-learning models to predict mortality
    Valsaraj, Akshay
    Kalmady, Sunil Vasu
    Sharma, Vaibhav
    Frost, Matthew
    Sun, Weijie
    Sepehrvand, Nariman
    Ong, Marcus
    Equilbec, Cyril
    Dyck, Jason R. B.
    Anderson, Todd
    Becher, Harald
    Weeks, Sarah
    Tromp, Jasper
    Hung, Chung-Lieh
    Ezekowitz, Justin A.
    Kaul, Padma
    [J]. EBIOMEDICINE, 2023, 90
  • [4] Development and validation of an imageless machine-learning algorithm for the initial screening of prostate cancer
    Martelin, Nicolas
    De Witt, Brian
    Chen, Benjamin
    Eschwege, Pascal
    [J]. PROSTATE, 2024, 84 (09): : 842 - 849
  • [5] DEVELOPMENT AND VALIDATION OF A NOVEL MACHINE LEARNING ALGORITHM TO PREDICT SEPSIS READMISSIONS
    Wardi, Gabriel
    Shashikumar, Supreeth
    Allen, Thomas
    Nemati, Shamim
    [J]. CRITICAL CARE MEDICINE, 2021, 49 (01) : 620 - 620
  • [6] Artificial Intelligence Screening of Medical School Applications: Development and Validation of a Machine-Learning Algorithm
    Triola, Marc M.
    Reinstein, Ilan
    Marin, Marina
    Gillespie, Colleen
    Abramson, Steven
    Grossman, Robert I.
    Rivera Jr, Rafael
    [J]. ACADEMIC MEDICINE, 2023, 98 (09) : 1036 - 1043
  • [7] Development and Validation of a Machine-Learning Model to Predict POD24 Risk of Follicular Lymphoma
    Zha, Jie
    Chen, Qinwei
    Zhang, Wei
    Jing, Hongmei
    Ye, Jingjing
    Yu, Haifeng
    Yi, Shuhua
    Li, Caixia
    Zheng, Zhong
    Xu, Wei
    Li, Zhifeng
    Ping, Lingyan
    He, Xiaohua
    Zhang, Liling
    Xie, Ying
    Chen, Feili
    Sun, Xiuhua
    Su, Liping
    Zhang, Huilai
    Lin, Zhijuan
    Yang, Haiyan
    Zhao, Weili
    Qiu, Lugui
    Li, Zhiming
    Song, Yuqin
    Xu, Bing
    [J]. BLOOD, 2023, 142
  • [8] DEVELOPMENT AND VALIDATION OF A MACHINE-LEARNING MODEL TO PREDICT ADEQUATE BOWEL PREPARATION IN A SAMPLE OF US VETERANS
    Kurlander, Jacob E.
    Saini, Sameer D.
    Lipson, Rachel
    Menees, Stacy B.
    Sultan, Shahnaz
    Kokaly, Alex N.
    Waljee, Akbar K.
    [J]. GASTROENTEROLOGY, 2019, 156 (06) : S263 - S264
  • [9] DEVELOPMENT AND VALIDATION OF A MACHINE LEARNING ALGORITHM TO PREDICT BACTEREMIA AND FUNGEMIA IN HOSPITALIZED PATIENTS
    Bhavani, Sivasubramanium
    Lonjers, Zachary
    Carey, Kyle
    Afshar, Majid
    Gilbert, Emily
    Shah, Nirav
    Huang, Elbert
    Churpek, Matthew
    [J]. JOURNAL OF INVESTIGATIVE MEDICINE, 2020, 68 (05) : 1094 - 1095
  • [10] MySurgeryRisk: Development and Validation of a Machine-learning Risk Algorithm for Major Complications and Death After Surgery
    Bihorac, Azra
    Ozrazgat-Baslanti, Tezcan
    Ebadi, Ashkan
    Motaei, Amir
    Madkour, Mohcine
    Pardalos, Panagote M.
    Lipori, Gloria
    Hogan, William R.
    Efron, Philip A.
    Moore, Frederick
    Moldawer, Lyle L.
    Wang, Daisy Zhe
    Hobson, Charles E.
    Rashidi, Parisi
    Li, Xiaolin
    Momcilovic, Petar
    [J]. ANNALS OF SURGERY, 2019, 269 (04) : 652 - 662