Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine

Cited by: 33
Authors
Cohen, Aaron M. [1]
Smalheiser, Neil R. [2]
McDonagh, Marian S. [1]
Yu, Clement [3]
Adams, Clive E. [4]
Davis, John M. [2]
Yu, Philip S. [3]
Affiliations
[1] Oregon Hlth & Sci Univ, Dept Med Informat & Clin Epidemiol, Portland, OR 97239 USA
[2] Univ Illinois, Dept Psychiat, Chicago, IL 60612 USA
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60612 USA
[4] Univ Nottingham, Div Psychiat, Nottingham NG7 2RD, England
Funding
National Institutes of Health (USA)
Keywords
Support Vector Machines; Natural Language Processing; Randomized Controlled Trials as Topic; Evidence-Based Medicine; Systematic Reviews; Information Retrieval; SYSTEMATIC REVIEWS; RETRIEVAL; WORKLOAD; MEDLINE; UPDATE;
DOI
10.1093/jamia/ocu025
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Objective: For many literature review tasks, including systematic review (SR) and other aspects of evidence-based medicine, it is important to know whether an article describes a randomized controlled trial (RCT). Current manual annotation is not complete or flexible enough for the SR process. In this work, highly accurate machine learning predictive models were built that include confidence predictions of whether an article is an RCT.
Materials and Methods: The LibSVM classifier was used with forward selection of potential feature sets on a large human-related subset of MEDLINE to create a classification model requiring only the citation, abstract, and MeSH terms for each article.
Results: The model achieved an area under the receiver operating characteristic curve of 0.973 and mean squared error of 0.013 on the held-out year 2011 data. Accurate confidence estimates were confirmed on a manually reviewed set of test articles. A second model not requiring MeSH terms was also created, and performs almost as well.
Discussion: Both models accurately rank and predict article RCT confidence. Using the model and the manually reviewed samples, it is estimated that about 8000 (3%) additional RCTs can be identified in MEDLINE, and that 5% of articles tagged as RCTs in MEDLINE may not be identified.
Conclusion: Retagging human-related studies with a continuously valued RCT confidence is potentially more useful for article ranking and review than a simple yes/no prediction. The automated RCT tagging tool should offer significant savings of time and effort during the process of writing SRs, and is a key component of a multistep text mining pipeline that we are building to streamline SR workflow. In addition, the model may be useful for identifying errors in MEDLINE publication types.
The RCT confidence predictions described here have been made available to users as a web service with a user query form front end at: http://arrowsmith.psych.uic.edu/cgi-bin/arrowsmith_uic/RCT_Tagger.cgi.
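The two evaluation measures named in the Results (area under the ROC curve and mean squared error of the confidence predictions) can be illustrated with a minimal sketch. This is not the authors' code: the function names `roc_auc` and `mean_squared_error` and the toy labels and confidence values are assumptions made up for illustration, and the AUC is computed by the simple pairwise definition rather than by any particular library.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs in which the
    positive article receives the higher score; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def mean_squared_error(labels, scores):
    """Mean squared difference between predicted confidence and the 0/1 label."""
    return sum((s - y) ** 2 for y, s in zip(labels, scores)) / len(labels)


# Hypothetical toy data, NOT taken from the paper: 1 = RCT, 0 = not an RCT.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
confidence = [0.95, 0.80, 0.40, 0.60, 0.10, 0.05, 0.02, 0.01]

auc = roc_auc(labels, confidence)
mse = mean_squared_error(labels, confidence)

# Ranking articles by confidence, highest first, as a reviewer might read them.
ranked = sorted(zip(confidence, labels), reverse=True)

print(f"AUC = {auc:.3f}, MSE = {mse:.4f}")
```

AUC only measures how well the scores rank RCTs above non-RCTs, while the mean squared error also penalizes confidence values that are far from the true 0/1 label, which is why the abstract reports both.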
Pages: 707-717
Page count: 11