Text similarity: an alternative way to search MEDLINE

Cited by: 60
Authors
Lewis, James [1 ]
Ossowski, Stephan [1 ]
Hicks, Justin [1 ]
Errami, Mounir [1 ]
Garner, Harold R. [1 ]
Affiliation
[1] Univ Texas, SW Med Ctr, Eugene McDermott Ctr Human Growth & Dev, Div Translat Res, Dallas, TX 75390 USA
DOI
10.1093/bioinformatics/btl388
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: The most widely used literature search techniques, such as those offered by NCBI's PubMed system, require significant effort from the searcher, and inexperienced searchers use these systems less effectively than experienced ones. Improved literature search engines can save researchers time and effort by making it easier to locate the most important and relevant literature.
Results: We have created and optimized a new hybrid search system for Medline that takes natural text as input and delivers results with high precision and recall. The system combines a fast, low-sensitivity, weighted keyword-based first-pass algorithm, which casts a wide net to gather an initial set of literature, with a unique sentence-alignment-based similarity algorithm that rank-orders those results. The resulting system is sensitive, fast and easy to use. Several text similarity search algorithms, both standard and novel, were implemented and tested to determine which obtained the best results in information retrieval exercises.
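The two-stage design described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the keyword weighting scheme or the alignment algorithm, so the sketch assumes IDF-style term weights for the first pass and uses Python's `difflib.SequenceMatcher` on word tokens as a stand-in for sentence alignment. All function names (`first_pass`, `alignment_score`, `search`) and the naive sentence splitter are hypothetical.

```python
import math
import re
from collections import Counter
from difflib import SequenceMatcher


def tokenize(text):
    """Lowercase a string and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def split_sentences(text):
    """Naive splitter on ., ! and ? (an assumption, not a real segmenter)."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def idf_weights(docs):
    """Assumed weighting: inverse document frequency, so rare terms count more."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    n = len(docs)
    return {term: math.log((n + 1) / (count + 1)) + 1.0 for term, count in df.items()}


def first_pass(query, docs, weights, top_k):
    """Cheap, wide-net pass: sum the weights of terms shared with the query."""
    q_terms = set(tokenize(query))
    scored = []
    for idx, doc in enumerate(docs):
        shared = q_terms & set(tokenize(doc))
        score = sum(weights.get(t, 0.0) for t in shared)
        if score > 0:
            scored.append((score, idx))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:top_k]]


def alignment_score(query, doc):
    """Costlier pass: for each query sentence, take the best word-level
    alignment ratio against any document sentence, then average."""
    q_sents = [tokenize(s) for s in split_sentences(query)]
    d_sents = [tokenize(s) for s in split_sentences(doc)]
    if not q_sents or not d_sents:
        return 0.0
    best = [max(SequenceMatcher(None, q, d).ratio() for d in d_sents) for q in q_sents]
    return sum(best) / len(best)


def search(query, docs, top_k=10):
    """Gather candidates by keyword overlap, then re-rank them by alignment."""
    weights = idf_weights(docs)
    candidates = first_pass(query, docs, weights, top_k)
    return sorted(candidates, key=lambda i: (-alignment_score(query, docs[i]), i))
```

Calling `search(query, docs)` returns document indices, best match first. The point of the split is cost: the keyword pass touches every document but does only set intersections, while the alignment pass is quadratic in sentence length and so runs only on the `top_k` survivors.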
Pages: 2298-2304
Page count: 7