Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language

被引:16
|
作者
Lovis, C
Baud, RH
机构
[1] Univ Hosp Geneva, Div Med Informat, CH-1211 Geneva 4, Switzerland
[2] Puget Sound Hlth Care Syst, Seattle, WA USA
关键词
D O I
10.1136/jamia.2000.0070378
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: The authors consider the problem of exact string pattern matching using algorithms that do not require any preprocessing. To choose the most appropriate algorithm, distinctive features of the medical language must be taken into account. The characteristics of medical language are emphasized in this regard, the best algorithm of those reviewed is proposed, and detailed evaluations of time complexity for processing medical texts are provided. Design: The authors first illustrate and discuss the techniques of various string pattern-matching algorithms. Next, the source code and the behavior of representative exact string pattern-matching algorithms are presented in a comprehensive manner to promote their implementation. Detailed explanations of the use of various techniques to improve performance are given. Measurements: Real-time measures of time complexity with English medical texts are presented. They lead to results distinct from those found in the computer science literature, which are typically computed with normally distributed texts. Results: The Boyer-Moore-Horspool algorithm achieves the best overall results when used with medical texts. This algorithm usually performs at least twice as fast as the other algorithms tested. Conclusion: The time performance of exact string pattern matching can be greatly improved ii: an efficient algorithm is used. Considering the growing amount of text handled in the electronic patient record, it is worth implementing this efficient algorithm.
引用
收藏
页码:378 / 391
页数:14
相关论文
共 50 条
  • [31] OPTIMAL PARALLEL ALGORITHMS FOR MULTIDIMENSIONAL IMAGE TEMPLATE MATCHING AND PATTERN-MATCHING
    SAOUDI, A
    NIVAT, M
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 654 : 240 - 246
  • [32] A Survey of the Hybrid Exact String Matching Algorithms
    Almazroi, Abdulwahab Ali
    Shah, Asad Ali
    Almazroi, Abdulaleem Ali
    Mohammed, Fathey
    Al-Kumaim, Nabil Hasan
    ADVANCES ON INTELLIGENT INFORMATICS AND COMPUTING: HEALTH INFORMATICS, INTELLIGENT SYSTEMS, DATA SCIENCE AND SMART COMPUTING, 2022, 127 : 173 - 189
  • [33] Technology beats algorithms (in exact string matching)
    Tarhio, Jorma
    Holub, Jan
    Giaquinta, Emanuele
    SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (12): : 1877 - 1885
  • [34] Pattern-matching algorithms based on term rewrite systems
    Katoen, JP
    Nymeyer, A
    THEORETICAL COMPUTER SCIENCE, 2000, 238 (1-2) : 439 - 464
  • [35] A FAST, SENSITIVE PATTERN-MATCHING APPROACH FOR PROTEIN SEQUENCES
    ROHDE, K
    BORK, P
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1993, 9 (02): : 183 - 189
  • [36] DETERMINISTIC SAMPLING - A NEW TECHNIQUE FOR FAST PATTERN-MATCHING
    VISHKIN, U
    SIAM JOURNAL ON COMPUTING, 1991, 20 (01) : 22 - 40
  • [39] ALGORITHMS FOR MINIMIZATION OF FINITE ACYCLIC AUTOMATA AND PATTERN-MATCHING IN TERMS
    KRIVOI, SL
    CYBERNETICS, 1991, 27 (03): : 324 - 331
  • [40] Fast exact pattern matching algorithm
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    J. Comput. Inf. Syst., 2009, 1 (235-243):